Read PDF file and replace the text next to particular text and save as PDF again

Sathish_Kumar_S · March 22, 2024, 2:34am

Requirement :

Read PDF files one by one
Capture the PDF File name
3.Replace the text next to “Credit Card number” text with *****
save as PDF again (same file name)

PS: Attached Sample PDF
PDFSample1.pdf (206.0 KB)

@Shiva_Nikhil

Vikas_M · March 22, 2024, 4:52am

Hi @Sathish_Kumar_S

I’ve successfully replicated your scenario and generated the solution file. You can find it attached below.

Read PDF file and replace the text next to particular text and save as PDF again.zip (307.0 KB)

Here’s a breakdown of the steps I took to create the workflow:

Determined the page count of the PDF.
Split the PDF into individual pages and processed each one separately.
Used regular expressions to replace credit card number with “*****”
Saved the modified content as a text document.
Used Word Activities to convert the text document back into a PDF format.
Merged all the individual PDF pages back together.
Stored the final result in the OUTPUT folder.

Hope it helps you out!

sanjay3 · March 22, 2024, 6:01am

Hi @Sathish_Kumar_S

Sequence
PDFFilePath = “path_to_your_PDF_file”
PDFContent = Read PDF Text (PDFFilePath)
FileNameWithoutExtension = Path.GetFileNameWithoutExtension(PDFFilePath)
ModifiedContent = Regex.Replace(PDFContent, “(?<=Credit Card number:\s*)(\d{4}\s\d{4}\s\d{4}\s\d{4})”, “**** **** **** ****”)
Write Text File (FileNameWithoutExtension + “_Modified.pdf”, ModifiedContent)

Sathish_Kumar_S · March 22, 2024, 7:33am

Sorry the input pdf file is Scanned PDF… Is this logic will work?

Sathish_Kumar_S · March 22, 2024, 7:56am

I am able to read scanned pdf using read pdf ocr… but the format is changing after saved as text file.

Is it possible to keep the output PDF as same as input pdf file format?

Vikas_M · March 22, 2024, 9:58am

Hi @Sathish_Kumar_S

Yes you can implement it using "Read PDF with OCR " activity

Please find the below workflow updated according to your requirement
Read PDF file and replace the text next to particular text and save as PDF again.zip (429.2 KB)

Hope it helps you out!

system · March 25, 2024, 9:58am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Get and Replace Text From PDF To Word File Studio uiautomation	1	95	May 20, 2024
How read text date on PDF document for new rename files Activities uiautomation , pdf , activities , studio , studiox , question , rename	1	973	April 22, 2022
Regex to find the particular word and replace all the words next to it with ***** Studio uiautomation	6	194	March 22, 2024
How to read from multiple PDF files Help uiautomation , studio	1	1448	January 29, 2018
Pdf automation with font style Studio uiautomation	1	784	March 4, 2022

Read PDF file and replace the text next to particular text and save as PDF again

Related topics