Non-searchable PDF to searchable PDF without the use of 3rd party app

MTS · June 24, 2019, 9:32am

Hi everyone,

One of the steps in a new business process we are about to automate is the conversion of a non-searchable PDF to a searchable one. Is there a way to do this without any third-party application (like Adobe Acrobat Reader DC)? My first guess was to use the OCR activity, but this returns a string, which I cannot export to a PDF. We already experimented with Acrobat DC, but it does not work well in combination with UiPath (the same issues have already been described on this forum).

Thanks a lot!

Serena · August 3, 2019, 4:54pm

This custom activity package can help you convert a non-searchable PDF into a searchable one:
https://connect.uipath.com/community/project/pua-virtual-acrobat-dc-pdf-activities.

The prerequisite is that you need to have Acrobat Pro DC installed on your Robot machine.

MTS · August 20, 2019, 1:11pm

Thank you Serena. How can I install this in UiPath? I don’t see the package.

Serena · August 24, 2019, 4:40am

You can download the package from here:
https://github.com/s3r3n3/PDF_Activties

sonaliaggarwal47 · April 19, 2021, 6:20pm

Hi @MTS,

To make your PDF searchable using UiPath, please follow the steps below:

1. Read the PDF with the Read PDF With OCR activity.
2. Save the text extracted by this activity to a variable.
3. Add an Invoke Code activity.
4. Write the C# code below to place the text extracted from the scanned PDF into the PDF's "Keywords" metadata field. Once done, the PDF can be found by searching on the keywords stored in that field (this sets document metadata only; it does not add a text layer to the pages).

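// pdfText is the variable that holds the text extracted in step 1.
string path = "";  // folder path (left blank in the original post)
PdfReader reader = new PdfReader(path + "");  // input PDF file name goes here
// Write to a different file than the one being read.
PdfStamper stamper = new PdfStamper(reader, new FileStream(path + "", FileMode.Create));  // output PDF file name goes here
var info = reader.Info;  // existing document metadata
info["Keywords"] = pdfText;
stamper.MoreInfo = info;
stamper.FormFlattening = true;
stamper.Close();  // writes and closes the output file
reader.Close();
insertedWordCount = info["Keywords"].Length;  // number of characters stored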

You will also need to import the namespaces iTextSharp.text.pdf and iTextSharp.text.xml.xmp, plus System.IO for FileStream.
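
The code above stores the whole OCR output in the Keywords field. If that turns out to be too long or too noisy for a metadata value, one option is to reduce it to a list of distinct words first. Below is a minimal sketch of that idea, not part of the original answer: KeywordBuilder, Build, and the maxLength parameter are hypothetical names, and skipping words shorter than three characters is an arbitrary assumption.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Hypothetical helper; not part of UiPath or iTextSharp.
public static class KeywordBuilder
{
    // Turns raw OCR text into a comma-separated list of distinct words,
    // keeping first-seen order and never exceeding maxLength characters.
    public static string Build(string ocrText, int maxLength)
    {
        var seen = new HashSet<string>(StringComparer.OrdinalIgnoreCase);
        var kept = new List<string>();
        int length = 0;

        // \w+ matches runs of letters, digits and underscores.
        foreach (Match m in Regex.Matches(ocrText ?? "", @"\w+"))
        {
            string word = m.Value;

            // Assumption: skip very short words and case-insensitive repeats.
            if (word.Length < 3 || !seen.Add(word)) continue;

            // Account for the ", " separator before every word but the first.
            int added = word.Length + (kept.Count > 0 ? 2 : 0);
            if (length + added > maxLength) break;

            kept.Add(word);
            length += added;
        }

        return string.Join(", ", kept);
    }
}
```

With such a helper available to the workflow (for example, compiled into a custom library), the assignment in the Invoke Code step could become info["Keywords"] = KeywordBuilder.Build(pdfText, 2000); where the limit of 2000 is only an illustrative value.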

Hope this helps.

Regards
Sonali
