Extract PDF without OCR

monikanimbalkar · January 14, 2020, 5:01am

I am trying to extract data from multiple PDFs without using the OCR activity.
Please refer to the screenshot of the PDF and the workflow.
Please help me.
Thank you.

ashley11 · January 14, 2020, 5:38am

If your PDF is digitized (text-based, not scanned), you can install UiPath.PDF.Activities and use the Read PDF Text activity.
This returns the text of the PDF.
After that, you can use RegEx to extract specific data.
If your PDF is a scanned document, you have to use OCR.
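
To illustrate the RegEx step, here is a minimal sketch in Python (not UiPath/VB.NET). It assumes the PDF text is already in a string and that each field sits on one line as `Label: value`. The sample text and the helper name `extract_field` are made up for illustration and are not taken from the attached PDF.

```python
import re

# Hypothetical text standing in for the string returned by a PDF text reader.
sample_text = """Voucher No: 123
Dated: 14-Jan-2020
"""

def extract_field(text, label):
    """Return the value that follows `label` (optional colon), or None if absent."""
    # re.escape makes the label match literally; (.+) captures up to the end of the line.
    pattern = re.escape(label) + r"\s*:?\s*(.+)"
    match = re.search(pattern, text)
    return match.group(1).strip() if match else None

voucher_no = extract_field(sample_text, "Voucher No")
dated = extract_field(sample_text, "Dated")
print(voucher_no, dated)
```

The same pattern idea should carry over to a regex-based activity, but the real layout of your PDF text may need a different pattern, so check the extracted text first.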

Vivek.A.S · January 14, 2020, 6:31am

Hey @monikanimbalkar, did you find a solution?

monikanimbalkar · January 14, 2020, 6:36am

No. I am trying the RegEx activity.

Palaniyappan · January 14, 2020, 6:37am

Yes.
We can use the normal Read PDF Text activity, store the output in a variable of type String, and then use either Regex or the Split method to get the string we want.
Cheers @monikanimbalkar
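
As a rough sketch of the Split approach, written in Python rather than UiPath/VB.NET: split the text on the label before the value, then split the remainder on the label after it. The helper name `text_between` and the sample text are hypothetical and only show the idea.

```python
def text_between(text, start_label, end_label):
    """Return the text between two labels using split, or None if either label is missing."""
    parts = text.split(start_label, 1)
    if len(parts) < 2:
        return None  # start label not found
    rest = parts[1].split(end_label, 1)
    if len(rest) < 2:
        return None  # end label not found after the start label
    # Trim the colon and whitespace left around the value.
    return rest[0].strip(" :\n\r\t")

# Hypothetical text standing in for the Read PDF Text output.
sample_text = "Invoice To: Example Buyer Ltd\nDespatch To: Example Site\nVoucher No: 123\n"
invoice_to = text_between(sample_text, "Invoice To", "Despatch To")
print(invoice_to)
```

Split works well when the labels are fixed and always appear in the same order; Regex is usually easier when the value has a recognizable shape, such as a date or a number.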

Vivek.A.S · January 14, 2020, 6:38am

Which fields do you want from the PDF? Tell me and I will help you. If possible, share the PDF.

monikanimbalkar · January 14, 2020, 6:50am

1) Invoice To
2) Despatch To
3) Voucher No
4) Dated
5) Description of Goods
6) Quantity

P.o.no-15366 for our Ambavadi SRA-02 Project…pdf (24.8 KB)

Vivek.A.S · January 14, 2020, 9:05am

Hi @monikanimbalkar,

Run the attached file and refer to the output screenshot below.

P.o.no-15366 for our Ambavadi SRA-02 Project…pdf (24.8 KB) PDF_Output.txt (1.5 KB) READING PDF.xaml (6.4 KB)

Vivek.A.S · January 14, 2020, 9:35am

In the Activities panel, type PDF. If it is found, use the Read PDF Text activity; otherwise, click **Search in available packages** and install the UiPath.PDF.Activities package. Please refer to the screenshot:

monikanimbalkar · January 14, 2020, 11:04am

Thanks @Vivek.A.S

Can you explain a bit about the regex that you used in the workflow for Voucher No, Dated, and Description of Goods?
