PDF extraction

Shashi123 · August 16, 2018, 10:36am

I am trying to extract data from a PDF: Sample Invoice B.pdf (365.4 KB)

The same file is used in the UiPath Academy videos. In the videos, the PDF data is extracted using Get Text, and selectors are generated. But when I try it, no selectors are generated. The only option I have is OCR.

PetrBaudis · August 29, 2018, 12:17am

Hi @Shashi123! You bring up a good point: the tutorial will not work with OCR out of the box. You need to run your PDFs through an OCR tool that adds a text layer, such as Tesseract or ABBYY. Even then, reliable selectors will still be difficult to create.
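
To check whether a given PDF already has a text layer before sending it through OCR, something like the following rough Python sketch could help. It is only an illustration: the function names and the 20-character threshold are my own assumptions, and `read_page_texts` relies on the third-party `pypdf` package, which is not part of UiPath.

```python
from typing import List


def pages_needing_ocr(page_texts: List[str], min_chars: int = 20) -> List[int]:
    """Return 0-based indexes of pages whose extracted text is (nearly) empty.

    The min_chars threshold is an arbitrary assumption; tune it for your files.
    """
    return [
        index
        for index, text in enumerate(page_texts)
        if len((text or "").strip()) < min_chars
    ]


def read_page_texts(pdf_path: str) -> List[str]:
    """Read the existing text layer of every page (needs `pip install pypdf`)."""
    # Imported here so pages_needing_ocr() works without any extra packages.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    return [page.extract_text() or "" for page in reader.pages]
```

Calling `pages_needing_ocr(read_page_texts("Sample Invoice B.pdf"))` returns the pages with little or no text. An empty list suggests the file already has a text layer, so the missing selectors probably have another cause than a missing text layer.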

A good alternative to OCR is a cognitive data capture service. It requires no explicit rule setup and instead uses AI to identify the information, which suits this case much better. There are a couple of them; here is one tutorial for UiPath to take a look at: Data extraction from invoices - Rossum and UiPath | Rossum

CBlanchard · August 29, 2018, 12:29am

Hi Shashi,

Try here: Selecting PDF Elements - #4

Some settings may need to be changed in Adobe Reader. It’ll work once you follow the solution there.
