Extract number from pdf

tobschroer · June 25, 2018, 10:00am

Hi all,
i have to extract a specific number from PDFs that all have the same layout.
However the PDFs have no specific UI elements. So what is the best option to extract the PDF?

Thanks
Tobi

Rishabh_Lakhera · June 25, 2018, 10:01am

Hey @tobschroer

You can use get OCR text or Screen scraping

Refer to these links:

tobschroer · June 25, 2018, 10:05am

Thanks @Rishabh_Lakhera.
I have tried it with the get text Action. But my message box is always empty.

Rishabh_Lakhera · June 25, 2018, 10:08am

Try Screen Scraping and play around with the ocr,fulltext and native options!
Always works for me

tobschroer · June 25, 2018, 11:13am

@Rishabh_Lakhera . It´s working right now. But only once. When the PDF isn´t open in the background a workflow exception appears and if i close the PDF, open it a second time and then start the workflow i get totally other Outputs in my messsage box. Do you have any idea?
Thanks

Sourav_Anand · June 25, 2018, 12:06pm

If the PDF layout is fixed, We can use get PDF text to get all PDF content into a string variable. Can try using string manipulation for extracting the required Number.

PAD · June 25, 2018, 12:22pm

Hi @tobschroer,
Perhaps try to change the scale in your Screen Scraper Wizard - e.g. when I need to OCR scrape a more complex text (that includes also capital letters and dots in dates), I increase the scale to 5. Does this solution help?

Rishabh_Lakhera · June 25, 2018, 1:15pm

Check this out

Topic		Replies	Views
Text aus Bild Studio pdf , question , pdf-extraction	6	145	May 7, 2026
How to get only numbers from PDF file? Help pdf , ocr , activities	8	12883	May 8, 2018
Extract number from pdf without elements to click on Help	8	1617	November 15, 2018
Unable to extract specific elements & Selector doesn't show the elements I need Help activities , studio	7	3529	June 1, 2019
Pdf data extraction for specific element Help pdf , activities , question	6	1852	April 17, 2021

Extract number from pdf

Related topics