Relative Scraping

erva · January 17, 2018, 6:42pm

Hi,

I have a problem when trying to get values from PDF document. When trying to use Get Text activity, whole PDF document gets selected (blue), so theres no elements to select for numbers.

I have used Citrix recordings Relative scraping. It works fine, but in PDF document is a table, and the numbers what i want are at below table. If table rows remain same, getting value works fine. But if number of table rows change, it gets values from wrong place, from table. So Get OCR text activity doesn’t sync with Find Image activitys coordinates. I have attached pic from PDF. I use Find Image activitys “image” words"KELA tilausva…4,5%" from left side and the value where Get OCR is pointed is on right side “-2.03”

Set Clipping Region activity has Direction set to “TRANSLATE” and Size is (-957, 0, 832, 0)

In picture table has only one row but other reports might have them 10 or more…

-mikko

Dave · January 17, 2018, 7:20pm

If at all possible, try to get the report in a different format as that’d be your best course of action.

Since it isn’t recognizing the table, have you tried using the ‘Read PDF Text’ activity? If that doesn’t work, then have you tried the ‘Read PDF With OCR’ activity?

Both of these will output a single string of the PDF (or portion of the PDF, if you change the input range). Then you can use string manipulation to find your value.

You’ll definitely need to play around with the settings if you need to use OCR, & it won’t be 100% accurate. The text looks pretty standardized though, so it might end up ok as long as the alternating white/gray doesn’t screw things up.

erva · January 17, 2018, 8:19pm

Got it working. Recorded it again and it started working

I tried Read PDF Text and Read PDF with OCR activity. I think that it might be better solution than messing up with Acrobat. Just need to study string manipulation, maybe in UiPath forum has some threads about it.

Thanks Dave for help!

-mikko

Topic		Replies	Views
Values from pdf Help pdf , activities	10	2176	November 6, 2017
Read Text from Specific Region Activities pdf , activities , question	7	1003	November 14, 2022
Retrievinbg information from a acanned pdf Help activities , data_scraping	19	2170	August 18, 2017
Unable to fetch the value from the PDF Activities pdf	12	1020	May 8, 2022
Using Get Text Activity On Reading a PDF does not extract the real information from PDF Help activities	2	909	April 6, 2019

Relative Scraping

Related topics