Retrievinbg information from a acanned pdf

scraping
activities

#1

Hello eveery body,

i’m trying to get informain from a scanned pdf 'Total TTC= 300.500.
i had tryed read pdf wirh OCR, creen scapping but it did not work:

please find attached a screenshot of the pdf.


#2

Hi,

I would say one of the simplest options is to use Citrix recorder -> scrape relative -> capture the bottom right corner -> Indicate in a little larger area than the total appears.

Works fine from my side using your test image.
OCR has much better results in smaller areas.

Cheers


#3

Thank you Florent,

it works now, i want to use start process to befor the scrape relative in order to open the pdf, but it crashes there is a message about the clipping area:
Translate clipping region : The target Element was not specified for this activity. You should set its Target property or use this activity inside of a scope activity (Attach Browser, Open Browser, Open Application, Attach Window, Get Active Window, Element Scope activities).


#4

Hi,
Create a shortcut key for pdf then use send hotkey activity.


#5

How can I use scrape relative for two different scanned pdf’s with same format??..it works for 1pdf bt not the other…


#6

Yep it works if both are same format.
What’s the error your getting?


#7

Its an selector not found exception…!
how could i solve this…?


#8

check with UIexplorer that those selectors are valid or not.


#9

Hi,
Make sure use wildcard * for dynamic attribute.
Use UiExplorer to find out the most reliable selector.
Try to add parent class,id,etc avoid idx if more than 2, aaname if dynamic.
Could you please attach selector screenshot.


#10

Attach Window ‘082620161 Applicati’ : Cannot find the UI element corresponding to this selector:


#11

Can’t say much could you please paste your attach window selector attribute.


#12

#13

#14

I don’t see any.
Upload screen shot of selector else use preformatted text option while paste html tag.
preformatted text


#15

<wnd app='applicationframehost.exe' cls='ApplicationFrameWindow' title='*.pdf and 1 more page ‎- Microsoft Edge' />Preformatted text


#16

is it still not available?


#17

Nope.


#18

Should be now.
Bb code tags work too.


#19

Is that complete selector ?
are you opening pdf in Edge?


#20

Yes…!!