Scraping different kinds of images

mario · August 28, 2017, 8:13pm

Has anyone experience with scraping different invoices (images) and then setting the rules to read varibales from it? The problem is that there are 15 different kinds of invoices and these variables are on different areas on each invoice. Is there a way how the robot recognizes on which kind of invoice it is? Lets say based on what it looks like. Or the only way is to identify key words?

Ganga_Bharani · August 29, 2017, 10:29am

I maybe wrong. But just give this a try.

for every type of PDF that is visually similar, you can have a unique identifier (which can be the same image at different locations or completely different images in each or texts)
For an example, if we have two types. In one, we have the logo at the left top and the other right top. Here we can find image-> imagefound.GetAbsolutePosition() can be used as parameters to differentiate between the two PDF types.

mario · August 29, 2017, 11:13am

Thank you very much.

The problem is that there is only text on them. But the form is very different.

Ganga_Bharani · August 29, 2017, 11:46am

Did you try using unique form element as an identifier?

mario · August 29, 2017, 12:18pm

No I have not. WHat is that?

Cosin · August 31, 2017, 12:22pm

I’m pretty sure thare are parts of the invoices that are similar enough to find the “type”. But it’s very hard to tell without examples. Could you upload 2-3 examples of a single type of invoice?

Ganga_Bharani · August 31, 2017, 12:35pm

Unique form element can be anything that is unique to a particular form.
The first question in one type can be name and it can be the unique identifier.
In another type the first question could be Age.

So you will have to analyse the PDFs and first find an identifier before trying to automate.

Topic		Replies	Views
Finding elements from the multiple PDFs and perform activities depending upon elements Help	15	1262	April 13, 2019
OCR Invoices data extraction and analysis Help	4	1115	July 17, 2019
Invoice comparison Help studio , question	13	1124	December 18, 2020
Read different invoices - get specific values Help	2	2005	April 8, 2019
Extract data with different Names Studio studio	6	1116	August 18, 2020

Most Active Users - Yesterday
Anil_G
ashokkarale
jinal.shah
Gautham_Pattabiraman
postwick
chandreshsinh.jadeja
vrdabberu
Ajay_Mishra
sven.wullum1
Vyshnavi_Nalumachu
More details...

Scraping different kinds of images

Related Topics