I am trying to: 1. Read the PDF invoice files one by one (by scraping). 2. Check the extracted invoice number in the Excel sheet. 3. After finding an exact match, read that row from start to end. 4. Move on to the next PDF, open it, read its invoice number, and so on…
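For anyone following along outside UiPath, here is a rough Python sketch of steps 2 and 3 only (look up the invoice number and return the whole matching row). It assumes the sheet has already been loaded into a list of dictionaries and that the invoice numbers have already been extracted from the PDFs. The column name "InvoiceNo", the function name find_invoice_row and the sample data are made up for illustration.

```python
from typing import Dict, List, Optional


def find_invoice_row(rows: List[Dict[str, str]],
                     invoice_no: str,
                     column: str = "InvoiceNo") -> Optional[Dict[str, str]]:
    """Return the first row whose invoice column exactly matches invoice_no."""
    wanted = invoice_no.strip()
    for row in rows:
        # Compare as trimmed strings so stray spaces do not break the match.
        if str(row.get(column, "")).strip() == wanted:
            return row
    return None


# Hypothetical sheet contents, standing in for the Excel file.
sheet_rows = [
    {"InvoiceNo": "INV-1001", "Vendor": "Acme", "Amount": "250.00"},
    {"InvoiceNo": "INV-1002", "Vendor": "Globex", "Amount": "99.50"},
]

# Hypothetical invoice numbers, one per PDF, in the order the PDFs are read.
extracted_numbers = ["INV-1002", "INV-1001"]

# One lookup per PDF; a missing invoice gives None instead of a row.
matches = [find_invoice_row(sheet_rows, no) for no in extracted_numbers]
```

In the real workflow the rows would come from a Read Range on the Excel file, and the loop would run once per PDF.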
Actually, PDF automation itself is done in the background.
In the scenario you just described, you would always have to keep open the particular PDF you want to extract data from.
It may also fail if the window is small, because the Invoice No part gets hidden.
So in such a scenario it may fail.
Hope this helps solve your query.
If it does, please mark it as the solution and like it.
Hello @Pratik_Wavhal: First, thanks a lot for your reply.
No, the PDFs should open one by one (when you run the workflow, start the process, etc.): open the first PDF and do the comparison in Excel, then open the second PDF and check it in Excel. I only need to do this for 2 PDF files.
I tried the steps you mentioned many times. But for the PDF, the selector is not working for the Get Text activity. The selector comes out static, so it only works for one PDF in Acrobat Reader. UI Explorer does not show any tags that would make it dynamic.
Even if you do it with OCR, in the end you will still have to use regex to extract the Invoice No.
So it is best to do it with the Read PDF activity.
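To show what the regex step looks like once the PDF text has been read, here is a minimal Python sketch. It assumes the extracted text contains a label such as "Invoice No", "Invoice Number" or "Invoice #" followed by the value. That layout, the pattern and the function name extract_invoice_no are assumptions for illustration, so adjust the pattern to your own invoices.

```python
import re
from typing import Optional

# Assumed layout: a label like "Invoice No:", "Invoice Number" or "Invoice #"
# followed by an alphanumeric value that may contain "-" or "/".
INVOICE_NO_PATTERN = re.compile(
    r"Invoice\s*(?:Number\b|No\b\.?|#)\s*[:\-]?\s*([A-Za-z0-9][A-Za-z0-9/\-]*)",
    re.IGNORECASE,
)


def extract_invoice_no(pdf_text: str) -> Optional[str]:
    """Return the invoice number found in the text, or None if there is none."""
    match = INVOICE_NO_PATTERN.search(pdf_text)
    return match.group(1) if match else None


# Hypothetical text, standing in for the output of a PDF text reader.
sample_text = "ACME Corp\nInvoice No: INV-1002   Date: 12/03/2019\nTotal: 99.50"
invoice_no = extract_invoice_no(sample_text)
```

The same pattern idea can be used in a Matches or Assign activity in UiPath, since the text comes out as a plain string either way.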
One thing you can do: if you just want to show the invoice opening as a PDF, use the Start Process activity for that, but do the actual search for the Invoice No with the method I have already implemented.
Hope this helps solve your query.
If it does, please mark it as the solution and like it.
First of all, thanks for providing the selector; it works 100% perfectly.
May I know where you got the extra tag from, or how you knew which tag had to be there to get the exact position of the Invoice No in the invoice PDF?
Below are the working files, which meet all your requirements. Just have a look at them. temp.zip (64.9 KB)
So you can mark this as the solution now and like it.
@Pratik_Wavhal Invoice documents are structured documents, correct?
And the PDF files you provided are not scanned copies, so we can use the Get Text activity to capture the text. When you use Get Text and click on the invoice number, you get the selector. When you open a new invoice, the position of the invoice number is the same but the number itself changes, so he put * in its place, like ctrl name='*'.
But the selector that @sachinsm gave me afterwards includes idx='2', which I was not getting in my UI Explorer. That is the only tag I am talking about.
I was getting the rest of the selector too, and I also used * as a wildcard to make it dynamic. But when I validated that dynamic selector without idx='2', it pointed to the date that sits beside the Invoice No. So that was wrong.
That's why I asked how @sachinsm knew that idx='2' should be included in the selector to make it work perfectly.
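To picture why the wildcard alone landed on the date, here is a toy Python model. It is not how UiPath resolves selectors internally; it only assumes that a wildcard can match several elements, that the first match wins by default, and that idx picks the n-th match counting from 1. The element list, its order and the function name find_element are made up for illustration.

```python
from fnmatch import fnmatchcase
from typing import Dict, List, Optional


def find_element(elements: List[Dict[str, str]],
                 name_pattern: str,
                 idx: int = 1) -> Optional[Dict[str, str]]:
    """Return the idx-th element (1-based) whose name matches the wildcard."""
    matches = [e for e in elements if fnmatchcase(e["name"], name_pattern)]
    if 1 <= idx <= len(matches):
        return matches[idx - 1]
    return None


# Hypothetical text elements of one invoice page, in document order.
page_elements = [
    {"role": "text", "name": "12/03/2019"},  # the date beside the Invoice No
    {"role": "text", "name": "INV-1002"},    # the invoice number itself
]

# name='*' matches both elements, so the default (first match) is the date.
first_match = find_element(page_elements, "*")

# Adding idx=2 disambiguates and selects the invoice number instead.
second_match = find_element(page_elements, "*", idx=2)
```

So once the name is replaced with *, something else (here idx) has to tell the two text elements apart.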
Btw, thanks bro for explaining the useful stuff again.
Yeah, that's what I want to know: if a tag is not shown, on what basis do we figure out which tag must be there to get the value we want to extract?
If you know, please let me know.