Getting Specific texts using OCR from Invoice

manojj.yadav · May 16, 2022, 9:44am

Hi, I need to get a text from an invoice pdf using OCR activity.
The OCR method completely scraps the entire page, But I need a specific text.
here is the invoice -
Centrix_try1.pdf (303.6 KB)

Is there a method to get the specific text (similar to the anchor base)?
Pressing F3 and selecting the area works only when the text is in the same place in all pdfs, the other option I have is using regex (I don’t know regex yet), so is there any other method? Thank you.

Rahul_Unnikrishnan · May 16, 2022, 9:49am

Hello @manojj.yadav ,

This can be done in different methods.

Using regex- Use Read PDF activity and get the datat to a string. Then use regex to extract it. This will be helpful if you need to extract from a set of pdf and the position is getting changed.
Open the pdf and use Get Text activity- Here you need to open the pdf with a pdf reader and need to extract the values. You can use CV also here.
Document understanding- iF your document is not structered prefering to go with this method. There are predefined ML mdoels available in Uipath to extract the values. You can train the model.

Please confirm which are the field that you want to extract and is the position of the label remains static.

manojj.yadav · May 16, 2022, 9:51am

So if the positions are not static, the other two options are ML model or regex only?

Rahul_Unnikrishnan · May 16, 2022, 9:54am

@manojj.yadav else using Get Text you have to give proper anchor.

For eg: if there is label called Name and you want to extract the Name value. Then in the selector of Get Text you need to add the anchor to Name label.

Shyam_Pragash · May 17, 2022, 12:13pm

Hi @manojj.yadav

use Get text activity and use split to extract the account
as it as follows other also.

Thank
Shyam

system · October 14, 2022, 6:15am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How to extract data from pdf Help selector , uiautomation , activities , question	10	1120	February 5, 2020
Pdf data extraction for specific element Help pdf , activities , question	6	1759	April 17, 2021
Read Specific Data From PDF Help	19	2405	September 24, 2019
How to get desired text from a document Studio uiautomation	33	475	October 30, 2023
How to get the specific data from the pdf using ocr Help studio	10	5408	June 1, 2019

Most Active Users - Yesterday
prashant1603765
yedukondaluaregala
ashokkarale
sharazkm32
mively
sonaliaggarwal47
VanjaV
pikorpa
singh_sumit
David_Hernandez2
More details...

Getting Specific texts using OCR from Invoice

Related topics