How to extract data from a digitized PDF

Smitesh_Aher2 · March 28, 2025, 6:25am

Hi Team,

I want to extract data from a digitized PDF. Please advise which activity extracts the data correctly.
I have already tried Tesseract OCR, OmniPage OCR, and the Read PDF Text activity, but none of them extracted the data properly.
Please let me know which other activity I can use to extract the data properly.

Binit_Sarkar · March 28, 2025, 6:41am

Hi @Smitesh_Aher2
The approach you are using should technically have worked.
Could you also try just Read PDF Text, without any OCR engine?
See if that helps.
After the extraction, you can probably use regex to get what you want.
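
The regex step mentioned above could look roughly like the sketch below. It is written in Python only for illustration. The sample text, the field labels, and the helper name extract_fields are all hypothetical; they stand in for whatever Read PDF Text actually returns for your document.

```python
import re

# Hypothetical text, standing in for the string returned by Read PDF Text.
extracted_text = """Invoice Number: INV-2025-0042
Invoice Date: 28/03/2025
Total Amount: 1,250.75
"""

# Hypothetical field labels; adjust each pattern to the real document layout.
patterns = {
    "invoice_number": r"Invoice Number:\s*(\S+)",
    "invoice_date": r"Invoice Date:\s*(\d{2}/\d{2}/\d{4})",
    "total_amount": r"Total Amount:\s*([\d,]+\.\d{2})",
}


def extract_fields(text, patterns):
    """Return the first captured group for each field, or None if no match."""
    result = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, text)
        result[name] = match.group(1) if match else None
    return result


fields = extract_fields(extracted_text, patterns)
print(fields)
```

Inside UiPath, patterns like these can be applied with the Matches activity or with System.Text.RegularExpressions.Regex.Match in an Assign. The patterns in the sketch use only syntax that Python and .NET share, but test them against your own extracted text.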

If this still does not help, please attach an informative screenshot so we can help you further.

Hope this helps.
Please mark it as a solution if it resolves your issue.
Happy Learning

prashant1603765 · March 28, 2025, 6:42am

Hi @Smitesh_Aher2

Please use Document Understanding:

Please follow the link,

If you found this helpful, mark it as a solution.
Happy Automation

singh_sumit · March 28, 2025, 6:44am

Hey @Smitesh_Aher2, try the Document Understanding method. Can you also share a sample of the document? That way we can help you more.

Prashanth_D · March 28, 2025, 6:48am

Hi @Smitesh_Aher2,

If Read PDF Text on a native (digital) document doesn’t give you proper output, you can also try UI-based automation on the PDF.

Refer to the following YouTube Video - https://www.youtube.com/watch?v=AetgInrwM1s

Cheers!
