Extracting Multiple Text from a PDF

Muhammad_Anas_Baloch · January 13, 2024, 11:59am

Hello UiPath Community ,

I hope you’re all doing well! I’ve got a bit of a challenge and could use some guidance. I need to extract multiple texts from PDF files and store them neatly in an Excel sheet.

Are there any seasoned automators who can share some wisdom or point me in the right direction? Your expertise would be a game-changer for me!

Thanks a bunch in advance!

vrdabberu · January 13, 2024, 12:32pm

Hi @Muhammad_Anas_Baloch

->Use Read PDF with OCR if it’s an scanned copy files and If they are normal PDF’s then you can use the Read PDF Text activity.
->Use the Matches activity and pass the Regex Expression and the output of the Read PDF activity so that you need to get the LineItems.
->Use the for each activity and iterate through each LineItem and use the regex expression on the CurrentItem.
->After Extracting the data by using the Regex Expressions, you can store the extracted data in respective variables.
->You can use the write cell activity and then you can send the extracted data into respective cell’s.

Regards

mkankatala · January 13, 2024, 12:32pm

Hi @Muhammad_Anas_Baloch

→ For extracting the text from the PDF’s, you have to use the Read PDF text activity for the structured documents and read pdf with OCR activity for the unstructured documents.
→ After that the text is stored in a string datatype variable, use the regular expressions to extract the required text.
→ After that use the Excel activities to write the extracted text to excel.

Or

If you are interested in document understanding. You can use the document understanding and AI center to extract the required data.

Hope it helps!!

Topic		Replies	Views
How to extract multiple data from PDF Academic Alliance question	28	5590	August 22, 2020
How to extract multiple text details and table info from PDF file Studio	6	480	October 31, 2023
Read PDF and Extract Text to Excel using Regex Activities excel , pdf , question	6	93	October 1, 2024
Extract Specific text from multiple Pdf's Studio studio , question , activities_panel	4	540	November 21, 2023
How can you extract data from many pdfs into a excel Activities excel , pdf , question , document_understanding	2	1360	March 7, 2022

Most Active Users - Yesterday
prashant1603765
yedukondaluaregala
ashokkarale
sharazkm32
mively
sonaliaggarwal47
VanjaV
pikorpa
singh_sumit
David_Hernandez2
More details...

Extracting Multiple Text from a PDF

Related topics