NEED HELP Invoice's Extraction

krishna_priya1 · January 5, 2023, 12:50pm

Hi All
I’m working on Invoice extraction in multiple type of invoices
in this process I’m want to extract the details in PDF is (Description, Price, Quantity)
but in many PDF there is a alternative word are there in the PDF (Example: for Quantity - qty - qte )

So, I can’t extract the quantity in pdf

THIS R MY WORKFLOW

This r My input PDF files

**

This is the output I Got ********* below

Kaviyarasu_N · January 5, 2023, 12:54pm

Hi @krishna_priya1
To extract details such as the description, price, and quantity from a PDF invoice, you can use the

Read PDF with OCR activity in UiPath to extract the text from the PDF
Use the Find Text Position activity to locate the specific words or phrases you are looking for.

Regards,
Kaviyarasu N

krishna_priya1 · January 5, 2023, 1:00pm

If I can use the Read PDF with OCR I can Extract the details from one type of Invoice only,
Actually
this pdf I just create from to post this image in the UiPath forum I can’t share my Invoice’s,
Actually my work is to extract the data from the different type of Invoice’s

for example in PDF’s (I want to extract the Quantity value QTY, qte, quantite) so I can’t extract the exact value of the quantity by use the Document understanding.

krishna_priya1 · January 9, 2023, 6:16am

Hi anyone help me clear this error…

Anil_G · January 9, 2023, 6:49am

@krishna_priya1

If all the column will have data and if the column positions are same…then you can use read pdf as text and then use split activity to get the table text then pass the text to generate datatable and give the column and row separators and as the column places are fixed you can use the index instead of name

To Extract only the table text try identifying a static string just before table and after table in the pdf

cheers

krishna_priya1 · January 9, 2023, 12:29pm

Bro actually my input file’s is different Above mention file are not my input file these file I just create for Posting Question in UiPath forum.

Actually in my input file’s
I need to extract the Description, Prices, and Quantity

but the issue is I can’t extract the Quantity value bczzzz
IN some INVOIVES’S Quantity replace QTY, QTE, QUANTITE so I can’t extract the value

LIKE…

PLZ can you help me clear this issue’s

This are my Taxonomy Manager screenshots

Anil_G · January 9, 2023, 12:51pm

@krishna_priya1

Even giving multiple possible values is not taking the correct output?

Cheers

krishna_priya1 · January 9, 2023, 1:00pm

YES BRO

Result screenshot

can’t extract the Quantity value

s0biesky · January 9, 2023, 1:09pm

Hi @krishna_priya1, use read pdf, save the output in a txt file, paste the content in regexstorm.net and start making the regex patterns. Working with tables isn’t always easy, you may need to remove the “junk” from the invoice before extracting the actual relevant data.

Anil_G · January 9, 2023, 1:15pm

@krishna_priya1

Try building different documents or else…and check which is present and use switch case to extract from different quantity columns…that might work as well

Cheers

krishna_priya1 · January 10, 2023, 4:53am

Is there any other option to extract the Quantity…

krishna_priya1 · January 10, 2023, 5:53am

Bro can you plz Explain where I use Switch case for Quantity in my workflow
above I posted my workflow Images

Anil_G · January 10, 2023, 6:50am

@krishna_priya1

In thw workflow before sending it to extraction use a switch case and before that read the pdf and check if it has quantity or qte and seggregate the document with switch case and use multiple extractions each for one type

Cheers

Topic		Replies	Views
NEED HELP : Invoice's Extraction Studio studio , question	2	497	January 5, 2023
Read PDF text, write specific information Help pdf , activities , regex , question , data_manipulation	3	894	November 19, 2019
Extract PDF Invoice Studio	1	812	April 27, 2020
Invoice Data Extraction Help excel , uiautomation , studio	12	4720	December 3, 2018
Different Vendor Invoices Studio datatable , excel , pdf , activities , studio , question , word , pdf-extraction , emailtopdf	5	727	March 24, 2023

NEED HELP Invoice's Extraction

Related topics