Need help with invoice extraction

Hi all,
I’m working on extracting data from several types of invoices.
I want to extract the Description, Price, and Quantity fields from each PDF,
but many PDFs use an alternative label for the same field (for example, Quantity appears as qty or qte).

So I can’t extract the quantity from those PDFs.

This is my workflow:




These are my input PDF files:


This is the output I got:

Hi @krishna_priya1
To extract details such as the description, price, and quantity from a PDF invoice, you can do the following:

  1. Use the Read PDF with OCR activity in UiPath to extract the text from the PDF.
  2. Use the Find Text Position activity to locate the specific words or phrases you are looking for.

Regards,
Kaviyarasu N

If I use Read PDF with OCR, I can extract the details from only one type of invoice.
The PDFs above are ones I created just to post images in the UiPath forum, because I can’t share my real invoices.
My actual task is to extract data from different types of invoices.

For example, the Quantity value appears under QTY, qte, or quantite depending on the PDF, so I can’t extract the exact quantity value using Document Understanding.

Can anyone help me resolve this issue?

@krishna_priya1

If every column has data and the column positions are the same in every document, you can read the PDF as text, use a split to isolate the table text, and pass that text to Generate Data Table along with the column and row separators. Because the column positions are fixed, you can refer to each column by index instead of by name.

To extract only the table text, try identifying a static string just before the table and another just after it in the PDF.
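
As a rough illustration of the idea (in Python rather than UiPath activities), here is a minimal sketch. The marker strings "ITEMS" and "TOTAL", the semicolon separator, and the sample text are all made-up assumptions; your real PDFs will need their own markers and separators.

```python
def extract_table_rows(text, start_marker, end_marker, column_sep=None):
    """Return the rows found between two static marker strings.

    column_sep=None splits on any run of whitespace.
    """
    start = text.index(start_marker) + len(start_marker)
    end = text.index(end_marker, start)
    rows = []
    for line in text[start:end].splitlines():
        line = line.strip()
        if line:  # skip empty lines inside the table block
            rows.append(line.split(column_sep))
    return rows


# Hypothetical text, as it might look after reading a PDF as text.
sample = """Invoice No: 1001
ITEMS
Description;Qte;Price
Widget;2;9.50
Gadget;10;3.25
TOTAL 51.50
"""

rows = extract_table_rows(sample, "ITEMS", "TOTAL", column_sep=";")
header, data = rows[0], rows[1:]

# Column index 1 is read regardless of whether the header says Qty, Qte or Quantite.
quantities = [row[1] for row in data]
print(quantities)
```

Reading by index only works while the quantity column stays in the same position across all invoice layouts.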

Cheers

My actual input files are different. The files shown above are not my real inputs; I created them only to post this question in the UiPath forum.

In my real input files
I need to extract the Description, Price, and Quantity,

but the issue is that I can’t extract the Quantity value, because
in some invoices the Quantity label is replaced by QTY, QTE, or QUANTITE.

Like this:

Can you please help me resolve this issue?

These are my Taxonomy Manager screenshots:

@krishna_priya1

Even after giving multiple possible values, is it still not returning the correct output?

Cheers

Yes.

Result screenshot:

It can’t extract the Quantity value.

Hi @krishna_priya1, read the PDF as text, save the output to a .txt file, paste the content into regexstorm.net, and start building regex patterns. Working with tables isn’t always easy; you may need to remove the “junk” from the invoice before extracting the relevant data.
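
To give an idea of what such a pattern could look like, here is a minimal Python sketch. It assumes each item row is laid out as description, quantity, price, with at least two spaces between columns; the sample text is invented, so treat the pattern as a starting point only.

```python
import re

# Assumed row layout: <description>  <integer quantity>  <price>
# Header and "junk" lines do not fit this shape and are skipped.
ROW = re.compile(
    r"^(?P<description>.+?)\s{2,}(?P<quantity>\d+)\s{2,}(?P<price>\d+(?:[.,]\d{2})?)$"
)


def parse_item_rows(text):
    """Return one dict per line that looks like an item row."""
    items = []
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match:
            items.append(match.groupdict())
    return items


# Hypothetical text output of an invoice.
sample = """ACME Ltd    Invoice 1001
Description      Qte    Price
Blue widget      2      9.50
Large gadget     10     3,25
Thank you for your business
"""

items = parse_item_rows(sample)
for item in items:
    print(item["description"], item["quantity"], item["price"])
```

Because the pattern matches the shape of the data rows and not the header text, it does not matter whether the column is labelled Qty, Qte or Quantite.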

@krishna_priya1

Try building a different document type for each variant, check which label is present, and use a switch case to extract from the matching quantity column. That might work as well.

Cheers

Is there any other option for extracting the Quantity?

Can you please explain where to use the switch case for Quantity in my workflow?
I posted my workflow images above.

@krishna_priya1

In the workflow, before sending the document to extraction, read the PDF and check whether it contains quantity or qte. Then use a switch case to segregate the documents and run a separate extraction for each type.
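
The routing logic can be sketched roughly like this in Python. The extractor names in `routes` are hypothetical placeholders for whatever separate extractions you build, and the label list is only the set of variants mentioned in this thread.

```python
import re

# Quantity label variants mentioned in this thread (accents are stripped first).
LABELS = ("quantite", "quantity", "qte", "qty")


def detect_quantity_label(text):
    """Return the first known quantity label found as a whole word, or None."""
    lowered = text.lower().replace("é", "e")
    for label in LABELS:
        if re.search(rf"\b{label}\b", lowered):
            return label
    return None


def choose_extractor(text):
    """Plays the role of the switch case: map the detected label to an extraction branch."""
    routes = {
        # Hypothetical branch names, one per invoice type.
        "quantity": "extractor_quantity",
        "qty": "extractor_qty",
        "qte": "extractor_qte",
        "quantite": "extractor_quantite",
    }
    return routes.get(detect_quantity_label(text), "manual_review")


print(choose_extractor("Description   Qte   Prix"))
print(choose_extractor("Item   Quantité   Montant"))
print(choose_extractor("No table on this page"))
```

Documents with none of the known labels fall through to a default branch, which is worth keeping so that new invoice layouts are not silently mis-extracted.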

Cheers