Need help in PDF extraction using Document Understanding

Learner007 · November 18, 2022, 5:14pm

Hi everyone, I have a PDF file that contains multiple invoices merged together. I want to extract data from each invoice one by one using Document Understanding. I tried the Form Extractor, but it is not working. Can anyone guide me on how to extract data from these invoices?
I’m attaching my PDF below for reference.

Required Output:

Invoice 1:
Invoice number, invoice date, Account number, Table, Sub total, tax, Discount, Grand total.

Invoice 2:
First name, Surname, Member no, DOB, Main member - Name, Liberty health provider no, Admission date, Discharge date, Table, Cost

Invoice 3:
Claim number, Personal health number, Name, DOB, Mail, City, Postal code, Date of accident, Table

111900492434ImagePDF-merged.pdf (2.9 MB)

Robinnavinraj_S · November 18, 2022, 5:30pm

Hi @Learner007

I have looked at your PDF file. I assume all the invoices are single-page, so you need to split the merged PDF page by page. Then extract each split PDF one by one using Document Understanding and append the extracted values.

Regards
Robin

Learner007 · November 18, 2022, 5:35pm

Hi @Robinnavinraj_S, in the sample PDF each invoice is a single page, but in some cases I will get invoices that span multiple pages. Is there any option to identify those and split the invoices accordingly?

sharon.palawandram · November 21, 2022, 5:53pm

Hello,

You can split the PDF into individual invoices before you digitize and extract. You can do this by adding the PDF activities. Here’s a tutorial on how to split either by single page or by a dynamic page range.

sharon.palawandram · November 21, 2022, 5:54pm

Here’s how you can split a PDF into dynamic ranges; you can follow this tutorial:
