Document understanding extract data in pdf where pages is dynamic

Vincent_Nuestro · July 5, 2023, 1:34am

Hi Good day,

I have a problem using DU where the number of pages is dynamic or sometimes it is 4pages, 2pages and More.

The initial DU I created is designed for 4pages PDF file, now there are some scenarios that is more than or Less than 4pages.

Original Design in 4pages PDF file:

GET data from the 3rd page of the File.

New Scenario 2Pages PDF File:

The data from the 3rd page of PDF file is moved in the 1st Page of PDF File(New Scenario)

For your Reference: This is where i followed the DU
(48) UiPath Document Understanding: Extract Tables Out of PDFs - YouTube

Thank you in advance.
-Vincent

supriya117 · July 5, 2023, 4:50am

Hi @Vincent_Nuestro

For dynamic pages, use can use regex or ML extractor instead of form extractor.

Regards,

Kailas_Kanade · July 5, 2023, 5:12am

For this u try this
step 1) Digitization under use UiPath Documents OCR
step 2) For extraction
-implement ChatGPT extract information or implement Regex Or Advanced DU
step 3) extraction O/P store in excel or othe

supermanPunch · July 5, 2023, 7:44am

Hi @Vincent_Nuestro ,

Maybe performing a Classification to properly classify the Page as required would help for this scenario.

Is there a Classification method already in place ?

Topic		Replies	Views
UiPath data extraction - extract from a dynamic page Studio question	2	104	May 20, 2024
Not able to extract all pdf page data using Document Understanding Something Else feedback	4	750	June 28, 2022
Uipath DU Extract country Activities excel , studio	5	311	October 7, 2023
Read multiple pages of a PDF file Document Understanding	10	4516	April 26, 2022
Multipage pdf extraction Studio uiautomation , activities , studio , pdf-extraction	1	759	July 21, 2022

Document understanding extract data in pdf where pages is dynamic

Related topics