Dear forum,
I’m currently working with the Form Extractor in UiPath Document Understanding.
I have a multi-page PDF (5 pages), but I’ve only created a template for the first page.
I would like to apply this same template to every page of the PDF (or even to other PDFs with a similar structure).
Is there a way to configure the Form Extractor to process all pages of a PDF using the same template, without having to create separate templates for each page?
Any advice, tips, or best practices would be greatly appreciated. Thanks in advance!
Hi there,
If every page has the same layout, you can split the PDF into single-page files with the PDF activities and run the Form Extractor on each one:
- Use the Get PDF Page Count activity to get the number of pages.
- Pass the page count to the Repeat Number of Times activity (make sure its Start at property is 1).
- Pass the current index to the Range property of the Extract PDF Page Range activity.
- Set the OutputFileName property of the Extract PDF Page Range activity to the new file path.
- Run Document Understanding (the Form Extractor) on the new file path.
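To make the loop above concrete, here is a rough sketch of the same logic in plain Python rather than UiPath. It only builds the list of (Range, OutputFileName) pairs that the loop would feed into Extract PDF Page Range; it does not touch any PDF. The function name `per_page_jobs` and the `_page<N>.pdf` naming pattern are my own assumptions, not anything UiPath prescribes.

```python
from pathlib import Path


def per_page_jobs(pdf_path: str, page_count: int, out_dir: str) -> list[tuple[str, str]]:
    """Build one (page range, output file) pair per page.

    Page numbers are 1-based, matching a loop that starts at 1.
    The output naming pattern is hypothetical.
    """
    stem = Path(pdf_path).stem  # file name without folder or extension
    jobs = []
    for page in range(1, page_count + 1):
        out_file = Path(out_dir) / f"{stem}_page{page}.pdf"
        jobs.append((str(page), out_file.as_posix()))
    return jobs
```

Each pair corresponds to one iteration of the loop: the first value is what goes into Range, the second into OutputFileName.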
Regards,
Azeem
Thank you for your response, @azeem_rosli
However, the PDF files might contain over 100 pages. If we split the pages and process them individually, it could be time-consuming.
Is there an alternative approach to handle this without splitting the pages?
Thank you once again for your help!
Hi,
You can use Parallel For Each, but it becomes resource-intensive. It is therefore better to process the pages in batches, for example:
Batch Extraction.xaml (10.1 KB)
You can also distribute the batches to other robots, for example through queues, depending on how you want to design it.
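The attached workflow is not reproduced here, so the following is only one possible reading of "in batch", sketched in plain Python: the pages are grouped into fixed-size chunks so that only one chunk is processed at a time, or so that each chunk can become one queue item. The function name `batch_bounds` and the batch size are assumptions for illustration.

```python
def batch_bounds(page_count: int, batch_size: int) -> list[tuple[int, int]]:
    """Split pages 1..page_count into consecutive (first, last) batches.

    Both bounds are 1-based and inclusive; the last batch may be smaller.
    """
    if page_count < 0 or batch_size < 1:
        raise ValueError("page_count must be >= 0 and batch_size >= 1")
    bounds = []
    for first in range(1, page_count + 1, batch_size):
        last = min(first + batch_size - 1, page_count)
        bounds.append((first, last))
    return bounds
```

For a 100-page PDF and a batch size of 30 this gives four batches, the last one covering pages 91 to 100. A smaller batch size keeps memory use down, a larger one reduces the number of queue items.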
Regards,
Azeem