How to read a PDF document when the data continues onto the next page

Hi All,

Good morning.

I have a PDF document in which the data on some pages continues onto the next page. How do I read both pages in that case? Depending on the data, it may take a single page or two pages. The PDF is attached, along with the workflow I have created.
Thank you.

Note: I am looking to read both pages (1 and 2) here.

PDFReader.zip (36.9 KB)
UiPathPDFReading.pdf (229.3 KB)

@Raghu_km

Use the Get PDF Page Count activity to get the number of pages in the PDF.

The Read PDF Text and Read PDF With OCR activities have a "Range" property where you can specify the pages you want to extract. It is set to "All" by default; you can replace that with a specific page number or a range of pages.

@Raghu_km

You can use the Get PDF Page Count activity and, based on that count, use Read PDF Text with the page number entered in the Range field.

If you want to read a specific page, enter that page number in the marked Range field, for example "2", so that only the 2nd page is read. If you want to read a set of pages, enter something like "2-5,7", which reads pages 2 to 5 and also page 7.
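To illustrate how a range such as "2-5,7" selects pages, here is a minimal Python sketch. It is not UiPath code: the function name `parse_page_range` is made up for illustration, and dropping pages beyond the page count is an assumption of this sketch, not confirmed behavior of the activity.

```python
def parse_page_range(expr: str, page_count: int) -> list[int]:
    """Expand a range string such as "2-5,7" or "All" into 1-based page numbers.

    Hypothetical helper; it only mimics the range syntax described in this thread.
    """
    if expr.strip().lower() == "all":
        return list(range(1, page_count + 1))
    pages = []
    for part in expr.split(","):
        part = part.strip()
        if "-" in part:
            start, end = (int(p) for p in part.split("-", 1))
        else:
            start = end = int(part)
        for page in range(start, end + 1):
            # Assumption: pages outside the document are dropped, duplicates skipped.
            if 1 <= page <= page_count and page not in pages:
                pages.append(page)
    return pages


print(parse_page_range("2-5,7", page_count=10))  # [2, 3, 4, 5, 7]
```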

If you want to read all the pages of a PDF, whether it has a single page or several, you can use Read PDF Text directly for normal (text-based) PDFs and Read PDF With OCR for scanned PDFs.

Regards

Hey @Raghu_km, I have attached the folder here:
PDFReader.zip (263.4 KB)

This will read all the pages available in the PDF.
You can then add an If condition or some other logic to get the expected results.

Hope it helps you :)

@Raghu_km
The Read PDF Text activity reads the data from all pages by default. If you want to read a specific range, the Properties panel has a "Range" property where you can specify a single page number such as "2" or a span of pages such as "4-6".

To get the page count, use the Get PDF Page Count activity.

@rlgandu
In their workflow, the text has to be manipulated after it is read, and that manipulation was working only for the first page, even with the Read PDF Text range kept at "All".

Thank you for your response. I only gave a sample PDF; the actual PDFs have more than 20 to 25 pages, and we need to match PDF to PDF. Most of the time the data fits on one page, but sometimes it continues onto the next page. How do I capture it in that case? The number of pages is dynamic. In my PDF the data flows onto the next page, and I need to capture that as well. Please assist.
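One possible way to handle data that spills onto the next page is to read the pages individually and then join each continuation page to the page before it. The Python sketch below shows only the grouping logic, not UiPath code. It assumes every new record starts with a recognizable header string; the "INVOICE" marker and the function name `group_continued_pages` are hypothetical and would have to be adapted to the real PDF.

```python
def group_continued_pages(page_texts, header_marker):
    """Group consecutive pages into records.

    A page whose text starts with header_marker begins a new record;
    any other page is treated as a continuation of the previous record.
    """
    records = []
    for text in page_texts:
        if not records or text.lstrip().startswith(header_marker):
            records.append([text])
        else:
            records[-1].append(text)
    # Join the pages of each record into one string for further processing.
    return ["\n".join(pages) for pages in records]


# Made-up page texts; "INVOICE" is a hypothetical header marker.
pages = [
    "INVOICE 001\nitem A",
    "item B (continued)",
    "INVOICE 002\nitem C",
]
for record in group_continued_pages(pages, header_marker="INVOICE"):
    print(repr(record))
```

With this grouping, a record that takes one page and a record that takes two pages are both returned as a single block of text, so the later matching step does not need to know how many pages each one used.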

Thank you, I will check and let you know.

@Raghu_km

In the XAML above, I have declared in_countPage as an Int32 variable with a default value of 1, so the loop gets executed, and through it you can read the individual pages.
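The XAML itself is not reproduced in this thread, so the following Python sketch is only a rough illustration of a page-by-page loop. It treats the page count as a value that defaults to 1, which is an assumption about how in_countPage is used; `read_page` and `fake_pdf` are hypothetical stand-ins for a Read PDF Text call with Range set to a single page.

```python
def read_pages_one_by_one(read_page, page_count=1):
    """Call read_page("1"), read_page("2"), ... and collect each page's text.

    page_count defaults to 1 so the loop runs at least once
    (an assumption about the role of in_countPage).
    """
    texts = []
    for page_number in range(1, page_count + 1):
        # The range is passed as a string, like the "Range" property.
        texts.append(read_page(str(page_number)))
    return texts


# Stand-in reader so the sketch runs without a PDF library.
fake_pdf = {"1": "first page text", "2": "second page text"}
texts = read_pages_one_by_one(fake_pdf.__getitem__, page_count=len(fake_pdf))
print(texts)
```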

Hope it works!