PDF extraction From input file data

Hi Everyone,
Am having a doubt on pdf extraction part can you please help me on it below the details.

→ Am having single pdf having with 8 pages in that am having multiple patients accounts information
->I need to extract each patient info while using readpdftext activity in UiPath it was stopping the exceed limit because more than characters having in pdf. unable to use the readpdftext activity so instead of that how to extract the individual patients details. and need to load in Excel

Hi @jyothi_kakaraparthi

You can use read pdf with tesseract ocr to get the pdf information. Then use matches activity to get the specific details and then use for loop to add the data to the datatable. Please reply if it’s work or not.

Regards,
@supriya117

Hi @jyothi_kakaraparthi ,

Could you let us know more about the Error message that you are receiving and also the version of PDF activities being used ?

We could maybe have a workaround by looping through the pages one by one in the PDF and extracting just One Page Data at a time and performing the operation required.

Hi Everyone,

I was struck with the above solution can you please help me on it

Am having 10 pages in single PDF in that I have 25 patients account information structure data with table data I need to extract the every patient information individually and load to excel!!!

how to extract the each patient information please give me the suggestions for this I was tried with multiple scenarios. but am not getting as expected

@jyothi_kakaraparthi ,

We noticed that the initial post had a different issue highlighted as well, in terms of reading the PDF document itself. Is that the case even now or is it a matter of the data extraction only ?

Same case Main concern here extraction part

@jyothi_kakaraparthi ,

Are you able to read just one Page of the PDF ? You can configure the Page No. to be read in the Range property.

Also, as asked previously if you could provide us with the details as mentioned, we can further investigate on it.

re you able to read just one Page of the PDF ? You can configure the Page No. to be read in the Range property.

I can read the single page in PDF at a time using document understanding concept but unable to do multiple pages using document understanding so stroked.