PDF specific Data Extraction (Table Format)


I need to take all ISIN number from the whole PDF

Hi @Santhosh_Kumar3

Welcome to UiPath Forum.

Hope the below link will be helpful.

In addition, If your PDF is a scanned image then go for OCR based PDF activities.

Please go to manage packages and install UiPath.PDF.Activities. Once you install that package, you will be able to see PDF activities.

You can use Read PDF Text (For normal PDF file) or Read PDF Text with OCR (Scanned PDF) activities for your requirement.

Thank you.

but i need to scrap that ISIN column only other column have data
I used that also by using regex I unable to get data

with out using DU if have possible to get data

@Santhosh_Kumar3

If the PDF is plain PDF then you can use as below

Now by checking the output you can use Regex to extract

Hope this will help you

Thanks

Hi @Santhosh_Kumar3

Please follow the below step to get specific column value.

Step 1: use ‘Read PDF Text’ activity to get the entire data from PDF file. Assuming the PDF having non-image content. If its have Scanned documents as PDF, make use of PDF with OCR

Step 2: Understand string pattern for ISIN column and apply the specific regular expression on entire text data you received.

For more details about RegEx in UiPath, you can look into the below,

Hope this will be useful. Thank you.

once I check and update Si/Madam

Thanks sir

Is this issue resolved? Thank you.

Resolved Thanks Sir

Could you please mark the helped post as ‘Solution’ so that it will be helpful for others as well?

Thank you.