Unable to read pdf using Read pdf text Activity

Hi all,
Good Day

I am trying fetch values and it’s in the text format only. But while using Read pdf text activity some of the values are getting corrupted kindly help me to resolve the issue.

Hi @Buvaneshwaran_R

Have you tried with read PDF with OCR?

Regards

Hi @Buvaneshwaran_R

Can you share the PDF file if possible

Try with @pravin_calvin suggestion also.

Regards
Gokul

1 Like

Hi!

We have two types of pdf’s 1.Native PDF 2.Scanned PDF

Native PDF:

Native PDF is nothing but where we can select the text and extract the text by using read pdf activity.

Scanned PDF:

Scanned pdf where we can not select the text which is the type of image for extracting the text from this kind of PDF’s we can use Read Pdf with OCR use the Tesseract OCR engine to extract the text from scanned pdf.

Regards,
NaNi

HI @THIRU_NANI it’s native pdf only we can copy text manually but while using Uipath only we are getting corrupted values and i tried to use ocr aswell that’s also not getting accurate values.

Hi!

Could you please provide us the input file?

And also show us the values which you’re getting from the read pdf and also read pdf with ocr.

Regards,
NaNi

Hi @THIRU_NANI

It’s client one Thiru we can’t get out.

Hi @Buvaneshwaran_R

In your case just delete that corrupted PDF and replace the new PDF file

Before Using both Read PDF and Read Pdf with OCR

Use delay activity → 00:00:05

Regards
Gokul

PDF file is not corrupted but while using the data in pdf we are getting this problem

Hi @Buvaneshwaran_R

You can try to RollBack the package

Go to ->Manage packages → Uipath.Pdf.Activities

Can you share the sample PDF file. Not the original one

Regards
Gokul

Hi!

Could you please try this component to get it done.

Regards,
NaNi