Convert PDF to text file

Hi all,
How can I convert a 9-page PDF file to a text file? I need to convert all of the data.

Please let me know.

Hi @0bb4628e217fd43ac86ac9294

  1. Install the PDF package - UiPath.PDF.Activities
  2. Read PDF Text activity - Range=“All” - Output=pdfText
  3. Write Text File activity - Text=pdfText
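For anyone who wants to see the same three steps outside Studio, here is a rough Python sketch. It is only an illustration, not the UiPath workflow: it assumes the third-party pypdf package is installed, and the function names (join_pages, write_text_file, pdf_to_text_file) are made up for this example. Like Read PDF Text, it only works for PDFs with a real text layer; a scanned PDF would come back mostly empty.

```python
from pathlib import Path


def join_pages(pages):
    """Combine per-page text into one string, with a blank line between pages."""
    return "\n\n".join(pages)


def write_text_file(text, txt_path):
    """Write the text to a file (the counterpart of Write Text File)."""
    Path(txt_path).write_text(text, encoding="utf-8")


def pdf_to_text_file(pdf_path, txt_path):
    """Read every page of a PDF and save the combined text to txt_path."""
    # Assumption: pypdf is installed (pip install pypdf). It is imported here
    # so the helpers above stay usable without it.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    # extract_text() can return an empty result for image-only pages,
    # so fall back to an empty string.
    pages = [page.extract_text() or "" for page in reader.pages]
    text = join_pages(pages)
    write_text_file(text, txt_path)
    return text
```

A call such as pdf_to_text_file("input.pdf", "output.txt") would cover all pages, which corresponds to Range=“All” in the steps above; both file names are placeholders.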

Hope this helps

Hi @0bb4628e217fd43ac86ac9294

Please use the package below for the PDF activities:

UiPath.PDF.Activities


Use Read PDF Text, which reads all the pages in the PDF. Assign its output to a variable and pass that variable to Write Text File. In the Write to Filename field, provide the text file name you want.

If the PDF is a scanned file, use Read PDF With OCR instead. It also reads all the pages in the PDF. Assign its output to a variable and pass that variable to Write Text File. In the Write to Filename field, provide the text file name you want.
For a scanned PDF, please use the format below.

Regards

Hi @0bb4628e217fd43ac86ac9294

Check out the video link

Regards
Gokul

@vrdabberu
Can you please show the properties of Read PDF With OCR and Tesseract OCR?

Hi @0bb4628e217fd43ac86ac9294

Properties of Read PDF with OCR

Properties of Tesseract OCR

Regards

I am able to extract everything using Microsoft OCR, but I am not able to extract one particular amount, which is surrounded by # characters.

Example Amount:#34,876,877.000#
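Once the OCR output does contain the # characters, pulling the amount out is a small regex job. Here is a hedged sketch in Python (in Studio the same pattern could go into a regex activity). It does not fix the OCR recognition itself; it assumes the text comes back in the form shown above, and the names HASH_AMOUNT, extract_hash_amounts and to_decimal are just for this example.

```python
import re
from decimal import Decimal

# Digits with optional thousands separators and an optional decimal part,
# enclosed in a pair of # characters.
HASH_AMOUNT = re.compile(r"#(\d[\d,]*(?:\.\d+)?)#")


def extract_hash_amounts(text):
    """Return every amount found between # characters, as written in the text."""
    return HASH_AMOUNT.findall(text)


def to_decimal(amount):
    """Convert an amount such as '34,876,877.000' to a Decimal."""
    return Decimal(amount.replace(",", ""))


if __name__ == "__main__":
    ocr_text = "Example Amount:#34,876,877.000#"
    for raw in extract_hash_amounts(ocr_text):
        print(raw, to_decimal(raw))
```

If the OCR engine reads # as some other character, the pattern would need to be adjusted to whatever actually shows up in the output.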

Hi @0bb4628e217fd43ac86ac9294

Try changing the Scale option in Microsoft OCR. The minimum scale value is 0 and the maximum is 5. You can also increase the value in steps of 0.5, so try different values until one of them gives you the desired output.

Regards

OK, thank you. I will check and get back to you.

Hi @0bb4628e217fd43ac86ac9294 ,

Thanks for reaching out to the UiPath Community.
You can follow the steps below:

  1. Install the PDF package - UiPath.PDF.Activities
  2. Read PDF Text activity - Range=“All” - Output=pdfText
  3. Write Text File activity - Text=pdfText

Hope this works for you.

Happy Automation,
@Vinit_Kawle

Hi @0bb4628e217fd43ac86ac9294 ,
You can try this step


regards,