Destroyed pdf format when using ExtractPDFRange

TP2B · September 1, 2021, 8:15am

We are splitting a PDF into seperate PDF files. But some of them have strange fonts/formats:

The original was looking like:

There is no OCR used.

MateuszSzatkowski · September 1, 2021, 8:22am

Hello Tobias,

Once I had similar issue and I used

(In properties You can Preserve Formatting )

and then I used regex in order to separate / extract data. Hope if helps

Palaniyappan · September 1, 2021, 8:24am

Hi
Is the pdf editable
If so then use a normal READ PDF activity or
If not use READ WITH OCR and use Omnipage ocr to extract the text

Cheers @TP2B

TP2B · September 1, 2021, 9:33am

Thanks for your responses. But I currently don’t read anything. It’s the process “Extract PDF Range” only. So the output is a PDF and I opened it with ACReader (see screenshots)

TP2B · September 1, 2021, 1:05pm

was a config error, the user extracted manualy by printing a page from a PDF to a PDF

system · September 4, 2021, 1:06pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Extracting data from PDF-s Studio uiautomation	6	833	July 27, 2022
PDF Extraction---- help Studio pdf , studio , question , activities_panel , pdf-extraction , emailtopdf , pdf-conversion , pdf-to-image , pdf-tag	3	831	October 7, 2022
Read PDF Text does not separate columns correctly Help pdf , activities , faq	3	1410	November 26, 2020
How to extract pdf to write text file ? Help Studio activities , studio	3	537	November 3, 2022
Text Extraction From PDF - With Layout Retained Activities pdf , activities , question	2	1239	August 18, 2021

Destroyed pdf format when using ExtractPDFRange

Related topics