Two types of PDF format

Hello experts,
There are two types of PDF: one is a native text PDF, and the other is scanned and then converted to PDF. Is there any logic to separate these two types?
example11-23-2018_DELEGAIT-MULTIRATIONAL_RETAINER_Billing.pdf (179.3 KB)

Refer to the link below.

Native PDF - Data Scraping can be used to extract the data.
Scanned PDF - Use Read PDF With OCR.

If you are familiar with regular expressions, you can use Regex on the text extracted from both native and scanned PDFs to pull out the data.
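
To illustrate the Regex suggestion, here is a minimal Python sketch rather than a UiPath workflow. It assumes the PDF text has already been extracted into a string (by either method above). The function name `find_dates`, the sample text, and the MM-DD-YYYY pattern are hypothetical and only for illustration.

```python
import re

# Hypothetical pattern: dates written as MM-DD-YYYY, e.g. 11-23-2018.
DATE_PATTERN = re.compile(r"\b\d{2}-\d{2}-\d{4}\b")


def find_dates(extracted_text: str) -> list:
    """Return every MM-DD-YYYY date found in the already-extracted PDF text."""
    return DATE_PATTERN.findall(extracted_text)


if __name__ == "__main__":
    # Made-up sample text standing in for the output of a PDF read step.
    sample = "Billing date: 11-23-2018\nDue date: 12-23-2018"
    print(find_dates(sample))
```

The same pattern works whether the text came from a native read or from OCR, although OCR output may contain misread characters that a strict pattern will not match.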

Can we separate these two types of PDF with a condition?

How will the robot recognize whether it is a scanned PDF or a native PDF?

Hi,

Use the Read PDF Text activity and check the length of the output string. If the length is greater than 0, it is a native PDF; otherwise it is a scanned PDF.
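
The check above can be sketched in Python as follows; in UiPath the same condition would go in an If activity on the output of Read PDF Text. The function name `classify_pdf` is hypothetical, and trimming whitespace before measuring the length is an added assumption (a scanned PDF might return only blank characters), not part of the original suggestion.

```python
from typing import Optional


def classify_pdf(extracted_text: Optional[str]) -> str:
    """Classify a PDF as "native" or "scanned" from its extracted text.

    extracted_text is whatever a plain (non-OCR) text read returned.
    """
    # Assumption: strip whitespace so output made only of spaces or
    # newlines is treated as empty. The original check is just length > 0.
    if extracted_text is not None and len(extracted_text.strip()) > 0:
        return "native"
    return "scanned"


if __name__ == "__main__":
    print(classify_pdf("Invoice total: 100"))
    print(classify_pdf(""))
```

A file classified as scanned can then be routed to the OCR branch, and a native one to normal text extraction.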

Thank you so much.
