Hello experts,
There are two types of PDF: a text-based (native) PDF, and a PDF that was scanned and then converted to PDF. Is there any logic to tell these two types apart?
example11-23-2018_DELEGAIT-MULTIRATIONAL_RETAINER_Billing.pdf (179.3 KB)
Refer to the link below.
Native PDF - use Data Scraping to extract the data.
Scanned PDF - use the Read PDF With OCR activity.
If you are familiar with regular expressions, you can apply Regex to the text extracted from both native and scanned PDFs to pull out the data.
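As a rough illustration of the Regex step, here is a minimal Python sketch that works on text already extracted from a PDF, whether it came from a native read or from OCR. The `MM-DD-YYYY` date pattern, the function name `find_dates`, and the sample sentence are assumptions made up for this example; they are not taken from the attached file.

```python
import re
from typing import List

# Hypothetical pattern: dates written as MM-DD-YYYY.
DATE_PATTERN = re.compile(r"\b\d{2}-\d{2}-\d{4}\b")


def find_dates(text: str) -> List[str]:
    """Return every MM-DD-YYYY style date found in the extracted text."""
    return DATE_PATTERN.findall(text)


if __name__ == "__main__":
    # Made-up sample standing in for the text returned by a PDF read step.
    sample = "Billing period 11-23-2018 to 12-23-2018"
    print(find_dates(sample))
```

OCR output tends to contain misread characters, so a pattern that works on a native PDF may need to be loosened for scanned ones.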
Can we separate these two types of PDF with a condition?
How will the robot recognize whether a file is a scanned PDF or a native PDF?
Hi,
Use the Read PDF Text activity and check the length of the output string. If the length is greater than 0, the file is a native PDF; otherwise it is a scanned PDF, because a scanned page is only an image and has no text layer to read.
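The length check described above can be sketched in Python as follows. This is only an illustration of the condition, not UiPath code: the function name `classify_pdf` and the `min_chars` parameter are hypothetical, and the input is assumed to be the string returned by the plain text read. Stripping whitespace before counting is an extra assumption, added because some scanned files may return only blank lines.

```python
def classify_pdf(extracted_text: str, min_chars: int = 1) -> str:
    """Classify a PDF as 'native' or 'scanned' from its extracted text.

    extracted_text: output of a plain (non-OCR) text read of the PDF.
    min_chars: hypothetical threshold; 1 mirrors the 'length > 0' rule.
    """
    # Drop all whitespace so a result made only of blanks counts as empty.
    meaningful = "".join(extracted_text.split())
    return "native" if len(meaningful) >= min_chars else "scanned"


if __name__ == "__main__":
    print(classify_pdf("Invoice total: 100"))  # text layer present
    print(classify_pdf(""))                    # nothing extracted
```

In a workflow, the same condition would go in an If activity: the "native" branch continues with the extracted text, and the "scanned" branch re-reads the file with OCR. Raising `min_chars` may help with scanned files that carry a few stray characters of embedded text.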
Thank you so much.