Extract scanned PDF to excel

chandra_raju · August 12, 2020, 8:53am

Hi Folks,

I need to be extract the data from scanned PDF, columns like Policy, Eff,Insured, TYpe, Invoice, Gross Prem, Comm% and Invc Amt Paid and move the data into excel
Note : In one pdf it may contains 2 pages in another pdf it may contains 8 pages in such a case i need to extract data from all the pages and need to convert to excel.

KIndly help with the solution.
Cochrane (1).pdf (54.5 KB)

Srini84 · August 12, 2020, 9:15am

@chandra_raju

You can use Read PDF activity if it is plain text pdf OR Use Read PDF with OCR for Image PDF
and write to text file

From the text file, you have to do regex, depends upon the requirement

Hope this will helps

Thanks

sachinbhardwaj · August 12, 2020, 9:43am

Hello,

Two methods:
@Srini84 already told you one,
secondly, use Digitize Document activity in Intelligent OCR and Use OmniPage OCR, you will get better results

chandra_raju · August 12, 2020, 4:36pm

Thanks @Srini84 & @sachinbhardwaj for the support.
I need to remove the below data from either text file or while extracting the data from PDF

If it may have multiple pages as well i need to be extract only structured data from pdf and i need to be write into excel is there any approach for this solution. Below is the refernce data.data.txt (1.5 KB)

Pratik_Wavhal · August 12, 2020, 5:56pm

Hi @chandra_raju

Below is the workflow for the same :-
MainPratik.xaml (16.7 KB)
Cochrane (1).pdf (54.5 KB)
abc.txt (2.6 KB)
data.txt (1.5 KB)
output.xlsx (9.1 KB)

Output :-

Regex used :-

Mark as solution and like it if this helps you

Happy Automation

Best Regards
Er Pratik Wavhal

system · August 16, 2020, 8:46am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Extraction data from pdf to Excel Activities pdf , activities , question	9	430	October 2, 2023
PDF Extracxation Studio studio , question , project_panel	2	643	October 20, 2022
How to extract details from PDF to Excel Studio uiautomation	4	674	December 4, 2022
Extract dynamic Page PDF to Excel Studio studio , question , activities_panel	21	842	March 16, 2023
Different pdf to excel Help studio	14	2438	February 11, 2020

Extract scanned PDF to excel

Related topics