Hi there,
I am trying to extract table from a pdf (screenshot below):
I have used Data Scraping extraction wizard. However, I am unable to extract column name correctly. Following is the output:
Blockquote
[Column-0,Participants,Ballots Completed,Ballots Incomplete/ Terminated,Results,Column-5
Blind
,5
,1
,4
,"34.5%, n=1
","1199 sec, n=1
"
Low Vision
,5
,2
,3
,"98.3% n=2
(97.7%, n=3)
","1716 sec, n=3
(1934 sec, n=2)
"
Dexterity
,5
,4
,1
,"98.3%, n=4
","1672.1 sec, n=4
"
Mobility
,3
,3
,0
,"95.4%, n=3
","1416 sec, n=3
"
]
Blockquote
Question: How can I improve the table extraction to get correct column names?
Cheers
Below is the screenshot of extraction wizard:
Here’s how my code looks like:
@husain.shah Are you using SilverLight Extension ?
No. I am not using silverlight ext.
@husain.shah Is the PDF file stored in your System?
Yes. i downloaded the pdf from the source above and working on a local copy.
@husain.shah Then Have you tried PDFtoExcel Activity ?
PDFtoExcel Activity uses SautinSoft api which has a trial version that only converts 3 pages of PDF and it is for evaluation purposes only. I am interested in a free solution.
Hi Hussain,
Were you able to find the solution?
shero
June 17, 2020, 5:51am
13
hey did you find any free and viable solutions to extract data table from pdf?
Hello shero,
Yes, I tried epsilon package for the same. However, it is not the best solution but definitely worth a try and it is free of cost.
Link below -
https://epsilonai.com/how-to-extract-table-from-pdf-in-uipath
shero
June 17, 2020, 11:13am
15
It did not work accurately for me.
I want to extract tabular data row wise based some regex.
@supermanPunch @Shah_Hussain @husain.shah
@shero I am looking for same .
But even if exact the table from pdf into datatable would work for me , but without data scrapping and epsilonAI activity or third party package (coz of security reasons)
Please guide
It’s asking license key can you provide that license key file
Please
@shero @Sakshi_Jain @siva_sankar
Hello guys,
Yes, Epsilon has started asking for subscription keys now. However, I am working on this and will get back to you guys very soon.
1 Like
Thank you
Please update once because lot of pdf work in that format