I’m trying to extract data from a table inside a standardised PDF form into Excel, and also to extract the number given beside the title Job Code. The table data needs to land in Excel with its column headers and with each value in its own cell, not everything in a single cell. Any suggestions for the best approach, please?
The forms will be in both digital and printed versions. I will have to handle both scenarios and will try the digital one first. For the digital part, would the workflow basically be:
Read PDF Text
Data Scraping
Extract Structured Data of the table part
Excel Application Scope
Append Range
It doesn’t seem to work correctly unless the PDF window is open, and it also doesn’t output the column headers. Is there an activity to open the PDF window first so that the data scraping can work properly?
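
To make the target output concrete, here is a minimal Python sketch (outside UiPath) of the result I'm after for the digital case. Everything in it is an assumption for illustration: the sample text stands in for whatever Read PDF Text would return, the column names are made up, and it assumes table cells are separated by two or more spaces. It is not how the UiPath activities work internally.

```python
import csv
import io
import re

# Hypothetical text, standing in for the output of a PDF text-reading step.
SAMPLE_TEXT = """Job Code: 12345
Item    Qty    Price
Widget  2      9.50
Gadget  10     1.25
"""


def extract_job_code(text):
    """Return the number that follows the 'Job Code' title, or None."""
    match = re.search(r"Job Code\s*:?\s*(\d+)", text)
    return match.group(1) if match else None


def extract_table(text, header_first_column="Item"):
    """Split table lines into cells, keeping the header row as the first row.

    Assumes cells are separated by two or more spaces and that the table
    starts at the line whose first cell equals header_first_column.
    """
    rows = []
    in_table = False
    for line in text.splitlines():
        cells = re.split(r"\s{2,}", line.strip())
        if cells[0] == header_first_column:
            in_table = True
        if in_table and len(cells) > 1:
            rows.append(cells)
    return rows


def to_csv(rows):
    """Serialise rows as CSV text, one value per cell, which Excel can open."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


job_code = extract_job_code(SAMPLE_TEXT)
table = extract_table(SAMPLE_TEXT)
print(job_code)
print(to_csv(table))
```

The point is only the shape of the output: the Job Code as a separate value, and the table as rows with the header row first and one value per cell.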