How to Read Table from PDF

pdf
studio

#1

i have one pdf in this pdf two table is there .when i reading the pdf the all table data is converted into Text . i want the table data as a dataTable format???


#2

U can use Data Scraping(Under Design Tab) method to get the table data as datatable

Or

U can use Extract Structured data Activity to get the table data


#3

Both are same.:wink:


#4

ok …


#5

i am trying with data scrapping getting Error ‘Control does not Support data Extraction’


#6

Then you can’t i guess.
How about screen scraping? though info won’t be in table format but just give a try.


#7

Hi,

I’m able to extract the tables in pdf through data scraping wizard when the pdf window is opened on the screen but I don’t know how to read the pdf and identify if tables exist in the same or not and if they exist, then how to extract them one by one. Could you help me on this.

TIA.


#8

Hi,
Is pdf format is structure?
If you used data scraping and if table exist then it returns the value(to validate use this in if condition datatable.Rows.Count.ToString) if number of rows 0 then there is no table or it might throw an error(you can catch using Try/Catches activity) then you can proceed further.


How to extract data from table which is in pdf format?