Error in data scraping

Hi, I’m trying to use Data Scraping on a table in PDF, however, I encountered this error:-


Error message: This control does not support data extraction.

I note that the table in the PDF is structured, as I am able to highlight the rows of data using my cursor:

May I know how I can solve this error?

Thank you!

@schee013

As the Data scraping can’t identify the PDF tables

For this you need to use Read PDF activity, which will give you a text file without structure

OR

You require to train the model using Document Understanding activities

Else
You can choose your own AI OCR’s like Abbyy Flexicapture, Forms Recognizer but those are Paid license

Hope this helps you

Thanks

Hello ,
In this video, I extract tables from PDF and write data in Excel:

0:25 Install PDF Activities
1:10 READ PDF text, Get PDF page count, Extract PDF
5:40 Read PDF with OCR
6:55 Join PDF and Manage PDF passwords
9:30 Extract Images From PDF and Export PDF as Image
12:00 Extract table from PDF use-cases 1 replace some spaces with | (one column has multiple words)
24:00 Run the robot to see the result
25:40 Extract Table from other PDF use-cases 2 delimiter is 2*spaces " " easy split
31:50 Extract Table from complex PDF use-cases 3 unstructured data the logic will be based on IsUpper and IsLower
40:25 Extract the price value from PDF

Thanks,
Cristian Negulescu