Unable to extract PDF table data using Regex or string manipulation

Hi,
I have a PDF file and I am extracting its data using Regex. So far I have extracted the account number, opening balance, and closing balance,
but I am not able to extract the table data using Regex.
Can anybody help me? BankStatement.pdf (15.9 KB)

Hi @Anand_Designer

Extracting tables with regex can be very complex, so I suggest using data scraping, screen scraping, or the table extraction feature of Document Understanding.

Hope it helps you

Regards

Nived N :robot:

Happy Automation :relaxed::relaxed::relaxed::relaxed:

Some PDF files do not support data scraping, which is why I am asking for another way to get the table data.

In that case, try the Document Understanding feature in UiPath to extract tables from the PDF.

Hello Anand,
In this video, I extract tables from a PDF and write the data to Excel:

0:25 Install PDF Activities
1:10 Read PDF text, Get PDF page count, Extract PDF
5:40 Read PDF with OCR
6:55 Join PDF and Manage PDF passwords
9:30 Extract Images From PDF and Export PDF as Image
12:00 Extract table from PDF, use case 1: replace some spaces with | (one column has multiple words)
24:00 Run the robot to see the result
25:40 Extract table from another PDF, use case 2: the delimiter is two spaces ("  "), so an easy split
31:50 Extract table from a complex PDF, use case 3: unstructured data, the logic is based on IsUpper and IsLower
40:25 Extract the price value from PDF
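
For anyone who wants the idea of use case 2 without watching the video, here is a rough sketch in Python. It is not the workflow from the video, which is built with UiPath activities. The sample text, the column names, and the function name `split_rows` are made up for illustration, and it assumes the text read from the PDF keeps at least two spaces between columns and only single spaces inside a cell.

```python
import re

# Hypothetical text as it might come back from a PDF text read.
# The real layout of BankStatement.pdf is not shown in this thread.
SAMPLE_TEXT = """Date        Description            Amount
01/02/2021  ATM Withdrawal         100.00
03/02/2021  Salary Credit          2500.00"""


def split_rows(text):
    """Split every non-empty line into cells on runs of two or more spaces."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if line:
            # Single spaces stay inside a cell; 2+ spaces mark a column break.
            rows.append(re.split(r" {2,}", line))
    return rows


rows = split_rows(SAMPLE_TEXT)
header, data = rows[0], rows[1:]
for row in data:
    print(dict(zip(header, row)))
```

If a PDF gives only one space between some columns, this split will merge them, which is the situation use cases 1 and 3 in the video deal with.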

Thanks,
Cristian Negulescu