i have pdf file i am extracting pdf data using Regex. i have extracted --Account number, open balance, close balance.
but i cant able to extract Table data using regex .
can anybody help me .BankStatement.pdf (15.9 KB)
Extract tables may be very complex using regex so I am suggesting either to go with datascraping or screenscrpping or table extraction feature using document understanding
Hope it helps you
some pdf files not supported to use data scrap, that’s why i am asking other way to get Table data.
So try with document understanding feature in Uipath to extract tables from PDF
In this video, I extract tables from PDF and write data in Excel:
0:25 Install PDF Activities
1:10 READ PDF text, Get PDF page count, Extract PDF
5:40 Read PDF with OCR
6:55 Join PDF and Manage PDF passwords
9:30 Extract Images From PDF and Export PDF as Image
12:00 Extract table from PDF use-cases 1 replace some spaces with | (one column has multiple words)
24:00 Run the robot to see the result
25:40 Extract Table from other PDF use-cases 2 delimiter is 2*spaces " " easy split
31:50 Extract Table from complex PDF use-cases 3 unstructured data the logic will be based on IsUpper and IsLower
40:25 Extract the price value from PDF