Need Regular expression for dividing table information

I got table information from pdf like this…now i need to identify no of columns and each column value in separately…

PARTICULARS HSN CODE RATE AMOUNT
Consultancy services 9983 42300 42300
Total 42300
Taxable Value 42300
ADD IGST 18% 18% 7614
Total 49914.00

Required out put -
PARTICULARS - consultancy services
HSN CODE - 9983
RATE - 42300
AMOUNT - 42300
ADD IGST 18% - 7614


Total - 49914.00

Hi @sindhura506

Can you provide the data by reading the document with format and without format so that applying Regex will be Easy

Happy Automation :raised_hands:

Best Regards
Er Pratik Wavhal :robot::man_technologist:t4: :computer:

Thank you @Pratik_Wavhal
Here is the text file what i’m getting after using read pdf activity…output.txt (608 Bytes)

using regular expressions i’m able to getting GST NO, INVOICE NO, INVOICE DATE but i need table data also…

HI @sindhura506

Can you share the Pdf Data again by enabling the PreserveFormatting to True as shown below :-

image

Happy Automation :raised_hands:

Best Regards
Er Pratik Wavhal :robot::man_technologist:t4: :computer:

Here is the output file…enabling the PreserveFormatting to True
output.txt (1.1 KB)

Hi @sindhura506

Below is the workflow for the same :-
MainPratik.xaml (10.7 KB)
output.txt (608 Bytes)

Note :- I have used the output.txt which is not enable as PreserveFormatting

Output :-

image

Mark as solution and like it if this helps you :slight_smile:

Happy Automation :raised_hands:

Best Regards
Er Pratik Wavhal :robot::man_technologist:t4: :computer:

i’m getting this error…what is the missing activity??

Hi @sindhura506

Below is the img for the same :-

image

Its Matches Activity

Mark as solution and like it if this helps you :slight_smile:

Happy Automation :raised_hands:

Best Regards
Er Pratik Wavhal :robot::man_technologist:t4: :computer:

Can you send me the regular expression what you have used in matches activity…i have matches activity in ‘Activities’ but it is showing same error…do i need to install any other packages?

Hi @sindhura506

I think my and yours Matches Activity Package Version is different so it is not showing. If its possible for you to upgrade then try it once. Bcz i am using all the latest version Packages.

Regex used as below :-

"(?<=ADD IGST).+(\d{4})" 

If the No can be vary in Size then untill what max no can be you can put in Comma , within Curly Bracs like {4,7}

Mark as solution and like it if this helps you :slight_smile:

Happy Automation :raised_hands:

Best Regards
Er Pratik Wavhal :robot::man_technologist:t4: :computer:

1 Like

Thank you @Pratik_Wavhal…Thank you so much…

Hi @sindhura506

My Pleasure
You are welcome

Happy Automation :raised_hands:

Best Regards
Er Pratik Wavhal :robot::man_technologist:t4: :computer:

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.