Extracting text from a table in a PDF file

Hi everyone!

I have a problem I hope y’all can help me with. Basically, I’ve succeeded in extracting the platform type from the PDF document, as shown below.

Image 5

I used a regex in an Assign activity to capture the “Xbox” part of that field, with the following expression:

Code

Now I’d like to extract row information from the following table:

Table

How can I use regex (or anything else) to extract the table information as strings and then write it to an Excel table?

Thanks in advance!

Can you upload a sample copy?

Hi! A sample of what? The PDF table or the Excel sheet?

A sample PDF.

Hey @ashley11

ExtractTableValues.pdf (9.0 KB)

Here you go!

@Spacecats7 What data are you expecting to get from the PDF?

Hi @Spacecats7,

Please take a look at the workflow below, which extracts data from a PDF and stores it in an Excel file.

Thanks,
Mohanraj.S

Bookings.zip (3.2 MB)

Hello @Spacecats7
It was a busy day, so I couldn’t reply sooner. I have put together a XAML workflow that I hope will solve your problem.
Regex_Table.zip (22.5 KB)
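
For anyone who cannot open the attachment, the general idea of "regex over the extracted text, then write rows out" can be sketched outside UiPath in a few lines of Python. This is only a minimal sketch and not the contents of the XAML: the sample text, the column names (Item, Qty, Price) and the helper names `extract_rows` and `rows_to_csv` are all made up for illustration, since the real table layout is not shown in this thread. It assumes each row comes out of the PDF as one line with columns separated by spaces, and it writes CSV text, which Excel can open.

```python
import csv
import io
import re

# Hypothetical text, as it might look after a PDF text-extraction step.
# The real table's columns are not shown in the thread; these are invented.
SAMPLE_TEXT = """Platform: Xbox
Item        Qty   Price
Controller  2     59.99
Headset     1     89.50
Total 209.48
"""

# One data row: a name, then 2+ spaces/tabs, an integer quantity, a decimal price.
# [ \t] is used instead of \s so that a match can never run across a line break.
ROW_PATTERN = re.compile(
    r"^(?P<item>.+?)[ \t]{2,}(?P<qty>\d+)[ \t]+(?P<price>\d+\.\d{2})[ \t]*$",
    re.MULTILINE,
)


def extract_rows(text):
    """Return (item, qty, price) tuples for every line that looks like a data row."""
    return [
        (m["item"].strip(), int(m["qty"]), float(m["price"]))
        for m in ROW_PATTERN.finditer(text)
    ]


def rows_to_csv(rows):
    """Render the rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Item", "Qty", "Price"])
    writer.writerows(rows)
    return buffer.getvalue()


rows = extract_rows(SAMPLE_TEXT)
print(rows_to_csv(rows))
```

The header line, the "Platform" line and the "Total" line are skipped simply because they do not fit the row pattern. If your rows have a different shape, the pattern is the only part that should need to change.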

If you run into any problems, feel free to ask.

Regards
Ashley Nihal Dcunha

Thanks for the help! Very thorough. Fortunately, data scraping works perfectly in this case too!
