How to Extract a particular Data from a pdf file?

Hi,
I need to extract only the data highlighted in the pdf .
Here, I am attached a SS . Please check and help me with a solution like how to do the same .
Here, below I need to extract only the date like " 30-JUN-2019" from invoice date info.
(Key - Value Pair) ; Need to extract only the value and not the key.

image
image

Thanks in advance :slight_smile:


1 Like

Hi @PDcoder
-Use Read pdf activity to read the pdf file
-Use Regex to get the particular data

Thanks

Kumar

1 Like

Hi @kumarD

Can you help me with an example of the same?

Thanks

Hi @PDcoder

you can refer to these link:

cheers :smiley:

Happy learning :smiley:

3 Likes

Yes @pattyricarte,
I have used scraping to extract data but with this I am able to extract the whole data from pdf.
Thanks.

1 Like

Hi @PDcoder

can you share your pdf and pinpoint what data you want to scrape. Many Thanks.

cheers :smiley:

Happy learning :smiley:

1 Like

Hey @pattyricarte,

I have already shared the key value pair above in the main topic. Please check for reference.
Thank You :smiley:

1 Like

Hi @PDcoder

So for my perception you get the value of image right? And the only value that you need to get is the date? am i correct :smiley:

cheers :smiley:

Happy learning :smiley:

1 Like

You can use regex : “(?<=Invoice Date:).*” and after that use Trim if you need to trim the values

2 Likes

Hey @PDcoder
-If you want the date from this type of string like-> (company date price), here before and after the date you have some data, so you can use regex here.
-Use regex activity (Matches) and the pattern is => (?<=company)(.*)(?=price)
and here the result you will get = date

This is a demo for you---->
Main.xaml (10.4 KB)

Try on this way

thanks

3 Likes

Thank You Everyone :smiley:

Hi @PDcoder

No worries !

cheers :smiley:

Happy learning :smiley:

2 Likes