How to Extract Data from PDF?

Hi Everyone,
I have multiple PDF’s in path i want to extract Invoice date from each pdf while it is iterating in for each look.how could i do that. which one is better to extract data Browsers(Edge,IE,etc) or Acrobat Reader.
Help me
Thanks & Regards
krishnareddy

1 Like

Fine
–use a assign activity and get the file path of all the pdf in a folder like this
out_filepatharray = Directory.GetFiles(“yourfolderpath”,"*.pdf")
where out_filepatharray is a variable of type string array
–use a for each loop and pass the above variable as input and change the type argument as string in the for each loop activitiy property
–if the pdf is like we can access each element as a individual element then we can use get text activity or Read PDF activity
–so if we are able to access the elements as individual elements then inside the for each loop use a activity Read PDF activity, pass the file path as item (the variable from for each loop which has the file path ) and get all the text from the pdf and extract the text we want using string manipulation
–if not we can get the required text by using a activity called Start process inside the for each loop and mention the file path as item in start process activity
–next use a activity called GET TEXT activity with which select the text as a element that we want to extract and get the output with a variable of type string
–or even if not we can use anchor base activity where select the terms like InvoiceDate with find element or find image activity as anchor in the left side of the activity and in the right side use a activity called GET TEXT activity or Screen Scrapping activity (if image) to get the output
or
if the pdf is a native pdf we can use anchor base activity with Find image activity on left side with which scrape the term Invoice Date as a image and in the right side use screen scrapping method to extract the text we want (screen scrapping is in the design tab)

Hope this would help you buddy
Cheers @krishnareddy

1 Like

@Palaniyappan Thank You…

Cheers buddy
@krishnareddy

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.