Read specific data from the PDF and write to CSV file

Hi all… I need to read specific data from the PDF (highlighted in yellow) and write it to a CSV file. Kindly help.

Read PDF:

Write CSV

Attaching the PDF test11.pdf (227.8 KB) for reference

If the PDF format remains the same, we can do it.

@Pradeep_Shiv Yes, the PDF format remains the same

Are you from Emids?

What is the Employee ID format?

This will give you the Employee ID: [A-Za-z]+\d{3,4}
This will give you the name: (?<=\b(?:Mr|Ms|Mrs)\.\s).*
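
For anyone who wants to try these two patterns outside UiPath, here is a minimal Python sketch. The sample text and the names `sample`, `extract_employee_id` and `extract_name` are made up for illustration and are not taken from the attached PDF. Python's `re` module does not accept a lookbehind whose alternatives have different lengths, so this sketch picks up the name with a capturing group instead of the lookbehind used above.

```python
import re

# Hypothetical sample text; the real Read PDF Text output will look different.
sample = "Mr. John Smith\nEmployee ID: INEMP234"

# One or more ASCII letters followed by 3 or 4 digits.
EMPLOYEE_ID = re.compile(r"[A-Za-z]+\d{3,4}")

# A title (Mr., Ms. or Mrs.), whitespace, then the rest of the line.
NAME = re.compile(r"\b(?:Mr|Ms|Mrs)\.\s+(.*)")


def extract_employee_id(text):
    """Return the first ID-like token in text, or None if there is none."""
    match = EMPLOYEE_ID.search(text)
    return match.group(0) if match else None


def extract_name(text):
    """Return everything after the title on the same line, or None."""
    match = NAME.search(text)
    return match.group(1).strip() if match else None
```

Both helpers return `None` when nothing matches, so the caller can decide whether to skip the row or raise an error.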

@Pradeep_Shiv The Employee ID format is also the same as shown in the screenshot: “INEMP234*”

Do one thing: write that file out in text format and send it.

@Pradeep_Shiv Test.txt (1.7 KB)

Set Preserve Format to True in the Properties panel of the Read PDF Text activity, then send the file again.

@Pradeep_Shiv Can you elaborate on this, please?

This will give you the Employee ID: [A-Za-z]+\d{3,4}
This will give you the name: (?<=\b(?:Mr|Ms|Mrs)\.\s).*

In this format it's quite tough.

@Pradeep_Shiv Could you help me get only INEMP913 from this string?

“Ref: HR/Personal File/INEMP913/4-6 November 3, 2011”

Is the INEMP prefix fixed?
@veerishu

@Pradeep_Shiv YES


Click on the link; it's working, you can check.
INEMP\d{3,4}
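
A quick way to check this pattern against the string from the question, sketched in Python rather than in a UiPath activity. The variable names are only illustrative.

```python
import re

ref_line = "Ref: HR/Personal File/INEMP913/4-6 November 3, 2011"

# Fixed prefix INEMP followed by 3 or 4 digits.
match = re.search(r"INEMP\d{3,4}", ref_line)
employee_id = match.group(0) if match else None

print(employee_id)  # prints INEMP913
```

Because the prefix is fixed, the pattern cannot pick up other letter-and-digit runs elsewhere in the text.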

@Pradeep_Shiv : Thanks for your valuable time and help.

However, the below code helped me solve my problem.

StartIndex = InputData.IndexOf("WORKED HOURS")

EndIndex = InputData.IndexOf("SPECIAL PAYMENTS")

WorkedHours = InputData.Substring(StartIndex,EndIndex-StartIndex).Trim
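
The same idea as a Python sketch, with a check for a missing marker and a final step that renders the result as a CSV row, since that was the original goal. The function names, the sample text and the column headers are assumptions for illustration. Unlike the snippet above, this version skips past the start marker, so only the text between the two headings is returned.

```python
import csv
import io


def text_between(data, start_marker, end_marker):
    """Return the trimmed text between two markers, or None if either is missing."""
    start = data.find(start_marker)
    if start == -1:
        return None
    start += len(start_marker)  # skip past the start marker itself
    end = data.find(end_marker, start)
    if end == -1:
        return None
    return data[start:end].strip()


def to_csv(rows, header):
    """Render a header and rows as CSV text (built in memory here, not in a file)."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical text, standing in for what Read PDF Text returns.
input_data = "EMPLOYEE INEMP913\nWORKED HOURS 160\nSPECIAL PAYMENTS 0"

worked_hours = text_between(input_data, "WORKED HOURS", "SPECIAL PAYMENTS")
csv_text = to_csv([["INEMP913", worked_hours]], ["EmployeeID", "WorkedHours"])
print(csv_text)
```

To write an actual file instead of a string, open it with `open(path, "w", newline="")` and pass the file object to `csv.writer`.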

good
