Issue with PDF data extraction

Hi,

I am testing the PDF data extraction, but I am only able to retrieve data from 1 PDF instead of from all the PDFs in a specific folder. Please see the attached workflow. I didn’t encounter any error. DtOutput.xlsx is the end output.
I would appreciate it if an expert could help. Thank you.

Regards,
Peter G.

BlankProcess_PDF02.zip (620.9 KB)

Hi,

I have modified your solution; it is now able to write all the data.

BlankProcess_PDF02.zip (625.6 KB)

Hi,

But once I change the path, an error appears in the Read PDF Text activity.

image

Hover over the red icon, what is the error?

image

Can you try changing file to file.ToString?

Alternatively, in your For Each activity, change the TypeArgument from Object to String.

image

Hi,

I changed it to file.ToString and it worked perfectly.
I appreciate your help.

Thank you.

Regards,
Andros


Hi,

Sorry, I noticed one error. The FROM field in the PDF file is AHSENG COMPANY, but the Excel result shows only AHSENG; the word COMPANY is missing. Is this due to the space between the two words? How can I resolve this? And assuming the name is “AHSENG COMPANY LTD”, with 2 spaces in between, how should I handle it?

I would appreciate your advice. Thank you.

Regards,
Andros

Hi,

If the PDF format is fixed and the word INVOICE always follows the company name, use:

(?<=FROM\s).*(?=\sINVOICE)

Or, if every company name ends with the word COMPANY, use this one:

(?<=FROM\s).*(?<=\sCOMPANY)
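To see how the two patterns above behave, here is a minimal sketch in Python rather than in the UiPath workflow itself. The sample strings are assumptions, since the actual text extracted from the PDFs is not shown in this thread; the lookbehind and lookahead constructs used here are also supported by the .NET regex engine.

```python
import re

# Assumed sample text; the real output of Read PDF Text may be laid out differently.
SAMPLES = [
    "FROM AHSENG COMPANY INVOICE 1001",
    "FROM AHSENG COMPANY LTD INVOICE 1002",
]

# Pattern 1: everything after "FROM " up to (not including) " INVOICE".
BEFORE_INVOICE = re.compile(r"(?<=FROM\s).*(?=\sINVOICE)")
# Pattern 2: everything after "FROM " up to and including " COMPANY".
THROUGH_COMPANY = re.compile(r"(?<=FROM\s).*(?<=\sCOMPANY)")


def extract(pattern, text):
    """Return the first match of pattern in text, or None if there is none."""
    match = pattern.search(text)
    return match.group(0) if match else None


results = [(extract(BEFORE_INVOICE, s), extract(THROUGH_COMPANY, s)) for s in SAMPLES]
for row in results:
    print(row)
# ('AHSENG COMPANY', 'AHSENG COMPANY')
# ('AHSENG COMPANY LTD', 'AHSENG COMPANY')
```

On these assumed samples the second pattern stops at COMPANY and drops LTD, so the first pattern is probably the safer choice when the number of words in the name varies, as long as INVOICE really does follow the name in every PDF.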

Hi,

Sorry, I am not sure what you mean.
What should I change in the regex below for the FROM field?

image

Yes, please replace the regex for the FROM field in the data table with whichever pattern most closely matches your PDF files.

But when I changed the FROM regex to the one below, the output shows only 1 PDF line.

image
image

Hi,

I don't know why you are getting the output in this manner. I just ran the workflow and I am able to get the expected output.
I am reattaching the workflow; please go through it once.
BlankProcess_PDF02.zip (625.5 KB)

Hi,

The statement below adds the word COMPANY after the characters of the FROM field.
image

But what I mean is that in one of the PDF files, the FROM field is: AHSHENG COMPANY.

With the initial config, only the word AHSHENG appears in the output; the second word, COMPANY, is not shown.

Thanks.
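For anyone hitting the same symptom, one possible cause is a FROM pattern that only matches non-space characters. The original regex is only visible in the screenshots, so the single-word pattern below is purely an assumption, as is the sample text. This Python sketch compares it with a pattern that runs up to the next label:

```python
import re

# Assumed sample text, not taken from the actual PDF.
text = "FROM AHSHENG COMPANY INVOICE 1001"

# Hypothetical original pattern: \S+ stops at the first whitespace character.
single_word = re.search(r"(?<=FROM\s)\S+", text)

# Lazy match up to the next label, so embedded spaces are kept.
up_to_label = re.search(r"(?<=FROM\s).*?(?=\sINVOICE)", text)

print(single_word.group())   # AHSHENG
print(up_to_label.group())   # AHSHENG COMPANY
```

If the real pattern uses `\w+` or `\S+` after FROM, replacing it with a match bounded by the next fixed label should keep every word of the name, however many spaces it contains.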