Get data from unstructured PDF

Hi Guys,

I have the following.
We have a document with more than one container.
I need the information for these containers seperate.

CNTR NO. : BMOU4574924 PIN CODE: 1095671722
RETURN DEPOT : See our Website: Shipmentlink.com/nl (Cargo Tracking)
RELEASE EXPIRE : 20180629 23:59
RELEASE DEPOT : ROTTERDAM WORLD GATEWAY B.V.

So I need:
Container NO:
PIN CODE
RETURN DEPOT
RELEASE DEPOT

For each of the containers metioned in the PDF

Can you guys help me?
Evergreen Release.pdf (26.9 KB)
Please see attached doc

Thanks!

use sub string between two strings to get required dat

@marbroer

please check this file…
Main.xaml (8.3 KB)

Thanks!
But I receive an error.


Activity could not be loaded because of errors in the file.

Can you help me with this?

import
uipath.core.activities
system.activites.statement

They are imported,
But…

use this condition in switch activity
If(item.ToUpper.Contains(“CNTR NO”),“CNTR NO”,If(item.ToUpper.Contains(“RETURN DEPOT”),“RETURN DEPOT”,If(item.ToUpper.Contains(“RELEASE DEPOT”),“RELEASE DEPOT”,“Default”)))

@marbroer

1 Like

@marbroer

problem solved ?

This works perfect!
Thank you!

@marbroer
Great…
I have included codes which can extract exact information out of string…i can post you want …those are just simple split and substring operation…

Please close topic my marking most useful reply as solution so that other member can find answers easily…

Regards,
Aditya☺️

Just out of curiosity, why did nobody make a solution using Regular Expression?
I’m pretty sure that it would be a nicer solution - might give it a go later and upload :slight_smile:

O Yes, please Post and I will mark your post as Most Useful

Hi Guys,

Me again!

For this part:
CNTR NO. : BMOU4574924 PIN CODE: 1095671722
I need a regex which only extracts the data behind PIN CODE:

Can you help me?

@marbroer

this should work…
"Pin code “+item.Split(”:"c)(2)

Regards,
Aditya

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.