Extract data between characters from PDF

Alex_Alloin · January 8, 2020, 8:26am

I am trying to extract some specific data from a PDF file using UiPath.

The data in the PDF looks like this:

**** hjeprj3 **** James Fish **** JDR0929879 **** jdloin2 B5339 ||

I need to extract the name “James Fish”. The line will always have the same format: **** text **** first name last name **** text. I will always have to extract the name, which will be different each time.

I used this function: PDFText.Substring(PDFText.IndexOf("**** ",0)+15,16), but as the name will always be different, it doesn’t return the correct value.

Can you kindly advise?

Thank you for your help

ArunVelaayudhanG · January 8, 2020, 9:48am

Hi @Alex_Alloin,

Welcome to the UiPath community.

Try this and let me know if this works:
PDFText.Substring(PDFText.IndexOf("****",PDFText.IndexOf("****")+1),3,PDFText.IndexOf("****",PDFText.IndexOf("****")+2))

Here, I am trying to get the substring between the second and third occurrences of ****.
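
For readers following along, the idea of taking the text between the second and third delimiter can be sketched outside UiPath. The snippet below is only an illustration in Python, not the expression from this post. The helper name between_nth_delimiters and its parameters are made up for the example, and it assumes the PDF text looks exactly like the sample line in the question.

```python
def between_nth_delimiters(text, delim="****", start_n=2):
    """Return the trimmed text between occurrence start_n and start_n + 1 of delim."""
    pos = -len(delim)
    # Walk forward to the start_n-th occurrence of the delimiter.
    for _ in range(start_n):
        pos = text.find(delim, pos + len(delim))
        if pos == -1:
            raise ValueError("delimiter occurs fewer times than expected")
    begin = pos + len(delim)
    # The next occurrence closes the field we want.
    end = text.find(delim, begin)
    if end == -1:
        raise ValueError("no closing delimiter found")
    return text[begin:end].strip()


# Sample line taken from the question.
sample = "**** hjeprj3 **** James Fish **** JDR0929879 **** jdloin2 B5339 ||"
name = between_nth_delimiters(sample)
print(name)  # James Fish
```

Because the end position is found by searching rather than by a fixed length, the result does not depend on how long the name is.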

Thanks,
Arun

Alex_Alloin · January 8, 2020, 9:53am

Hello @ArunVelaayudhanG,

I copied your function and here is the error I got:

Error ERROR Compiler error(s) encountered processing expression “PDFText.Substring(PDFText.IndexOf(“”,PDFText.IndexOf(“”)+1),3,PDFText.IndexOf(“”,PDFText.IndexOf(“”)+2))”.
Overload resolution failed because no accessible ‘Substring’ accepts this number of arguments. Main.xaml

Thank you for your help.

AshwinS2 · January 8, 2020, 10:02am

Hi @Alex_Alloin

You can get the value by

PDFText.Split("****".ToCharArray)(1).ToString

Try this

Thanks
Ashwin S

Alex_Alloin · January 8, 2020, 11:03am

Hello,
No error on compilation this time.
But it seems that there is a space after the “****”, so it returns “”.
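
A possible explanation for the empty result, offered as a guess rather than something confirmed in this thread: "****".ToCharArray hands Split an array of single '*' characters, so every asterisk acts as its own separator and the pieces between adjacent asterisks are empty strings. The Python sketch below mimics both behaviours on the sample line from the question. The variable names are illustrative only.

```python
import re

# Sample line taken from the question.
sample = "**** hjeprj3 **** James Fish **** JDR0929879 **** jdloin2 B5339 ||"

# Splitting on each '*' character, which is what a char-array separator does:
# the first four pieces are empty because the asterisks sit next to each other.
per_char = re.split(r"\*", sample)
print(repr(per_char[1]))  # ''

# Splitting on the whole four-asterisk delimiter instead:
# index 0 is the empty text before the first delimiter, index 1 is the first
# field, and index 2 is the name.
whole = sample.split("****")
name = whole[2].strip()
print(name)  # James Fish
```

In VB.NET the whole-delimiter split would presumably be PDFText.Split({"****"}, StringSplitOptions.None)(2).Trim, although that expression was not tried in this thread.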

Alex_Alloin · January 8, 2020, 12:12pm

Hello, It is working with this :

Strings.Trim(Strings.Split(PDFText.Substring(PDFText.IndexOf("**** ",0)+15,30),"*")(0))

Thank you for your help
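
The expression above uses a fixed offset of 15 and a 30-character window, so it depends on the length of the first field and on the name fitting in that window. A pattern-based approach avoids both assumptions. The following is a minimal Python sketch, assuming the first field never contains spaces and the line always follows the **** text **** name **** layout described in the question. NAME_PATTERN and extract_name are hypothetical names chosen for the example.

```python
import re

# Four asterisks, the first field (no spaces), four asterisks,
# then the name captured lazily up to the next four asterisks.
NAME_PATTERN = re.compile(r"\*{4}\s*\S+\s*\*{4}\s*(.+?)\s*\*{4}")


def extract_name(text):
    """Return the name field, or None when the layout is not found."""
    match = NAME_PATTERN.search(text)
    return match.group(1) if match else None


# Sample line taken from the question.
sample = "**** hjeprj3 **** James Fish **** JDR0929879 **** jdloin2 B5339 ||"
print(extract_name(sample))  # James Fish
```

The same pattern should carry over to a regex call in a UiPath expression, but that has not been verified here.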
