Problem with data scraping from PDF file

Olegik_Super · February 7, 2018, 2:18pm

Hello!
I use activity “Get Text” and try to get text from specific element in pdf file, but element detecting not correctly.
I need to get only numbers without characters.

How can i do this?

Bettelej · February 7, 2018, 2:33pm

Hi

If the element you are trying to get contains both the letters and the numbers, then an easy workaround would be the following.

If the element always contains three letters you can get the substring.
yourVariable = NHH 7717027908
yourSubstring = yourVariable.toString.SubString(4, 10)

If there is always letters but not always the same amount, you can get the index og the first space, and do the substring from there.
Again if the amount of numbers are not always the same, you can get the total length of the String, and end it there.

Olegik_Super · February 7, 2018, 2:41pm

thanks!
And if this Element will be dynamic (count of symbols more 10 or less 10) ?
what then needs to be done?

arivu96 · February 7, 2018, 2:50pm

Hi @Olegik_Super,

yourVariable = NHH 7717027908
yourSubstring = yourVariable.toString.Replace("NHH","")

Else you can use regex expression also to get the numbers alone

System.Text.RegularExpressions.Regex.Replace(yourVariable ,"([^0-9])",string.Empty)

Regards,
Arivu

Bettelej · February 7, 2018, 3:28pm

I see you got a good answer with regex, but if you wanted to use substring later on I’ll answer your question anyway.
If it’s dynamic you can say substring from the index of the first space and then you’ll need to know how long the whole string is, which is theString.toString.count as I recall. So the length of the substring would be the length minus the index of the first space.

Topic		Replies	Views
Extract number from pdf Help	7	4961	June 25, 2018
Extract number from pdf without elements to click on Help	9	1612	November 19, 2018
PDF data extraction using get text is not working Studio uiautomation	8	775	December 5, 2022
Pdf extraction-single data Off-Topic Discussions studio	4	1179	July 22, 2019
Get Text : How to indicate particular element from pdf file? Activities pdf , activities , question	16	1108	October 11, 2022

Problem with data scraping from PDF file

Related topics