Extracting text from a .txt document

sai1 · December 7, 2017, 9:52pm

I extracted the content of a PDF (a catalog containing item numbers and other descriptions) to a .txt file. I need to extract only the item numbers from the .txt file and search for each item number on a website. How can I extract only the item numbers from the .txt file?

ClaytonM · December 7, 2017, 10:14pm

Hi,

Are the item numbers consistently in the same spot on each line? If so, you can .Split the text, pull that column out of each row, and store the results in an array. If the position is not consistent, you can use Regex to pull the numbers and store those in an array instead.

Here are both examples to assign the list of numbers to an Array[string] variable:

text.Split(vblf(0)).Select(Function(row) row.ToString.Trim.Split({" "},System.StringSplitOptions.RemoveEmptyEntries)(0)).ToArray

text.Split(vblf(0)).Select(Function(row) System.Text.RegularExpressions.Regex.Match(row.ToString.Trim,"[0-9]{1,4}").Value).ToArray

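They might need some adjusting for your file, though: the first assumes the item number is the first space-separated value on each line, and the second takes the first run of one to four digits on each line.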
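
To check the logic against a few sample lines before building it into a workflow, here is a minimal Python sketch of what the two expressions do. It runs outside UiPath and is only an illustration: the sample text and the names first_column and first_digits are made up, and your catalog layout will differ.

```python
import re

# Hypothetical sample of the extracted catalog text (an assumption, not the real file).
text = "1001 Blue widget\n1002 Red widget\n 27 Small bolt\n"

def first_column(text):
    """Like the Split expression: first whitespace-separated value of each line.

    Blank lines are skipped here, which the one-line expression does not do.
    """
    return [line.split()[0] for line in text.split("\n") if line.strip()]

def first_digits(text):
    """Like the Regex expression: first run of 1 to 4 digits per line, '' if none."""
    result = []
    for line in text.split("\n"):
        match = re.search(r"[0-9]{1,4}", line.strip())
        result.append(match.group(0) if match else "")
    return result

print(first_column(text))
print(first_digits(text))
```

Two things to watch for: a blank line (such as the one after a trailing line feed) has no first column to index, so the sketch filters those out, and the regex version returns an empty string for any line without digits.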

Alternatively, you can run a For Each over text.Split(vblf(0)) with the argument type set to String, and use an Assign activity to replace each row with the number that's in that row.
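
The loop version can be sketched the same way. This Python snippet is an illustration under assumptions, not the workflow itself: extract_item_numbers and the sample text are hypothetical, and the one-to-four-digit limit is taken from the regex pattern above. Unlike the one-line expressions, it keeps only the rows that actually contain a number.

```python
import re

def extract_item_numbers(text):
    """Walk the text row by row and collect the first 1-4 digit run of each row."""
    numbers = []
    for row in text.splitlines():  # handles both \n and \r\n line endings
        match = re.search(r"[0-9]{1,4}", row)
        if match:  # rows without any digits are dropped
            numbers.append(match.group(0))
    return numbers

# Hypothetical sample with a blank row, a row without a number, and a Windows line ending.
sample = "Item 12 Hammer\n\nNo number here\n345 Nails\r\n"
print(extract_item_numbers(sample))
```

Collecting into a separate list keeps the result free of empty entries, which is convenient if each number is then fed into a website search.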

Hope this helps.

Regards.
