Splitting by delimiter in Datatable for each row

ktanli · September 12, 2020, 6:27pm

Hi, I was wondering how to split for each row in a datatable. For example, a column of my datatable looks as follows: . I would like to know how to split them by space without having to read the excel range (if that is possible).

The process I took was as follows:

I made a datatable using regex to find information in a PDF file.
The second column consists of the regex information (which happens to look like the image above)
I read that onto an excel sheet.

So my question is in a sense, is like how to do text to column in excel, but the datatable version.

Thank you!

AkshaySandhu · September 12, 2020, 9:17pm

Hello @ktanli
have you tried Generate DataTable activity…?

ktanli · September 12, 2020, 9:20pm

I have, it did not work out so well. I cannot send my workflow, but my data table looks something like this: There are also null values in the second column in some rows. Generate Datatable does not seem to like the null values.

AkshaySandhu · September 12, 2020, 9:23pm

can you give some sample text or attach text file here

If I understood your requirement correctly, you need datatable something like this

ktanli · September 12, 2020, 9:31pm

Yeah! Something like that! It’s like if it is on Excel you would do text to column with the space as delimiter. So in the end it should look like:

ktanli · September 12, 2020, 9:33pm

I am not sure about attaching a text file because the way I did it was I had to use RegEx in order to find the second column values (hence why they are null values because if the bot cannot find a value in Column 1, it would leave the corresponding Column 2 cell blank).

AkshaySandhu · September 12, 2020, 9:42pm

actually I dont know the exact use case of Regex here,
but when I tried with sample data like this (there is no space after “B”)

Generate DataTable activity gave me result like this

issue could be related the regex (I am not sure though)

AkshaySandhu · September 12, 2020, 9:49pm

could you try to dump dummy PDF text here. Text value before using any regex

ktanli · September 12, 2020, 9:58pm

I cannot attach the PDF because it is for work. But let me give you a rough idea what I had to do, when I read the pdf to a text file it looked something like this: . The idea around it was that I needed to separate the numbers from the "first column (A, C, Sample A, etc. The problem was that when it says Sample A it would split that into 2 columns. So I did regex that looks something like this: . This was an example from a video I followed along. Now what ended up happening was the image I attached above. The first column was one thing, and the numbers are part of another column.

Now, aside from the PDF extracted, there is actually a set list of variables in column 1. Some may not be in the PDF file itself so for example:

A
B
C
Sample A
Sample B
Sample C
Sample D
E
F

Regex could not find “Sample B” and “E”, so hence it leaves it blank on the data table.

ktanli · September 12, 2020, 10:01pm

So in short, the use of Regex was to separate the first column, and the values that were put in the second column. I just need to figure out a way how to delimit the second column by space.
I was thinking something along the lines of: For every row in Column2, split the string… or something like that.

AkshaySandhu · September 12, 2020, 10:07pm

can you confirm one thing.
in column0 i.e. A,B,C Sample A etc.
all the values will be alphabets or they could be alpha-numeric

ktanli · September 12, 2020, 10:08pm

Yes. When you mean alpha numeric do you mean: 1A?

AkshaySandhu · September 12, 2020, 10:09pm

yes…

ktanli · September 12, 2020, 10:10pm

Yes, then some values in Column0 are alpha numeric but it always ends with a letter.

AkshaySandhu · September 12, 2020, 10:11pm

ok then give me some time…

ktanli · September 12, 2020, 10:12pm

Got it! Thank you!

ktanli · September 12, 2020, 10:29pm

Also, I managed to get some of it, but as soon as it hits a blank row it does not want to continue.
This was what I did:

. So, it quite works but not when the row is blank, is there a way to go about this?

AkshaySandhu · September 12, 2020, 10:31pm

try with this xaml
Main.xaml (6.8 KB)

OfcUrlCategory.txt (118 Bytes) [sample data that I used]
Note: this will replace " " [space] with “_” [underscore] in first column

ktanli · September 12, 2020, 10:48pm

It worked out! Thank you!

Also just to clarify a few things, I am not very good a RegEx:

→ what does {0,10} mean?.

And basically, find any values in the text file that has a letter in the front (and not all numbers), then put an underscore.

AkshaySandhu · September 12, 2020, 10:53pm

that pattern is finding the part of string which starts alphanumeric and ends with alphabets only…
example:

you can refer below link for steps by step explaination

Topic		Replies	Views
Split specific data in string Activities excel , activities , question	15	1137	April 24, 2021
String split with no delimiter Studio string-manipulation	8	1161	February 14, 2023
Convert pfd to excel (use regex) Activities excel , pdf , activities , regex , question	14	1629	October 20, 2021
Delimiter text using regex Help	25	2771	November 11, 2019
Datatable row data to split in other column if found space Help uiautomation , activities	10	3290	July 25, 2019

Most Active Users - Yesterday
Anil_G
ashokkarale
prashant1603765
sonaliaggarwal47
Juan_Pablo_Ortiz
yedukondaluaregala
Rahul_Rajendran
sharazkm32
gorby
V_Roboto_V
More details...

Splitting by delimiter in Datatable for each row

Related topics