How can we solve this issue?
I convert pdf files into text files and then generate data table to get datable. But the generate data table output is not formatted as I want.
This is the generated data using default setting:
And this is the format I want:
I want the second column like the format : AYA/MDY/MCG/22000130 (sample customer code).
whatever there is three or four or five phase, I want to start the second column with the customer code. How can I fix the issue? If there is any solution or work around, please let me know. Thanks.
If possible, can you share your input text as a file?
In Generate datatable wizard, for column seperator use Tab, by default it will be space
Hi bro, please see the text file.
Page6.txt (1.1 KB)
the text files formats are the same but sometimes the generate data table activity works as the format I want but sometimes it separates the name into two or three columns.
How about the following sample?
- Remove whitespaces at the begining of each line
- replace 2 or more whitespaces to chr(9) (Tab)
Then, generate datatable from the above as tsv.
Sample20220818-5.zip (2.9 KB)
The solution works only when there is only one space between the name. If there is two or more spaces in the name, how we can fix this. please share with me. thanks.
Can you share specific sample data?
Please receive the text file.
Page114.txt (5.1 KB)
Thanks for your help. I really appreciate it.
I modified the pattern as the following.
And as there is line turn over in some rows, I add some logic for fix it. (outputs result2.xlsx)
Sample20220818-5v2.zip (4.4 KB)
Many thanks for your help. It’s working well and this is the solution I want.
How can I ignore the first line that is not formated as table format like below:
In this screenshot I want to ignore “LIMITED” and want to extract all of the following data. Is this possible to get the data? If there is any solution, please help me. I also uploaded the text file. Best Regard.
Page108.txt (4.9 KB)