How to read through two large .csv files in an efficient way?

I’m working on my first UiPath task. I have two large .csv files (about 5000 rows). Let’s call them DT1 and DT2. Both have the same ID number column, and DT1 also contains a start date and an end date. I need to find all rows in DT1 that have a certain end date, for example the current day. For each row found I have to check whether that ID also exists in DT2. If it doesn’t, I should write the row from DT1 to the output. Could you recommend how to do this in the most efficient way? So far I have managed to create a program that reads all the rows with a certain end date…but it is already quite slow. How can I avoid it getting even slower when I add the reading of the second file? I would appreciate answers with examples. Thanks!

Hi @satu.nieminen,
I would say that you could keep both files as DataTable variables and iterate through each table using some If statements.
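For example, something along these lines could go in an Invoke Code activity (a minimal VB.NET sketch, assuming DT1 and DT2 were already loaded with Read CSV into dt1 and dt2, that dtOut is an Out argument, and that the columns are called “ID” and “EndDate”; adjust the names to your files):

```vb
' Invoke Code (VB.NET) sketch. In arguments: dt1, dt2 (DataTable); Out argument: dtOut.
Dim targetDate As DateTime = DateTime.Today

' Collect every ID that exists in DT2 once, so the per-row check below is fast.
Dim dt2Ids As New HashSet(Of String)
For Each r2 As DataRow In dt2.Rows
    dt2Ids.Add(r2("ID").ToString.Trim)
Next

' Keep the DT1 rows whose end date matches and whose ID is missing from DT2.
dtOut = dt1.Clone()
For Each r1 As DataRow In dt1.Rows
    If DateTime.Parse(r1("EndDate").ToString).Date = targetDate _
       AndAlso Not dt2Ids.Contains(r1("ID").ToString.Trim) Then
        dtOut.ImportRow(r1)
    End If
Next
```

Building the HashSet first means each DT1 row is checked against DT2 in constant time, instead of scanning all of DT2 for every row, which is usually where the slowness comes from.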

So I should not use the Lookup method?

The Lookup method is fine if you want to do this inside Excel. You asked for an efficient way, so I assumed you want to do it in Studio.

Hi @satu.nieminen

Can you try a join query?
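For what it is worth, the join idea could be sketched in a single Assign like this (strictly a left anti-join, since you keep the DT1 rows that have no match in DT2). The column names “ID” and “EndDate” are assumptions, and CopyToDataTable throws if nothing matches, so guard it with an If in a real workflow:

```vb
' Value of an Assign activity (To: dtOut).
' AsEnumerable/CopyToDataTable come from System.Data.DataSetExtensions.
dt1.AsEnumerable() _
   .Where(Function(r1) DateTime.Parse(r1("EndDate").ToString).Date = DateTime.Today _
          AndAlso Not dt2.AsEnumerable() _
                        .Any(Function(r2) r2("ID").ToString.Trim = r1("ID").ToString.Trim)) _
   .CopyToDataTable()
```

With roughly 5000 rows per file the nested Any is still workable, but building a HashSet of the DT2 IDs first (as in the earlier sketch) scales better.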

Yes, in Studio @Pablito

@NIVED_NAMBIAR Could you give me a complete example? I’m a newbie with UiPath. :slight_smile: Is it an efficient way to handle large files?

Hi @satu.nieminen

Can you share a sample Excel file?

@NIVED_NAMBIAR I can’t share the originals but I can try to create similar files later today.

Please share it when you have prepared it.

Also, please share a screenshot of the output format you need.

@NIVED_NAMBIAR Yes I will.

You can use an online CSV editor and perform a join on the two large tables; I use a tool called acho studio for this. For your use case, it sounds like you want to do a “VLOOKUP” between the two tables, so you would probably have to write SQL queries for it.

Hello Satu,
For large CSV files I have this video:

Thanks,
Cristian

Hi @Cristian_Negulescu ,

I tried your way of reading a very large CSV file, but it doesn’t work correctly when the separator is a semicolon. I tried adding “delim=59” (delimit on semicolon) to the connection string, but it still only takes the first value in each row (i.e. everything before the first “;” character). If the separator is a comma, as in your case, it works fine.

Any ideas?

Hello dVni,
The only idea I have is to replace the characters outside, before the file reaches ODBC.

The correct swap would be like this, in my view:
Replace all “,” with “ ” (space)
Replace all “;” with “,”
Then ODBC should not have an issue.
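A minimal Invoke Code sketch of that pre-processing, assuming a simple file where these replacements are safe (the path is only an example):

```vb
' Swap the separators in the raw CSV text before the ODBC read.
Dim filePath As String = "C:\Temp\input.csv"
Dim text As String = System.IO.File.ReadAllText(filePath)
text = text.Replace(",", " ")   ' 1. replace every "," with a space
text = text.Replace(";", ",")   ' 2. replace every ";" with ","
System.IO.File.WriteAllText(filePath, text)
```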
Thanks,
Cristian