I tried uploading our source data in multiple chunks say i have 3 case_raw files with data. Now I need to upload all three files into single custom process app. All the 3 case file data should get appended in single case_raw file. When i checked uploading into Dev data of process app, the case file is getting overwritten with data. is it possible or is there any other way to get all 3 file data in single case_raw file (append with next file data and not getting overwritten).
This documentation helped us in merging multiple event logs. My use case scenario has multiple case raw files. In which each case files has 10000 records and 3 such case files have to be merged into single case raw file inside process mining custom app.
The same approach as described for event_logs can also be used for merging together multiple case_raw files. All files will have to get a different name, like Cases_raw1, Cases_raw2, cases_raw3.
The SQL for the Cases_raw file then would look like this.
-- The following code merges 3 input tables.
select * from {{ ref('Cases_input_1') }}
union all
select * from {{ ref('Cases_input_2') }}
union all
select * from {{ ref('Cases_input_3') }}
When we code as above - 3 case input files in main case_log_input. During compilation, it expects first case file to be compiled but that cannot be possible. facing issue as āmain case file depends on node named case_raw1, which was not foundā.
the input sql file on each case file is with different source tables respectively. the main case sql file is named as āFinal_Case_raw.sqlā with below code snippet:
select * from {{ ref(āCases_rawā) }}
union all
select * from {{ ref(āCases_raw2ā) }}
union all
select * from {{ ref(āCases_raw3ā) }}
Also, I have added the new case tables to the sources.yml file as below:
As we have signed SOW on this prospect, we might not able to share the entire data here. So I explained you the scenario with dummy data on the above reply.
Can you let me know what are the basic step checks for the above error. I also checked the sqls are correct with source tables. The data model doesnāt have cases_raw2, since it will not have primary key. Are there anything to investigate more.
Thanks for clarifying, and thatās the source of confusion. I think we should have been clearer when we tried to say that the files must be in the model.
The screenshot you are showing is part of the output data model and the raw filed need not be added there.
Do you have the four cases files defined as part of the Transformations>models>1_input?
Cases_raw.sql
Cases_raw2.sql
Cases_raw3.sql
Final_case_raw.sql
like the following screenshot?
The Final_case_raw.sql will be the combination of all the three files as you had written earlier but the individual Cases_rawX.sql needs to model the input from the source file.