Is there any option to disable this limit? Reducing the number of epochs is not a good solution.
What kind of pipeline are you running that takes >7 days to complete?
IMO that’s just a waste of resources and you should check if you can reduce this time somehow.
I am trying to learn a new model and I cannot run the pipeline on the GPU. It reads around 200 data from one pdf, so one epoch takes a long time.