No valid data to run this pipeline

kousaha1 · July 30, 2020, 11:19am

Hi,
I am trying to train the AI fabric EnglishTextClassification model with the Train Pipeline. But after sometime it kept on failing saying that “No valid data to run this pipeline.”(error details attached). I’m uploading a csv as dataset(attached it). I’m not exacly sure of the file format, but I’ve given it according to a video in youtube. Please look into it and revertfiles.zip (1.7 KB) . Also let me know where can I found these details like data format etc., as I searched for it but didn’t found anything.

Jeremy_Tederry · July 30, 2020, 3:34pm

Hi @kousaha1
If you look at the full logs (go to pipeline page, click three dots button on the grid, details, download full logs) you will see this line in your file:

2020-07-30 13:10:39,324 - aiflib.data_manager:info:15 - INFO: Provided data does not have enough training examples to train, you must provide at least 5 training examples of each class

On your dataset each class has only 3 data points.
Full documentation of model is there; English Text Classification although I see that this requirement is missing, we’ll add it.

Thanks,
Jeremy

kousaha1 · July 31, 2020, 10:08am

Thanks Jeremy. With this log message it was easy to figure out about the issue. But I didn’t expect to check for the logs in another place apart from ML logs. Anyway, thanks for sharing the documentation.

Harshit_Tanted1 · July 31, 2020, 1:40pm

Can you please let me know whats the solution.I am getting same sort of error.
Please refer this post for details

kousaha1 · August 1, 2020, 4:50am

Can you put target column first and then input. That might be the reason. Also check the log file location specified by Jeremy to get more details about the error, if changing the sequence of the columns does not help.

hari99 · February 26, 2021, 2:07pm

Same error.
I am trying to do email classification like spam and non-spam.
And I have used English text classification package.
First it showed “does not have name [input] in header’[‘type’, ‘text’]', skipping this file. The csv file must contain a header with at least two columns. The column names are set by the <input_column> and <target_column> variables of this run. The default values are “input” and “target”. If the file contains other columns, they will be ignored” then is changed the columns as the error message shows then it shows this message
“2021-02-26 13:36:21,055 - aiflib.data_manager:info:15 - INFO: Failed to read csv [/data/dataset/sms_spam (1).csv] exception:
‘utf-8’ codec can’t decode byte 0xa3 in position 135: invalid start byte
2021-02-26 13:36:21,055 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.csv files in [/data/dataset]
2021-02-26 13:36:21,055 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.json files in [/data/dataset]
2021-02-26 13:36:21,055 - uipath_core.training_plugin:model_run:140 - ERROR: Training failed for pipeline type: TRAIN_ONLY, error: No valid data to run this pipeline.
2021-02-26 13:36:21,056 - uipath_core.trainer_run:main:81 - ERROR: Training Job failed, error: No valid data to run this pipeline.”

kramachandran · October 11, 2021, 11:45am

Hi,

I am getting the same error while running the training pipeline. i used the package out of the box → Language Analysis - > English Text Classification

2021-10-11 10:51:10,153 - uipath_core.trainer_run:main:66 - INFO: Starting training job…
2021-10-11 10:51:11,373 - transformers.file_utils::38 - INFO: PyTorch version 1.5.1+cu101 available.
2021-10-11 10:51:12,892 - matplotlib.font_manager:_rebuild:1423 - INFO: Generating new fontManager, this may take some time…
2021-10-11 10:51:15,269 - uipath_core.storage.azure_storage_client:download:96 - INFO: Dataset from bucket folder training-726b3831-f6b8-4f9a-a037-296e655d67eb/b3a99338-6417-40de-8b8c-049b841d1b9e/00553494-de4a-4ab1-8ad2-35c85076e145 with size 1 downloaded successfully
2021-10-11 10:51:15,269 - uipath_core.training_plugin:train_model:109 - INFO: Start model training…
2021-10-11 10:51:15,270 - uipath_core.training_plugin:initialize_model:103 - INFO: Start model initialization…
2021-10-11 10:51:15,271 - uipath_core.training_plugin:initialize_model:106 - INFO: Model initialized successfully
2021-10-11 10:51:15,271 - aiflib.data_manager:info:15 - INFO: Loading data from /data/dataset…
2021-10-11 10:51:15,276 - aiflib.data_manager:info:15 - INFO: Failed to read csv [/data/dataset/train.csv] exception:
‘utf-8’ codec can’t decode byte 0x96 in position 679: invalid start byte
2021-10-11 10:51:15,276 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.csv files in [/data/dataset]
2021-10-11 10:51:15,276 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.json files in [/data/dataset]
2021-10-11 10:51:15,276 - uipath_core.training_plugin:model_run:145 - ERROR: Training failed for pipeline type: TRAIN_ONLY, error: No valid data to run this pipeline.
2021-10-11 10:51:15,277 - uipath_core.trainer_run:main:81 - ERROR: Training Job failed, error: No valid data to run this pipeline.
Traceback (most recent call last):
File “/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/trainer_run.py”, line 76, in main
wrapper.run()
File “/microservice/training_wrapper.py”, line 57, in run
return self.training_plugin.model_run()
File “/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py”, line 146, in model_run
raise e
File “/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py”, line 138, in model_run
self.run_train_only()
File “/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py”, line 207, in run_train_only
self.train_model(self.local_dataset_directory)
File “/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py”, line 111, in train_model
self.model.train(directory)
File “/microservice/train.py”, line 11, in train
self.model.train(directory)
File “/microservice/aiflib/model.py”, line 211, in train
raise UiPathUsageException(‘No valid data to run this pipeline.’)
aiflib.logger.UiPathUsageException: No valid data to run this pipeline.

Topic		Replies	Views
AI Fabric Pipeline Failed AI Center	2	1527	February 7, 2021
Can't run a pipeline: No valid data to run this pipeline AI Center question , ai_center	3	148	April 22, 2024
Pipeline failed due to ML Package Issue \| Using default EnglishTextClassification and CSV AI Center question , ai_center	1	2126	May 4, 2021
AI Center Pipeline Failed Knowledge Base ai_center , ai_fabric , ai-fabric , ai-center	0	33	January 3, 2025
English text classification dataset AI Center orchestrator , ai	20	2861	February 25, 2021

No valid data to run this pipeline

Related topics