No valid data to run this pipeline

Hi,
I am trying to train the AI Fabric EnglishTextClassification model with the Train Pipeline, but after some time it keeps failing with “No valid data to run this pipeline.” (error details attached). I’m uploading a CSV as the dataset (attached: files.zip, 1.7 KB). I’m not exactly sure of the file format, but I followed a video on YouTube. Please look into it and get back to me. Also, please let me know where I can find details such as the data format; I searched for them but didn’t find anything.

Hi @kousaha1
If you look at the full logs (go to the pipeline page, click the three-dots button on the grid, then Details, then download the full logs), you will see this line in the log file:

2020-07-30 13:10:39,324 - aiflib.data_manager:info:15 - INFO: Provided data does not have enough training examples to train, you must provide at least 5 training examples of each class

In your dataset, each class has only 3 data points.
The full documentation of the model is here: English Text Classification. I see that this requirement is missing from it, though; we’ll add it.
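
For anyone who wants to check this before uploading, here is a minimal sketch that counts the rows per class in a CSV and lists the classes that fall below the threshold. The threshold of 5 comes from the log line above; the column name `target` is assumed to be the pipeline default, and the sample data is made up for illustration.

```python
import csv
import io
from collections import Counter

MIN_EXAMPLES_PER_CLASS = 5  # threshold taken from the log message in this thread


def underrepresented_classes(csv_file, target_column="target",
                             minimum=MIN_EXAMPLES_PER_CLASS):
    """Return {class_label: count} for every class with fewer than `minimum` rows."""
    reader = csv.DictReader(csv_file)
    counts = Counter(row[target_column] for row in reader)
    return {label: n for label, n in counts.items() if n < minimum}


# Hypothetical sample: "spam" has 5 rows, "ham" has only 3.
sample = io.StringIO(
    "input,target\n"
    + "".join(f"spam text {i},spam\n" for i in range(5))
    + "".join(f"ham text {i},ham\n" for i in range(3))
)
print(underrepresented_classes(sample))  # {'ham': 3}
```

To run it on a real file, pass an open file object instead of the sample, for example `open("train.csv", newline="", encoding="utf-8")` (the file name is only an example).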

Thanks,
Jeremy


Thanks, Jeremy. With this log message it was easy to figure out the issue, but I didn’t expect to have to check logs anywhere other than the ML logs. Anyway, thanks for sharing the documentation.

Can you please let me know what the solution is? I am getting the same sort of error.
Please refer to this post for details.

Can you put the target column first and then the input column? That might be the reason. If changing the column order does not help, check the full logs at the location Jeremy described to get more details about the error.
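
A quick way to rule out header problems before uploading is to check the column names locally. This is only a sketch: it assumes the default column names `input` and `target` (the names the pipeline's own error message in this thread mentions as defaults), and it checks names only, not column order.

```python
import csv
import io


def missing_columns(csv_file, required=("input", "target")):
    """Return the required column names that are absent from the CSV header."""
    header = next(csv.reader(csv_file), [])
    present = {name.strip() for name in header}
    return [name for name in required if name not in present]


# Hypothetical headers for illustration.
print(missing_columns(io.StringIO("type,text\nspam,Win a prize\n")))     # ['input', 'target']
print(missing_columns(io.StringIO("target,input\nspam,Win a prize\n")))  # []
```

If the run uses custom `input_column` / `target_column` values, pass those names in `required` instead.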

Same error.
I am trying to do email classification (spam vs. non-spam), and I have used the English Text Classification package.
At first it showed “does not have name [input] in header '['type', 'text']', skipping this file. The csv file must contain a header with at least two columns. The column names are set by the <input_column> and <target_column> variables of this run. The default values are “input” and “target”. If the file contains other columns, they will be ignored”. I then renamed the columns as the error message describes, and now it shows this message:
2021-02-26 13:36:21,055 - aiflib.data_manager:info:15 - INFO: Failed to read csv [/data/dataset/sms_spam (1).csv] exception:
‘utf-8’ codec can’t decode byte 0xa3 in position 135: invalid start byte
2021-02-26 13:36:21,055 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.csv files in [/data/dataset]
2021-02-26 13:36:21,055 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.json files in [/data/dataset]
2021-02-26 13:36:21,055 - uipath_core.training_plugin:model_run:140 - ERROR: Training failed for pipeline type: TRAIN_ONLY, error: No valid data to run this pipeline.
2021-02-26 13:36:21,056 - uipath_core.trainer_run:main:81 - ERROR: Training Job failed, error: No valid data to run this pipeline.
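
The log shows that the CSV is not UTF-8 encoded, so one possible fix is to re-encode the file before uploading it. The sketch below assumes the file was saved as Windows-1252 (`cp1252`), where byte 0xa3 is the pound sign; that is a guess based on the byte value, not something the log confirms, and the file names are hypothetical.

```python
from pathlib import Path
import tempfile


def reencode_to_utf8(src, dst, source_encoding="cp1252"):
    """Decode `src` with `source_encoding` (an assumption) and write it to `dst` as UTF-8."""
    text = Path(src).read_bytes().decode(source_encoding)
    Path(dst).write_bytes(text.encode("utf-8"))


# Demo with a temporary file that contains the byte 0xa3.
with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "sms_spam_cp1252.csv"  # hypothetical file name
    dst = Path(tmp) / "sms_spam_utf8.csv"    # hypothetical file name
    src.write_bytes(b"input,target\nWin \xa3100 now,spam\n")
    reencode_to_utf8(src, dst)
    print(dst.read_bytes().decode("utf-8"))
```

If the converted text shows wrong characters, the source encoding is probably something else and `source_encoding` needs to be changed accordingly.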

Hi,

I am getting the same error while running the training pipeline. I used the out-of-the-box package → Language Analysis → English Text Classification.

2021-10-11 10:51:10,153 - uipath_core.trainer_run:main:66 - INFO: Starting training job…
2021-10-11 10:51:11,373 - transformers.file_utils::38 - INFO: PyTorch version 1.5.1+cu101 available.
2021-10-11 10:51:12,892 - matplotlib.font_manager:_rebuild:1423 - INFO: Generating new fontManager, this may take some time…
2021-10-11 10:51:15,269 - uipath_core.storage.azure_storage_client:download:96 - INFO: Dataset from bucket folder training-726b3831-f6b8-4f9a-a037-296e655d67eb/b3a99338-6417-40de-8b8c-049b841d1b9e/00553494-de4a-4ab1-8ad2-35c85076e145 with size 1 downloaded successfully
2021-10-11 10:51:15,269 - uipath_core.training_plugin:train_model:109 - INFO: Start model training…
2021-10-11 10:51:15,270 - uipath_core.training_plugin:initialize_model:103 - INFO: Start model initialization…
2021-10-11 10:51:15,271 - uipath_core.training_plugin:initialize_model:106 - INFO: Model initialized successfully
2021-10-11 10:51:15,271 - aiflib.data_manager:info:15 - INFO: Loading data from /data/dataset…
2021-10-11 10:51:15,276 - aiflib.data_manager:info:15 - INFO: Failed to read csv [/data/dataset/train.csv] exception:
‘utf-8’ codec can’t decode byte 0x96 in position 679: invalid start byte
2021-10-11 10:51:15,276 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.csv files in [/data/dataset]
2021-10-11 10:51:15,276 - aiflib.data_manager:info:15 - INFO: Unable to read any valid data from *.json files in [/data/dataset]
2021-10-11 10:51:15,276 - uipath_core.training_plugin:model_run:145 - ERROR: Training failed for pipeline type: TRAIN_ONLY, error: No valid data to run this pipeline.
2021-10-11 10:51:15,277 - uipath_core.trainer_run:main:81 - ERROR: Training Job failed, error: No valid data to run this pipeline.
Traceback (most recent call last):
  File "/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/trainer_run.py", line 76, in main
    wrapper.run()
  File "/microservice/training_wrapper.py", line 57, in run
    return self.training_plugin.model_run()
  File "/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py", line 146, in model_run
    raise e
  File "/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py", line 138, in model_run
    self.run_train_only()
  File "/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py", line 207, in run_train_only
    self.train_model(self.local_dataset_directory)
  File "/home/aifabric/.local/lib/python3.6/site-packages/uipath_core/training_plugin.py", line 111, in train_model
    self.model.train(directory)
  File "/microservice/train.py", line 11, in train
    self.model.train(directory)
  File "/microservice/aiflib/model.py", line 211, in train
    raise UiPathUsageException('No valid data to run this pipeline.')
aiflib.logger.UiPathUsageException: No valid data to run this pipeline.
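
The "can't decode byte 0x96 in position 679" line in this log means the CSV contains bytes that are not valid UTF-8. To see which text is responsible, a small sketch like the following reports the position of the first invalid byte together with the bytes around it. The sample row is made up; byte 0x96 is the en dash in Windows-1252, which suggests (but does not prove) that the file was saved in that encoding.

```python
def first_invalid_utf8(data: bytes, context: int = 20):
    """Return (position, offending byte, surrounding bytes), or None if `data` is valid UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        start = max(err.start - context, 0)
        return err.start, data[err.start], data[start:err.start + context]
    return None


# Hypothetical row containing a Windows-1252 en dash (0x96).
sample = b"input,target\nMeeting 10\x9612 tomorrow,ham\n"
print(first_invalid_utf8(sample))
```

For a real file, pass its raw bytes, for example `first_invalid_utf8(open("train.csv", "rb").read())`. Once the offending characters are known, the file can be re-saved as UTF-8 and uploaded again.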