2021-06-03 14:20:01,383 - uipath_core.trainer_run:main:66 - INFO: Starting training job... 2021-06-03 14:20:04,792 - uipath_core.storage.azure_storage_client:download:95 - INFO: Dataset from bucket folder training-b18b9310-cc2f-4e7a-bfd9-2c73ed291971/f68d24f3-0bf8-4d1a-965b-a1229960be6f/c600ff0f-4cb3-41c6-b7be-e392ba99676d with size 19 downloaded successfully 2021-06-03 14:20:04,792 - uipath_core.training_plugin:train_model:104 - INFO: Start model training... 2021-06-03 14:20:04,792 - uipath_core.training_plugin:initialize_model:98 - INFO: Start model initialization... 2021-06-03 14:20:04,794 - root:_valid_doctype_folder_structure:63 - ERROR: images/ directory does not exist / is empty for {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} dataset 2021-06-03 14:20:04,794 - uipath_core.training_plugin:model_run:140 - ERROR: Training failed for pipeline type: TRAIN_ONLY, error: Document type {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} not valid, check that document type data is in dataset folder and follows folder structure. 2021-06-03 14:20:04,797 - uipath_core.trainer_run:main:81 - ERROR: Training Job failed, error: Document type {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} not valid, check that document type data is in dataset folder and follows folder structure. Traceback (most recent call last): File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/trainer_run.py", line 76, in main wrapper.run() File "/microservice/training_wrapper.py", line 57, in run return self.training_plugin.model_run() File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 141, in model_run raise e File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 133, in model_run self.run_train_only() File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 202, in run_train_only self.train_model(self.local_dataset_directory) File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 106, in train_model self.model.train(directory) File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 94, in model self.initialize_model() File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 100, in initialize_model self._model = train.Main() File "/microservice/train.py", line 23, in __init__ self.opt = self.get_options() File "/microservice/train.py", line 96, in get_options opt = preprocess.configure_options(opt) File "", line 98, in configure_options Exception: Document type {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} not valid, check that document type data is in dataset folder and follows folder structure. 2021-06-03 14:20:12,552 - uipath_core.trainer_run:main:66 - INFO: Starting training job... 2021-06-03 14:20:15,569 - uipath_core.logs.upload_log_service:upload_logs_file:56 - INFO: Retry Training Triggered: 2021-06-03 14:20:15,889 - uipath_core.storage.azure_storage_client:download:95 - INFO: Dataset from bucket folder training-b18b9310-cc2f-4e7a-bfd9-2c73ed291971/f68d24f3-0bf8-4d1a-965b-a1229960be6f/c600ff0f-4cb3-41c6-b7be-e392ba99676d with size 19 downloaded successfully 2021-06-03 14:20:15,889 - uipath_core.training_plugin:train_model:104 - INFO: Start model training... 2021-06-03 14:20:15,890 - uipath_core.training_plugin:initialize_model:98 - INFO: Start model initialization... 2021-06-03 14:20:15,891 - root:_valid_doctype_folder_structure:63 - ERROR: images/ directory does not exist / is empty for {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} dataset 2021-06-03 14:20:15,891 - uipath_core.training_plugin:model_run:140 - ERROR: Training failed for pipeline type: TRAIN_ONLY, error: Document type {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} not valid, check that document type data is in dataset folder and follows folder structure. 2021-06-03 14:20:15,892 - uipath_core.trainer_run:main:81 - ERROR: Training Job failed, error: Document type {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} not valid, check that document type data is in dataset folder and follows folder structure. Traceback (most recent call last): File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/trainer_run.py", line 76, in main wrapper.run() File "/microservice/training_wrapper.py", line 57, in run return self.training_plugin.model_run() File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 141, in model_run raise e File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 133, in model_run self.run_train_only() File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 202, in run_train_only self.train_model(self.local_dataset_directory) File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 106, in train_model self.model.train(directory) File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 94, in model self.initialize_model() File "/home/aifabric/.local/lib/python3.8/site-packages/uipath_core/training_plugin.py", line 100, in initialize_model self._model = train.Main() File "/microservice/train.py", line 23, in __init__ self.opt = self.get_options() File "/microservice/train.py", line 96, in get_options opt = preprocess.configure_options(opt) File "", line 98, in configure_options Exception: Document type {'name': 'invoices', 'folder': '', 'language': 'en', 'dataset': {'account_name': None, 'folder': '', 'path': '/microservice/dataset', 'dataloader_workers': 0, 'vocabulary_padding_id': 0, 'vocabulary_unknown_id': 1, 'text_pp_remove_symbols': False, 'text_pp_lemmatization': False, 'text_pp_remove_stop_words': False, 'word_embedding': 'unknown_id', 'max_words': 10000, 'max_image_size': [300, 300], 'date_format_classifier_data': ['receipts', 'invoices', 'invoices_au', 'invoices_india', 'utility_bills', 'purchase_orders', 'invoices_japan', 'unknown'], 'replace_patterns': ['date', 'number', 'checkbox'], 'doctype2id': {}, 'id2doctype': {}}, 'path': '/microservice/dataset/', 'split': '/microservice/dataset/split.csv', 'schema': '/microservice/dataset/schema.json'} not valid, check that document type data is in dataset folder and follows folder structure.