Can't train ML Package with manually uploaded files

I tried training an ML Package in AI Fabric without using the dataset created in “Train Machine Learning Classifier”, so I manually uploaded the documents as .pdf and .jpg files into a dataset I created. But when I try to run a Full Pipeline on these files, I get the following error:

  File "/opt/conda/lib/python3.7/multiprocessing/pool.py", line 121, in worker
    result = (True, func(*args, **kwds))
  File "/microservice/classification/text/preprocess.py", line 30, in _get_words_from_text_file
    parsed_text = file.read()
  File "/opt/conda/lib/python3.7/codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x89 in position 0: invalid start byte
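
If it helps, my understanding of the trace is that the text preprocessing step (_get_words_from_text_file) reads each dataset file as UTF-8 text, so binary documents like .pdf or .jpg cannot be decoded. The snippet below is only my attempt to reproduce the same exception outside the pipeline; the 0x89 byte is taken from the traceback, the remaining bytes are made up for illustration:

# Decode a few bytes of binary data as UTF-8 text, which seems to be
# roughly what the preprocessing step attempts on each uploaded file.
binary_header = bytes([0x89, 0x50, 0x4E, 0x47])  # not valid UTF-8 text

try:
    binary_header.decode("utf-8")
except UnicodeDecodeError as exc:
    # prints: 'utf-8' codec can't decode byte 0x89 in position 0: invalid start byte
    print(exc)

So it looks to me like the pipeline expects plain-text files rather than the raw .pdf/.jpg documents I uploaded.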

How can I solve this?

Hello @pedro.cavadas!

It seems that you are having trouble getting an answer to your question within the first 24 hours.
Let us give you a few hints and helpful links.

First, make sure you have browsed through our Forum FAQ Beginner’s Guide. It explains what should be included in your topic.

You can check out some of our resources directly, see below:

  1. Always search first. It is the best way to quickly find your answer. Use the search icon for that.
    Clicking the options button will let you set more specific topic search filters, e.g. showing only topics that already have a solution.

  2. A topic containing the most common solutions, with example project files, can be found here.

  3. Read our official documentation, where you can find detailed information and instructions about each of our products.

  4. Watch the videos on our official YouTube channel for more visual tutorials.

  5. Meet us and our users on our Community Slack and ask your question there.

Hopefully this will let you easily find the solution/information you need. Once you have it, we would be happy if you could share your findings here and mark it as a solution. This will help other users find it in the future.

Thank you for helping us build our UiPath Community!

Cheers from your friendly
Forum_Staff