Unexpected Error occurred while digitizing the documents

I am classifying large amount of pdf files from different folders. These folders have many subfolders which contain pdf files along with other file types such as images and excel files. While running the bot on such large amount of data the OMNI PAGE OCR gives the following error:

Digitize Document: An unexpected error has occurred

Any help in this regard will be highly appreciated

Thank You

Regards

Enthusiastic

Hello @enthusiastic!

It seems that you have trouble getting an answer to your question in the first 24 hours.
Let us give you a few hints and helpful links.

First, make sure you browsed through our Forum FAQ Beginner’s Guide. It will teach you what should be included in your topic.

You can check out some of our resources directly, see below:

  1. Always search first. It is the best way to quickly find your answer. Check out the image icon for that.
    Clicking the options button will let you set more specific topic search filters, i.e. only the ones with a solution.

  2. Topic that contains most common solutions with example project files can be found here.

  3. Read our official documentation where you can find a lot of information and instructions about each of our products:

  4. Watch the videos on our official YouTube channel for more visual tutorials.

  5. Meet us and our users on our Community Slack and ask your question there.

Hopefully this will let you easily find the solution/information you need. Once you have it, we would be happy if you could share your findings here and mark it as a solution. This will help other users find it in the future.

Thank you for helping us build our UiPath Community!

Cheers from your friendly
Forum_Staff

Hi @enthusiastic,

Does it give error for all pdfs or just few pdfs?

Also how are trying to digitize all your documents? Using parallel for each loop or just for each?

Regards
Sonali

It gives error after executing two to three files…
I am digitizing and classifying documents in for each

Hi @enthusiastic,

With for each, it shouldn’t happen as it executes files one by one unlike parallel for each.

However, you can still try doing below:

  1. Instead of getting all files from folders/subfolders in a list and traversing through that, get just list of folders.
  2. Then traverse through each folder one by one, within that have another loop to traverse through files in that folder one by one. Inside this inner loop, have your code for digitizing and classifying…

try this and see if this resolves your issue.

Also, might be worth checking – if everytime bot is failing after same processing number of files, there might be a chance that particular files is damaged thus leading to the error. So to pinpoint this, print name if file being processed and identify bot is failing at which file and then may be just delete that file or move that file from that folder for a moment and see how it goes with rest of the files.

Plus, sometimes, same OCR is unable to digitize all kinds of files so you may use try catch to catch this exception and under catch activity, defining digitize activity with some other OCR to see if changing OCR resolved the issue or not.

Hope any of the these recommendations work for you.

Regards
Sonali