I've been using DU for a few weeks to test invoices. I've run multiple training sets.
However, every time I run my code, the confidence for some fields is low most of the time (below 50%).
I kept retraining with more sets, but I keep getting the same result.
My question is: is there an alternative to Document Understanding for invoices? Are there libraries or packages I could use that might give better results?
From the post and comments, I gather that you are using the ML extractor.
There are several things you can work on to address this.
As you know, all the available ML models are retrainable. If you are using the endpoints for the ML extractor and getting low confidence, I suggest you switch to the AI Center ML models, since you can retrain them on your own documents.
Initially, the confidence may be low. The good thing is that you can use the Data Manager and start training with a good number of initial documents. This should increase the accuracy (confidence), although it may take multiple training runs, depending partly on how many initial documents you provide.
Thereafter, you can keep fine-tuning the model on the documents and fields where confidence is still low. This is possible through the Document Understanding workflow itself.
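As a rough illustration of that fine-tuning loop, here is a minimal Python sketch of picking out the fields that fall under a confidence threshold so they can be reviewed and fed back into training. It does not use any UiPath API: the `ExtractedField` class, the `fields_needing_review` helper, the field names, and the 0.5 threshold are all assumptions made up for the example (the threshold mirrors the "below 50%" mentioned in the question).

```python
from dataclasses import dataclass


@dataclass
class ExtractedField:
    """Hypothetical container for one extracted invoice field."""
    name: str
    value: str
    confidence: float  # assumed to be on a 0.0 to 1.0 scale


def fields_needing_review(fields, threshold=0.5):
    """Return the fields whose confidence is strictly below the threshold."""
    return [f for f in fields if f.confidence < threshold]


# Made-up extraction result for a single invoice.
invoice = [
    ExtractedField("invoice_number", "INV-1001", 0.93),
    ExtractedField("total", "1250.00", 0.41),
    ExtractedField("due_date", "2023-05-01", 0.48),
]

# Names of the fields to send for human validation and later retraining.
to_review = fields_needing_review(invoice)
print([f.name for f in to_review])
```

Tracking which field names land in this list most often across many invoices would show where additional labelled documents are likely to help the most.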
The videos here may give you some tips on training the models for better accuracy.
The tips our friends have given you will certainly help as well. There are many methods available to us.
Rossum is the alpha when it comes to intelligent document parsing. It is easy to set up, and you can build custom connectors entirely in Python. It is also quite straightforward to integrate with any RPA tool.