Background:
I have to extract the correct date information from the bill with using the document understanding in AI Center. I have created an ML Skills and upload around 50 documents into the dataset for pipeline trainings. After about 15 times of training, I can get the satisfied results for most samples.
However, some samples are still failed to get the correct result as the month and day values are mismatched. Here is the example:
The expected Bill Date should be 2021-08-07. For the Due Date of the same document, it can output the right value.
After more experiments, it is found that if the month and day values of Bill Date are interchangeable, i.e. 01-12, the failure rate is higher.
In order to overcome this issue, what should I do to improve the accuracy of model? Is it useful if I retrain the ML Skill with the same dataset for multiple times, e.g. 10 times?
As I am new to the document understanding and AI Center, please share your experience with me if you have any idea to this topic. Thank you very much!