I have invoices from 100+ vendors, with 100 samples for each vendor. I am trying to extract the needed information from them using ML. Creating the dataset, labeling it, and then training the model through the pipeline takes a lot of time and effort, and I think the maintenance will be very difficult. Imagine what happens if one vendor changes a field or something like that.
My question is: should I create a separate Dataset, Data Labeling session, ML Package, Pipeline, etc. for each vendor? It looks much easier this way, and I think that if I have an accuracy issue with one vendor, I can focus on that vendor and get better results.
What do you think?