I have a question regarding model training when we have huge number of different vendors (~1000).
It seems unrealistic to label 10-15 documents for each vendor in this case - dataset will be huge and it will require a lot of effort to label, also training will take very long time.
What would be the best approach in this situation?
Anyone had this issue and was able to find solution that worked?