Need help to put me on the right path towards the best solution for this:
We need to extract a per item data (Item Name/Description/Number, Amount) from hundreds of different scanned invoices (images) with varying structures.
Maybe just an advise on the general overview of the workflow and the UiPath technologies to be used would be a great help already. If there are also advises for external solutions that can be purchased, we are also open to that option (although I would like to try doing it on my own first to understand it better).
For your scenarios, I guess Document Understanding is the right one since it has the capability to extract the data from different documents (scanned, .png, .tiff etc) using AI capabilities. Also, in DU there are different extractors available to extract the data from the docs
Intelligent Form Extractor : It is suitable until and unless the layout of the document and alignment of the data remains same. It is also capable to identify signature and handwritten fields
Form Extractor : It is similar to the above one but it doesn’t have the capability to identify handwritten or signature fields
Regex based Extractor : It is suitable for small use cases
ML Extractor : It is the most advanced one and is more suitable for the documents coming with higher volume or most unstructured formats