I am looking into a requirement, where I need to extract specific data from different invoice (scanned images in PDF format) files uploaded globally into an invoice management system.
The content(/image) in scanned invoice file could be in English or any native language/dialect or a combination of both.
Using UiPath, how can we identify the native language/dialect used in a random invoice file from a batch of invoices, dynamically during runtime/production, read/extract the target data using an anchor element(which is also in native dialect).
I have come across Document & text translator component in Uipath Go that can translate text to the language we specify.
But I am looking at the possibility of identifying which foreign language the invoice file content is in & translate it back to English for further processing.
Any suggestions/guidance highly appreciated.