In a web portal, clicking a document link fetches a document from which we need to extract the required data. The document differs from candidate to candidate in the list. The required data is the same in all documents, but its location varies (i.e. it may be on different pages).
Determine the document type and store it in a String variable. Then use a Switch and, in each Case, code the steps for that document type.
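To make the idea concrete outside of Studio, here is a minimal Python sketch of "detect the type, then branch per type". It is only an illustration: the marker phrases, type names, and handler functions are all hypothetical, and in a real workflow the same shape would be a String variable feeding a Switch.

```python
from typing import Callable, Dict, Tuple

# Hypothetical marker phrases per document type. Real markers should be
# chosen by looking at actual sample documents.
TYPE_MARKERS: Dict[str, Tuple[str, ...]] = {
    "bank_statement": ("statement of account", "opening balance"),
    "cancelled_cheque": ("or bearer", "cancelled"),
}


def detect_document_type(text: str) -> str:
    """Return the first type whose marker phrase appears in the text."""
    lowered = text.lower()
    for doc_type, markers in TYPE_MARKERS.items():
        if any(marker in lowered for marker in markers):
            return doc_type
    return "unknown"


# Placeholder handlers: each one stands in for the steps coded in a Case.
def handle_bank_statement(text: str) -> str:
    return "ran bank statement steps"


def handle_cancelled_cheque(text: str) -> str:
    return "ran cancelled cheque steps"


def handle_unknown(text: str) -> str:
    return "sent to manual review"


# The dictionary plays the role of the Switch: one entry per Case.
HANDLERS: Dict[str, Callable[[str], str]] = {
    "bank_statement": handle_bank_statement,
    "cancelled_cheque": handle_cancelled_cheque,
}


def process_document(text: str) -> str:
    doc_type = detect_document_type(text)
    # Fall back to a default branch when the type is not recognised.
    return HANDLERS.get(doc_type, handle_unknown)(text)
```

The default branch matters here: with more than a handful of document types, an unrecognised layout should be routed somewhere visible rather than silently skipped.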
I know the document types. They are all PDFs, and there are more than 5 different documents.
Some are digital documents and some are scanned.
We have anchor text for the data which needs to be extracted.
We are extracting Account Holder Name, Account Number and IFSC, mostly from bank statements or cancelled cheque leaves. The statements are digital documents only, but the cheque leaf is a scanned one.
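For the digital statements, where the text can be read directly, anchor-based extraction could look roughly like the Python sketch below. It assumes the text of each page is already available as a string. The anchor wordings ("Account Holder Name", "Account No", "IFSC Code") and the 9 to 18 digit account number length are assumptions to adjust against the real documents. The IFSC pattern follows the standard 11-character format: four letters, a zero, then six alphanumeric characters.

```python
import re
from typing import Dict, List, Optional

# Anchor wordings below are assumptions, not taken from real documents.
PATTERNS: Dict[str, "re.Pattern[str]"] = {
    # Name: letters, spaces and dots after the anchor, up to the line end.
    "account_holder_name": re.compile(
        r"Account Holder Name\s*[:\-]?\s*([A-Za-z][A-Za-z .]*)", re.IGNORECASE
    ),
    # Account number: assumed to be 9 to 18 digits.
    "account_number": re.compile(
        r"Account (?:Number|No\.?)\s*[:\-]?\s*(\d{9,18})", re.IGNORECASE
    ),
    # IFSC: 4 letters, a literal 0, then 6 alphanumeric characters.
    "ifsc": re.compile(
        r"(?i:IFSC(?:\s+Code)?)\s*[:\-]?\s*([A-Z]{4}0[A-Z0-9]{6})\b"
    ),
}


def extract_fields(pages: List[str]) -> Dict[str, Optional[str]]:
    """Search every page and keep the first match found for each field."""
    result: Dict[str, Optional[str]] = {name: None for name in PATTERNS}
    for page_text in pages:
        for name, pattern in PATTERNS.items():
            if result[name] is None:
                match = pattern.search(page_text)
                if match:
                    result[name] = match.group(1).strip()
    return result


if __name__ == "__main__":
    sample_pages = [
        "Statement of Account\nAccount Holder Name: Ravi Kumar\nBranch: Main",
        "Account No: 123456789012\nIFSC Code: SBIN0001234\n",
    ]
    print(extract_fields(sample_pages))
```

Because the function scans all pages and fills each field once, it does not matter which page the data sits on. Scanned cheque leaves would need OCR first, and OCR noise usually makes plain regex less reliable there.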
The document types vary and scanned documents may also be present, which points to using Intelligent Document Processing methods such as Document Understanding.
We already have some out-of-the-box UiPath DU packages for Bank Statements and Cheques. Could you check these packages and let us know whether they are able to extract the required details?