Sorry if someone has asked this before.
I have hundreds of types of PDF files, and their pages might be portrait or landscape. (For example, the first page might be portrait while the other pages may or may not be landscape.)
They might look like an invoice, a data table, or something else. I don’t know in advance, because my users will upload them.
I am trying to use UiPath and the Intelligent OCR activities for automation. (I also tried ABBYY FineReader and Omni OCR.) When I give the flow a PDF with multiple pages, it fails. When I give it a single page, it reads it, but if the page is landscape there may be character errors.
I think I first need to find out whether each page of the PDF is portrait or landscape. Or do you have another idea for how I can solve this? I will develop a web UI for my users, and they are going to upload their PDF documents through it. Maybe I can split and rotate the pages at that point and then pass the split files to UiPath? Or should I solve it entirely on the UiPath side? What do you think?
For now, I think my problem is caused by landscape pages or by multi-page files.
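Here is a rough sketch of the split step I have in mind for the web UI side, assuming Python and the pypdf library (my own choice, it has nothing to do with UiPath). The names `is_landscape`, `split_pdf` and the `page_0001.pdf` file pattern are made up for illustration. It only looks at the page dimensions and the `/Rotate` value, so it cannot tell whether scanned text is actually lying sideways on the page.

```python
from pathlib import Path


def is_landscape(width: float, height: float, rotation: int = 0) -> bool:
    """Return True if the page is wider than tall as displayed.

    `rotation` is the page's /Rotate value; 90 or 270 swaps the sides.
    """
    if rotation % 180 == 90:
        width, height = height, width
    return width > height


def split_pdf(src: str, out_dir: str) -> list[tuple[str, bool]]:
    """Write each page of `src` to its own PDF.

    Returns a list of (output path, is_landscape) pairs, one per page.
    """
    # Third-party dependency (pip install pypdf), imported here so that
    # is_landscape() can be used without it.
    from pypdf import PdfReader, PdfWriter

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    results = []
    for number, page in enumerate(PdfReader(src).pages, start=1):
        box = page.mediabox
        landscape = is_landscape(float(box.width), float(box.height), page.rotation)
        writer = PdfWriter()
        writer.add_page(page)
        target = out / f"page_{number:04d}.pdf"
        with open(target, "wb") as handle:
            writer.write(handle)
        results.append((str(target), landscape))
    return results
```

The idea would be to call `split_pdf("upload.pdf", "pages")` after an upload and pass the single-page files to the UiPath flow one by one, with the landscape flag available if those pages need different handling.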
In the end, I want to digitize every PDF file and extract “all the text data” to a .txt file without any wrong characters. That’s exactly what I need…
I have seen many tutorials, but they extract specific fields or something similar. I want all the data.
So, do you have any idea how I can solve these problems?
Your help is much appreciated.