It would be great if you could help me out for the below scenarios.
- Is there an intelligence to check if a pdf page is in portrait view or landscape view. In a pdf, some pages are in portrait view and some in landscape view. I need to read the text in that pdf using OCR. Any suggestions?
- I tried to extract text from a structured pdf document. I need the text from all the pages - Tabular and non-tablur formatted text. Below are the options i tried but it doesnt help. Let me know if we can achieve this by any other ways.
2a) “Read pdf with OCR” (With choosing inverted option and without choosing inverted options were tried) - Returns empty result.
2b) Read Pdf text - output is empty
2c) Scraping helps. But how do we know the number of pages and how to extract text from all the pages?
- I am trying to extract text from a pdf and trying to move it to another folder. But it says “The process cannot access the file because it is being used by other process”. How do we resolve it? The document is not open anywhere else.