I’m trying to extract information from a PDF file.
- “Read PDF” works, but the problem is that it gives me unstructured data (a mix of numbers and letters)
- “Get text” captures text by block, not by specific field (which is not helpful for me)
- Screen scraper methods:
  - “Full text” behaves the same as “Read PDF”: no structure, mixes numbers and letters
  - “Microsoft OCR” does not recognize the text
  - “Google OCR” recognizes it poorly
  - “Native” recognizes the text very well; however, when I try to output the data through “Message Box” or “Write Line”, I get an empty field.
The question is: how can I output the data captured by the native scraping method from a PDF file?
P.S. In some forum topics I saw Regex and splitting on spaces mentioned. What are those methods?
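
For reference, this is roughly what I understand those two methods to mean. It is only a minimal Python sketch, not UiPath code, and the sample text and field names (`Invoice No`, `Total`) are made up for illustration rather than taken from my PDF:

```python
import re

# Hypothetical text, standing in for the unstructured output of a PDF read.
raw_text = "Invoice No: 10482 Date: 2021-03-15 Total: 1250.75 USD"

# Split on spaces: break the text into tokens wherever whitespace occurs,
# then pick values by position. This breaks if the layout shifts.
tokens = raw_text.split()

# Regex: describe the pattern around a field and capture only the value.
invoice_match = re.search(r"Invoice No:\s*(\d+)", raw_text)
total_match = re.search(r"Total:\s*([\d.]+)", raw_text)

# Guard against a missing match instead of assuming the field is present.
invoice_no = invoice_match.group(1) if invoice_match else None
total = float(total_match.group(1)) if total_match else None

print(tokens)
print(invoice_no, total)
```

If that understanding is right, splitting depends on the values always being in the same position, while a regex looks for the label next to the value, so it seems better suited to pulling specific fields out of unstructured text.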