-Being reading a few post as usual about the the current scenario before sending the post but none specifically about ‘Not reading/continuing’ after a specific page number.
-Our previous question has been answered already: so no page limits on the OCR framework, great!
-so why does it stops?
•Context: we’re using Document Understanding framework to read a 50 pages or so PDF.
-The OCR engine (digitize section) does not go any further than page 6!
-The page where it stop is almost ‘Blank’ and the resto of PDF continues with pages with regular
info again on it (in case it helps)
-It just stops reading the rest, How do we know? (see ‘Steps taken’)
-In Debug mode using a ‘Break point’ (a few steps after PDF has been Digitized) we checked
the String of Text made as a result from ‘Digitize’ section and does not have all the info
expected from the DOC.
-We tried with a smaller PDF (2 pages) and we get all the info on that String as expected.
•Engine currently used: I will say ‘all of them’. We just tried all engines yesterday.
-Weird isn’t it?
-So we already open a ticket with the guys from Enterprise support (waiting response), we’ve
been checking post on the forum as usual but is never bad to get as much feedback/ideas as
-Unfortunately, we can’t share the doc because confidentiality
PS, I just edited this post now with new info from Yesterday testing all engines on the Studio.
Feel free to send any thoughts, Stay Safe!