Bug Microsoft OCR

Dragos_Cioata · August 26, 2019, 2:13pm

Hello,
I have an interesting bug. Let's say we have two PDFs (pdf1 and pdf2).
If I extract the data from pdf1 with Microsoft OCR, I get one result. If I first extract data from pdf2 and then from pdf1, I get a different result for pdf1.
Thanks

Palaniyappan · August 26, 2019, 2:25pm

Hmm, this is interesting.
May I know how the results differ?
Cheers @Dragos_Cioata

Dragos_Cioata · August 26, 2019, 2:29pm

Well, the extracted text contains the same information, but the position of the information is different.
I have not noticed any particular pattern.
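
One way to confirm that two runs really contain the same information in a different order is to compare the words while ignoring their position. This is only a minimal sketch in Python: the two sample strings are invented for illustration, and `same_content` is a hypothetical helper, not part of any OCR activity.

```python
from collections import Counter


def same_content(text_a: str, text_b: str) -> bool:
    """Return True when both texts contain the same words, ignoring order and whitespace."""
    return Counter(text_a.split()) == Counter(text_b.split())


# Hypothetical OCR outputs for the same PDF from two runs (invented sample data).
run_1 = "Invoice No: 1042\nTotal: 250.00 EUR\nDate: 2019-08-26"
run_2 = "Date: 2019-08-26\nInvoice No: 1042\nTotal: 250.00 EUR"

print(same_content(run_1, run_2))  # True: same words, different order
print(run_1 == run_2)              # False: the raw strings differ
```

If this check fails, the runs differ in the recognized text itself and not only in its position.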

Palaniyappan · August 26, 2019, 2:31pm

Well, that's common with OCR, and it can also depend on the speed at which the process runs.
But the result will be the same when we extract specific details with string manipulation from either output.
So no need to worry about it, buddy: if you use the same string manipulation expression, you get the same output from the text obtained from both PDFs.

Cheers @Dragos_Cioata
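
The string manipulation idea above could look roughly like the sketch below. It is written in Python for illustration only; the sample texts, the `Label: value` layout, and the `extract_field` helper are all assumptions, not something taken from the original PDFs or from the OCR activity.

```python
import re
from typing import Optional


def extract_field(text: str, label: str) -> Optional[str]:
    """Return the value that follows 'label:' on the same line, or None if the label is absent."""
    pattern = re.compile(rf"{re.escape(label)}\s*:\s*(.+)")
    match = pattern.search(text)
    return match.group(1).strip() if match else None


# Hypothetical OCR outputs for the same PDF, with the lines in a different order.
first_run = "Invoice No: 1042\nTotal: 250.00 EUR\nDate: 2019-08-26"
second_run = "Date: 2019-08-26\nInvoice No: 1042\nTotal: 250.00 EUR"

# The same expression gives the same value from both outputs.
print(extract_field(first_run, "Total"))   # 250.00 EUR
print(extract_field(second_run, "Total"))  # 250.00 EUR
```

This only holds while a label and its value stay on the same line. If the OCR separates them, the expression has to be adapted to the actual layout.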

system · August 29, 2019, 2:31pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.
