Microsoft Azure Computer Vision OCR returns incorrect 'Result' output

M_Kr · October 16, 2023, 12:09pm

I am using Microsoft Azure Computer Vision OCR in a ‘Read PDF With OCR’ activity. This engine is supposed to return two outputs: Text (the extracted string value) and Result (the extracted words along with their on-screen positions).

I tried using the Result variable to get the position of some specific words, but the only value I get is a single key-value pair, where the key is the entire PDF text and the value is ‘X=0, Y=0, Width=0, Height=0’. Any ideas why this might be?

My PDFs are scanned and I can’t provide an example as they contain sensitive information.
In the properties of the Microsoft Azure CV OCR activity I have ExtractWords set to true and UseReadAPI also set to true. I have my Azure Computer Vision ApiKey and Endpoint set to their specific values. Everything else is left as the default value.

Yoichi · October 16, 2023, 2:22pm

Hi,

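It seems to be an issue with the ReadPDFwithOCR activity itself. As a workaround, how about exporting the pages with ExportPDFAsImage and loading them with LoadImage, so that the OCR engine runs on the image instead of going through ReadPDFwithOCR? (It works in my environment.)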
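To check whether a given approach returns usable word positions, something like the following might help. It is only a rough Python sketch of the logic, not UiPath code: it assumes the OCR result can be viewed as a list of (text, box) pairs, and the names `Box`, `is_degenerate` and `find_word`, as well as the sample data, are made up for illustration.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Hypothetical stand-in for the rectangle attached to each OCR entry."""
    x: int
    y: int
    width: int
    height: int


def is_degenerate(result):
    """Return True for the symptom in this thread:
    a single entry whose box is all zeros."""
    return len(result) == 1 and result[0][1] == Box(0, 0, 0, 0)


def find_word(result, word):
    """Return the boxes of every entry whose text equals `word`,
    ignoring case."""
    return [box for text, box in result if text.lower() == word.lower()]


# Made-up sample data: the broken output (whole text, zero box) ...
bad = [("entire text of the PDF", Box(0, 0, 0, 0))]

# ... and a per-word output, as expected when word extraction works.
good = [
    ("Invoice", Box(40, 32, 120, 24)),
    ("Total", Box(40, 410, 80, 22)),
    ("total", Box(300, 600, 78, 22)),
]

if __name__ == "__main__":
    print(is_degenerate(bad))
    print(is_degenerate(good))
    print(find_word(good, "total"))
```

If `is_degenerate` is true for your output, the per-word positions were never produced, so searching for a word cannot return a position.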

Regards,

M_Kr · October 16, 2023, 2:49pm

Works perfectly, thank you!

