Can't get words position using Microsoft OCR and Read PDF with OCR

MGMKLML · April 12, 2018, 9:46pm

Hello.

I’m trying to do the following thing. I have a scanned document in the pdf format. I use “Read PDF with OCR” activity plus Microsoft OCR. There is a possibility of extracting KeyValuePair having used the Microsoft OCR thing. I store it in a variable and then generate the data putting the variable to “input” → “positions”. When I use “Output Data Table” to see the result, it contains only one column with the extracted pdf text. There are no each word positions. Is it possible to fix it?

arathi · April 13, 2018, 10:56am

Can you share your xaml and the pdf

MGMKLML · April 13, 2018, 11:21am

Yeah, sure I’ll do it in 5-6 hours, just don’t have an access to my computer right now.

MGMKLML · April 13, 2018, 4:02pm

@arathi Here it is.
Main.xaml (11.3 KB)
109970.pdf (502.7 KB)

arathi · April 17, 2018, 7:38am

hi this is what I am getting after reading the pdf. I have tried using language “rus” and “russian”. Let me know whats the result for you

Good day

MGMKLML · April 17, 2018, 7:52am

Hey :-). Thank you for your attempt. My results are a bit better if I use “Russian” with scale range of 0.7-1. But still I can’t get the word positions.

Topic		Replies	Views
Read PDF with OCR - ExtractWords doesn't change output Help pdf , ocr , activities	2	779	January 27, 2021
Is there a way to extract a table accurately from PDF with OCR Studio pdf , ocr , studio , question , activities_panel , pdf-extraction , microsoft-azure-computer-vision-ocr	0	586	October 12, 2023
Reading PDF with OCR unable to handle variable PDF position Help pdf , ocr	0	1034	September 5, 2018
What principle do the activities use to extract the words positions in background? Help pdf , ocr , activities	2	1247	July 26, 2018
Microsoft Azure Computer Vision OCR returns incorrect 'Result' output Activities ocr , activities , question , azure	3	415	October 16, 2023

Most Active Users - Yesterday
ashokkarale
ppr
Anil_G
Ajay_Mishra
Yoichi
mhaniff
Shiva_Nikhil
Anonymouss
quick_123
vrdabberu
More details...

Can't get words position using Microsoft OCR and Read PDF with OCR

Related Topics