Computer Vision (CV)

abivanth.r · February 28, 2024, 2:10pm

Hi,
I need to open a PDF and extract its text. However, I can’t use the Anchor Base activity or any other activity.
Can anyone please help me do this with CV?
Thanks.

lrtetala · February 28, 2024, 2:24pm

Hi @abivanth.r

Use the Read PDF Text activity, then use a regex to extract the required value.

vrdabberu · February 28, 2024, 2:25pm

Hi @abivanth.r

=> Use Read PDF Text, or Read PDF with OCR if the PDF is scanned, and store the output in a variable.
=> Use regular expressions to extract the specific text you want.
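
As a rough illustration of the regex step, here is a minimal Python sketch. It assumes the PDF text has already been read into a string. The sample text, the field names, and the patterns are all hypothetical and would need to be adapted to the real documents. In a UiPath workflow, the same patterns could be applied to the variable holding the read output.

```python
import re

# Hypothetical sample text, standing in for the output of the PDF read step.
pdf_text = """Invoice No: INV-2024-0042
Date: 2024-02-28
Total: 1,250.00 USD"""

# Hypothetical field patterns; each has one capture group for the value.
FIELD_PATTERNS = {
    "invoice_no": r"Invoice No:\s*(\S+)",
    "date": r"Date:\s*(\d{4}-\d{2}-\d{2})",
    "total": r"Total:\s*([\d,]+\.\d{2})",
}


def extract_fields(text, patterns):
    """Return the first captured value for each pattern, or None if no match."""
    result = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, text)
        result[name] = match.group(1) if match else None
    return result


fields = extract_fields(pdf_text, FIELD_PATTERNS)
print(fields)
```

Returning None for a missing field, rather than raising an error, makes it easy to flag documents where the read step did not produce the expected text.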

Regards

abivanth.r · February 28, 2024, 2:29pm

The PDF is image-based and the text in it is Chinese or Japanese. I can’t use OCR either; some of the data is getting lost.

lrtetala · February 28, 2024, 2:37pm

@abivanth.r

Try Tesseract OCR; it supports those languages.

Cheers!!

abivanth.r · February 28, 2024, 2:40pm

I tried that too. I can get the data for one PDF, but I have multiple PDFs with structured data. I can’t use OCR; it doesn’t return the correct data.

postwick · February 28, 2024, 2:46pm

OCR never will. It’s not going to be 100% accurate.

Computer Vision is not a solution for this. Computer Vision is about identifying screen elements like buttons, text boxes, etc.

Document Understanding is what you want to use, but again it’ll still be OCR and not 100% accurate.
