I cant Extract pdf some times using CV

I having an issue while extracting PDFs in CV; sometimes the alignment and table are in different positions. is there any solutions for extracting correctly. in CV is there any method to fix the issue.

Hi @Melbin_Antu

Instead of using the Cv you can use the Regx or data manipulation.

Regards,
Gowtham K

i need to extract a scanned pdf. That’s why the only way to extract pdf is CV.

Hi @Melbin_Antu

Use Read PDF With OCR activity to read Scanned documents

Regards,

@Melbin_Antu You can use the anchor activity or

try this → Read the Pdf for OCR then do it data manipulation

CV is tailored to actual screen UI data, so if it works, it’s just a happy side-effect :slight_smile:
Can you show us some (anonymised if necessary) screenshots so we can better understand what’s wrong with the CV extraction, nonetheless? Environment details would also help (type of project: Windows/Cross-platform, Studio & UIA package versions, target pdf or screenshot if possible), workflow details (how you’re using the CV Extraction: do you need scrolling? are you using CV Extract Table?, etc.).

Maybe you can also try using Document Understanding or GenAI activities as well…?

using CV only its extracting i tried with omnipage,readpdf,tesseract like ocr but finally some file are extracting only CV.
some file are extracting properly but some file came like clarity, orentation, if clarity fine also sometimes table data not extracting properly like this issue am facing.
any solution for this