Read PDF with OCR not extracting the first few lines

Hi,

I am trying to read a PDF through OCR and write it to a text file. The PDF has both searchable text and text in images. When I use the ‘Read PDF with OCR’ activity, it ignores the first line, which is the header of the PDF document. In the output, I can see text from the second line onwards. Please note that both the first line and the second line are searchable text in the PDF.

I also tried the ‘Read PDF Text’ activity and see similar results. Unfortunately, I am not able to share the PDF.

Has anyone faced a similar issue before?

Thanks

I also have a similar problem. Read PDF With OCR in C# returns a single line when using the UiPath Document OCR activity. Tesseract OCR returns multiple lines, but its accuracy is not very good.

Any advice would be appreciated.