When we use Read PDF Text or Read PDF Text With OCR it always returns the text into string format however sometimes I feel it should also provide the structured format like if I look at the extracted text and pdf file, it should look same. I am not sure if you have seen or not but this feature is available in AA. I’m not comparing this tool right now however sometime I feel if UiPath includes that feature, it will become more awesome.
Though read pdf activity works in the background without requiring the pdf to be opened which is great but it didn’t preserve the structure.
I will check this out too since I have a similar problem of the structure being lost when I used read pdf activity. I have used get full text or native text activities as an alternative as these preserve the structure. But these activities require us to open the pdf.
This is about pdf docs that are not scanned but native pdf documents so i have used read pdf text activity and not the one with ocr. Ocr is not needed ryt.