I have an MS Word document with some text boxes inside.
When I read the document using UiPath.Word.Activities.WordReadText, the result lacks the content of the text boxes.
I took it further and created a PDF file from that Word doc.
The result: the text boxes appear in the PDF file, and when I read the PDF file using UiPath.PDF.Activities.ReadPDFText, the result contains the content of the text boxes (unlike the result of reading the original Word document).
Oh, sorry, as a new user I’m not allowed to upload files. However, I’ve uploaded the files to a shared Dropbox folder:
‘Sample.docx’ is the original document with text boxes. Two sample text boxes are marked in red color.
‘Sample.pdf’ is the PDF version of ‘Sample.docx’.
‘Extract_From_Sample_docx.txt’ is the txt file extracted from ‘Sample.docx’ using UiPath.Word.Activities.WordReadText.
‘Extract_From_Sample_pdf.txt’ is the txt file extracted from ‘Sample.pdf’ using UiPath.PDF.Activities.ReadPDFText.
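
To check independently that the text box content really is stored in the .docx, here is a minimal Python sketch that works outside UiPath. It is not a UiPath activity and not necessarily how WordReadText works internally. It relies on the fact that a .docx file is a ZIP archive whose main part is word/document.xml, and that text box content sits inside w:txbxContent elements. The function and helper names are my own, and the sketch assumes the text boxes are in the document body (not in headers or footers). Word often stores each text box twice, with a legacy copy under mc:Fallback, so that copy is skipped.

```python
import zipfile
import xml.etree.ElementTree as ET

# XML namespaces used by WordprocessingML and markup compatibility.
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"
MC = "{http://schemas.openxmlformats.org/markup-compatibility/2006}"


def _textboxes(element):
    """Yield each w:txbxContent element, skipping mc:Fallback duplicates."""
    for child in element:
        if child.tag == MC + "Fallback":
            continue  # legacy copy of a text box already seen under mc:Choice
        if child.tag == W + "txbxContent":
            yield child
        else:
            yield from _textboxes(child)


def extract_textbox_text(docx_source):
    """Return one string per text box found in word/document.xml.

    docx_source may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(docx_source) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    boxes = []
    for box in _textboxes(root):
        # Join the runs of each paragraph, then the paragraphs of the box.
        paragraphs = [
            "".join(t.text or "" for t in p.iter(W + "t"))
            for p in box.iter(W + "p")
        ]
        boxes.append("\n".join(paragraphs))
    return boxes
```

Calling `extract_textbox_text("Sample.docx")` should return a list with one entry per text box, which can then be compared with the text extracted from ‘Sample.pdf’.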