Is It Possible to Extract Words Based on Fonts from PDF, DOCX, or XLSX?

Dears,

Is it possible to extract words based on their font properties (bold, italic, size, etc.) from a PDF or DOC file?

Thanks in advance.

You could use a Python script, as detailed in the Stack Overflow question "Extracting bold text from Resumes( .Docx,.Doc,PDF) using Python", and run it from your workflow with the activity documented here: https://docs.uipath.com/activities/docs/run-script

Thanks @MarinAlexandru. Is there any equivalent using UiPath?

I'm not sure it can be done without running some sort of script or code through UiPath Studio; I don't think there is an activity built specifically for this task.
But it shouldn't be hard to implement: install the Python package, then copy and adjust the code from Stack Overflow until you are satisfied with the result.
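
To give a rough idea of what such a script could look like, here is a minimal sketch for DOCX files only. It is not the Stack Overflow code: instead of a third-party package it reads the document XML directly with the Python standard library, so nothing needs to be installed. The function names `extract_runs` and `bold_words` are made up for this example, and the sketch assumes the bold/italic/size formatting is applied directly to the text runs (formatting inherited from a paragraph or character style is not detected). PDF and XLSX would need a different approach.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used inside word/document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W_NS}


def _flag_on(run_props, tag):
    """Return True if a toggle property (e.g. w:b, w:i) is switched on."""
    if run_props is None:
        return False
    elem = run_props.find(f"w:{tag}", NS)
    if elem is None:
        return False
    # A bare <w:b/> means "on"; an explicit val of 0/false/off means "off".
    return elem.get(f"{{{W_NS}}}val") not in ("0", "false", "off")


def extract_runs(docx_file):
    """Yield (text, bold, italic, size_pt) for every text run.

    docx_file may be a path or a binary file-like object.
    Assumption: only direct run formatting is inspected, not styles.
    """
    with zipfile.ZipFile(docx_file) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    for run in root.iter(f"{{{W_NS}}}r"):
        text = "".join(t.text or "" for t in run.findall("w:t", NS))
        if not text:
            continue
        props = run.find("w:rPr", NS)
        size_pt = None
        if props is not None:
            sz = props.find("w:sz", NS)
            val = sz.get(f"{{{W_NS}}}val") if sz is not None else None
            if val is not None:
                size_pt = int(val) / 2  # w:sz is stored in half-points
        yield text, _flag_on(props, "b"), _flag_on(props, "i"), size_pt


def bold_words(docx_file):
    """Return the list of words that appear in bold runs."""
    words = []
    for text, bold, _italic, _size_pt in extract_runs(docx_file):
        if bold:
            words.extend(text.split())
    return words
```

Calling `bold_words("report.docx")` (the file name is just a placeholder) would return the bold words as a list, which the workflow could then pick up from the script output. Filtering on italic or font size works the same way using the other fields that `extract_runs` yields.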