Is it possible to extract PDF text with its font information (size, boldness, and font type)?

aditya.prakash · October 24, 2018, 12:51pm

Hello All,

I am stuck trying to find patterns in some PDF samples.
Is it possible to extract PDF text along with its font information (size, boldness, and font type)?

Regards
Aditya

aditya.prakash · October 25, 2018, 10:29am

Hello everyone,

Does anyone have a solution for this?

Regards
Aditya

Tiberiu_Niculescu · October 26, 2018, 8:12am

aditya.prakash · October 26, 2018, 9:51am

Hey Tiberiu,

Thank you very much for your response. I went through this post; it suggests looking at the font to see the font type and boldness.

My requirement here is to classify documents based on font type and boldness. My documents contain multiple fonts, with normal, bold, and extra-bold text.

So my question is: can we consume the Adobe Reader API from UiPath in order to get the font details?

Regards
Aditya

aditya.prakash · November 1, 2018, 6:06am

Hello All,

I found an alternative way:

First, convert the PDF to Word using Balarewa.PDF.Activities, or any other activity if one exists.

Then you can create a Python function to read the text's font information.

The code is:
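import docx  # provided by the python-docx package

path = '/home/karamveer/Downloads/222.docx'  # your .docx file path
doc = docx.Document(path)
for p in doc.paragraphs:
    # Font name and size set on the paragraph's style (None if inherited)
    name = p.style.font.name
    size = p.style.font.size
    print(name, size)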
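
Note that the snippet above only reads the font defined on each paragraph's style, so it will not report boldness or fonts applied directly to part of a paragraph. Since the goal is to classify text by font type and boldness, a run-level variant may be closer to what is needed. The following is a minimal sketch, assuming the python-docx package and a converted .docx file; the helper names `run_font_info` and `font_report` are made up for illustration, and how faithfully fonts survive the PDF-to-Word conversion is not something this code can guarantee.

```python
def run_font_info(run, paragraph_style=None):
    """Return (text, font name, size in points, is_bold) for one run.

    Values not set directly on the run are None in python-docx, so we
    fall back to the paragraph style's font when one is given.
    """
    style_font = paragraph_style.font if paragraph_style is not None else None

    name = run.font.name
    size = run.font.size
    bold = run.bold
    if style_font is not None:
        if name is None:
            name = style_font.name
        if size is None:
            size = style_font.size
        if bold is None:
            bold = style_font.bold

    # Sizes are Length objects; .pt converts them to points.
    size_pt = size.pt if size is not None else None
    # A bold value that is still None is treated as "not bold" here.
    return run.text, name, size_pt, bool(bold)


def font_report(path):
    """Collect font info for every non-empty run in a .docx file."""
    import docx  # python-docx, imported here so the helper above has no dependency

    doc = docx.Document(path)
    return [
        run_font_info(run, p.style)
        for p in doc.paragraphs
        for run in p.runs
        if run.text.strip()
    ]
```

Calling `font_report(path)` with your own .docx path returns one tuple per run, which you can then group by font name or boldness. A name or size can still come back as None when it is inherited from a base style or the document defaults, so treat None as "not set explicitly" rather than as a missing font.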

Happy Automation

Regards
Aditya

system · January 28, 2019, 11:32am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.
