How to get text from pdf

Ajinkya_Deshmukh · December 6, 2018, 12:49pm

I want to get words from the pdf which is opened in the Chrome browser.

From the above image from the pdf, I want to fetch specific text which is marked in red. Pdf template is the same for all the documents.
I have tried the get text activity but did not work. On using get text activity, the complete page gets selected.

Please help.

Divyashreem · December 6, 2018, 1:46pm

if you want only those two values iii.New exhibit space selection is constant go for relative scraping.

Head · December 6, 2018, 2:17pm

Hi,

Can you provide example of pdf?

Ajinkya_Deshmukh · December 6, 2018, 2:39pm

Hi,
I am opening the pdf document in the chrome browser. When I am trying to select the text from this opened pdf, the complete text gets selected.
On using scraping, it shows error that it is not supported in only internet explorer.

please help.

Ajinkya_Deshmukh · December 6, 2018, 2:41pm

I am trying to open the pdf document from the chrome browser.
This happens for all the pdf documents.

Divyashreem · December 6, 2018, 2:49pm

is there a way to download the PDF.?

Ajinkya_Deshmukh · December 6, 2018, 3:41pm

Yes. We can download the pdf file.

Madhavi · December 6, 2018, 5:20pm

@Ajinkya_Deshmukh Download the PDF, if its in native format, use “Read PDF Text” activity, else use "Read PDF with OCR activty to read the PDF content. The output will be of data type String.
Do string manipulation or use Matches activity with regex pattern to match the value you want to retrieve. This will give you the required values.

Divyashreem · December 6, 2018, 6:17pm

You can follow watever @Madhavi has mentioned, if it is not readable PDF and your OCR results are bad then go for relative scrap.

Topic		Replies	Views
How to extract the text from pdf which is opened in the chrome browser Studio uiautomation	5	1294	November 9, 2023
Read pdf from browser Help	6	2239	October 13, 2018
Read PDF online in CHrome - the best activity for this Help activities	5	2952	August 14, 2020
Read Pdf Text from a pdf on a webpage Help pdf , activities , web , question	16	2334	November 7, 2019
Extract Text from PDF / specific elements from pdf / Selecting each paragraph / Accessibility Settings Help pdf , activities	21	17578	October 11, 2019

Most Active Users - Yesterday
Anil_G
mkankatala
V_Roboto_V
avinashy
Vhierdy_Hafidz
Simon1
SenorChang
Llessur
postwick
sharu_priya
More details...

How to get text from pdf

Related topics