PDF Scraping using Get Text

Hello everyone, this is my first interaction with the forum.
I have multiple PDF files in one folder, and I need to iterate through each one of them to extract specific data.
I’m using a “for each file in folder” loop, and in the body, I’m using a “use app/browser” action.
When I open the PDF file in Adobe, it says that the document isn’t accessible to a screen reader.

When I try to use the “Get Text” action to extract the data, I can’t select a specific value. In the “Print2” section, I can’t select only “456789012” even though it’s a CV element.

Print2

I would appreciate an explanation and some help.
Thank you

You should be using the PDF activities.

I know I can use the ‘Read PDF Text’ to save the text to a string and then manipulate it. But why can’t I select only the number using ‘Get Text’?

Because a PDF isn’t like a web page or app with selectors and individual UI elements.

Ok thanks for the explanation

Hi @Rafael_Carvalho

Welcome to the UiPath community.

You can use a regex to extract the required text.
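
As a rough illustration of the regex approach, here is a minimal Python sketch. The sample text and the helper name extract_number are made up for the example, and it assumes the value you want is a standalone run of exactly nine digits, like the 456789012 in your screenshot. In a UiPath workflow you would apply the pattern to the string returned by Read PDF Text instead; the same pattern should carry over to the .NET regex engine, but please test it against your real files.

```python
import re

# Hypothetical text standing in for the string returned by 'Read PDF Text'.
pdf_text = """Customer: Example Name
Account number: 456789012
Date: 01/02/2024"""

# Assumption: the target value is a standalone run of exactly nine digits.
# \b keeps the pattern from matching part of a longer number.
pattern = re.compile(r"\b\d{9}\b")


def extract_number(text):
    """Return the first standalone nine-digit number in text, or None."""
    match = pattern.search(text)
    return match.group(0) if match else None


print(extract_number(pdf_text))
```

If your PDFs always put a label in front of the value, anchoring the pattern on that label is usually more robust than matching on digit count alone.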

Cheers
