How to extract the text from pdf which is opened in the chrome browser

Hi,

I am searching names in a pdf if that name is available in the pdf then I need to extract that name. This pdf is opened in chrome browser.

Regards,

Hi @raju_alakuntla

You Can Extratct it by using the cv get text activity with in th cv scope

Cheers!

Hi @raju_alakuntla ,
Thanks for reaching out to UiPath Community.

You can follow these easy steps.

-Open the PDF in Chrome:

  • Use the “Open Browser” activity to open the Chrome browser.
  • Navigate to the URL of the PDF document you want to search.
  • Search for Names:
    • Use the “Type Into” activity to input the name you want to search for in the PDF.
  • Capture the PDF Content:
    • Use the “Take Screenshot” activity to capture the content of the PDF displayed in the Chrome browser.
      Then comes the OCR part.
    • Use OCR (Optical Character Recognition) activities, such as “Read PDF with OCR” or “Read Text from Image,” to extract text from the captured screenshot. This text contains the content of the PDF.

Then when you need the name or any extracted value from the PDF you can try regular expressions or string manipulation to find the name in the text.

Regards,
@pratik.maskar

@raju_alakuntla

  1. Use the “Open Browser” activity to navigate to the PDF URL in Chrome.
  2. Attach to the Chrome browser using the “Attach Browser” activity to interact with its elements.
  3. Use the “Type Into” activity to enter the name you want to search for in any available search field within the PDF viewer.
  4. Simulate a click or use the “Send Hotkey” activity to trigger the search action (e.g., pressing “Enter”).
  5. Utilize the “Get Text” or “Screen Scraping” activities to extract the text from the PDF content.
  6. Use string manipulation or regular expressions to extract the name from the extracted text.
  7. Check if the name was found in the PDF content using conditional activities like “If.”
  8. If the name is found, store it in a variable for further use; if not found, handle the absence accordingly.
  9. If you need to search for multiple names, consider looping through a list of names and repeating the process.

You can use Get Cv text or get text once it completed
Check if that Name is avilable or not by using regex or Value.contains

In Regex is match activity will give you true or false based on that if condition to get the value.

Next use Contains in if condition that also work.

Hi @raju_alakuntla
Try these below steps

→ Use the use applicatoion/Browser activity to open the Chrome browser.
→ Use the Navigate to activity to navigate to the URL of the PDF you want to search.
→ Use the Take Screenshot activity to capture the content of the PDF displayed in the Chrome browser.
→ extract text from the captured screenshot. This text contains the content of the PDF.
→ Store the extracted text from the PDF in a variable.
→ Use the String Manipulation activities to search for the specific names you want to find in the extracted text.
→ If the name is found, store it in a separate variable.

Thank you