Getting text from pdf

Hello all, I’m facing an issue regarding extracting text from pdf. It returns a wrong value, so what I’m trying to get is the value of 2500 Long-Term Debt which is
603,507. However, it returns 2,887,230 which is rows bellow it.

Here are the steps am using: first I added the Anchor base activity and within it I have added find element which seems to be fine and a get full text which have the issue

Sequence.xaml (8.7 KB)
Balance-Sheet-Example (1).pdf (481.1 KB)

Thanks in advance

@mounir.mohsen

Check this Workflow, i hope it will help you-
Sequence.xaml (7.6 KB)

@mounir.mohsen

Use the expression -

System.text.RegularExpressions.Regex.Match(PDF_test,"(?<=Long-Term Debt +)[\d\.\,.]+").ToString
2 Likes

@mounir.mohsen welcome back to forum,

it’s always good to pdf activities and your pdf looks like native pdf, read the pdf with read pdf text and apply regex to extract the required information.

Thanks,
Guna

1 Like

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.