How to Extract "Total amount" from multiple PDF

pdf
text

#1

Hi,

I’m newbie here and would like to know if there is a simple way of getting “Total amount” from multiple PDFs?

The PDFs is stored in the same file and are all in the same format. I’ve managed to go through all the PDFs in the file by using Directory.GetFiles(“C:\Users\User\Documents”, “PDF Fakturor\2410006500*.pdf”… But I’m only able to use the “Read PDF Text” activity which means I get the whole text for Output when I actually just want the “Total amount”.

Anyone got a solution for this?

Thanks


#2

Hello,

I would suggest you to use substring after reading the PDF, you can learn about it here

use assign activity after read pdf activity

pdf_text.Substring(pdf_text.IndexOf(“Amount:”)+7,12)

7 means 7 characters after the first character.
12 means how many characters after the last index, in this case 7.

Here is an example, just change the invoice_folder_path

NPO_debug.xaml (12.6 KB)


OCR on a PDF: Targeting using Coordinates not accurate
#3

hello beesheep,

in the xaml that you had posted (thanx for that)

how we can write the final value to an excel table?


#4

Hello, like what? can you elaborate a little more so I can fully help you.
in the mean time you can use a write cell activity.

regards.


#5

Hello,

in the example you had extracted a string iwant just to store it in an XLS file. i had tryed this butit does not work


#6

Essayez ceci:

Modifiez d’abord le fichier d’extension.
Deuxièmement, bien, juste le premier doit fonctionner MDR

Changez-le à xlsx.


#7

OH ! Look, same issue here, it seems an update from Uipath

@badita is there a way we can confirm?

here is the WF Examples (2).zip (494.7 KB)

also tagging @aksh1yadav and @acaciomelo


#8

I am running the WF on a different computer and works fine.