Read PDF Text cannot get the values correctly



Hi all,

As we usually receive many invoices/debit notes in PDF format and want to read it with the RPA.
However, i find the following problem that makes my RPA failed, would anyone help.

I use “Read PDF Text” to read the PDFs.

All PDFs are in the following format:

However, when i copy and paste these PDFs in Notepad, their actual formats are:

Some are…

Vendor Code: 00001
Invoice No.: 11111
Page 1 of 1

But some are…

Invoice No.: 11111
Page 1
of 1
Vendor Code:

Some are

Vendor Code:
Invoice No.:
1 of 1

many many formats…

How can i get the Vendor code, Invoice number correctly?

Thanks so much


Can you share a sample pdf ?