How to rad invoice number from scanned PDF

Hi,
can you please guide me to extract invoice number,date,amount from scanned PDF.
Im new to UiPath, please guide me.

Hi,

For scanned PDF you can use Read PDF text with OCR and the output you get you can use Regular expressions.

If the output of Read PDF text with OCR is not clear then you can use newly introduced Computer vision activities which extract accurate values from scanned PDF.

Refer the below post to see how to install Computer Vision activities.

There is a training video for computer vision in RPA Academy

5 Likes

Hi… i think the best way to deal with it is to use AI Computer Vision as @anil5 mentioned. It is much easier to use and more accurate… If you are registered in UIPath academy, there is a separate section to read PDF in level 1 foundation training… going through that will give you a basic idea as well…

1 Like

thanks @anil5 for you reply.
i will check and update you.

How can we use CV activities without the file being opened. Is there anyway if we can read using CV just like read pdf without opening file.

1 Like

Hi @mohammedamaan,

To use CV activities , file or PDF should be opened , that’s y we are capturing the scope first by using CV screen scope activity and then using other activities inside CV scope to extract values.

1 Like

Hi @anil5,

In that case, if i need to get data from the lower part of the pdf will I have to scroll down and then again capture the scope?

1 Like

Hi @mohammedamaan,

That’s true

1 Like

Thanks.
Also wanted to know can we scan handwritten pdf as well using CV. I tried but I am not able to get the desired output. Don’t know if I am missing something to incorporate the same.

1 Like

Can you try different ocr engines and try

1 Like

That is very time consuming to open the file and capturing the scope manually.