Extracting data from PDF Invoices from ma

Sinan_Bolel_DoIT · January 3, 2020, 1:25pm

You cannot customize the taxonomy – at least not with the community edition endpoint. The models/taxonomies used for the community endpoints are pre-determined by the UiPath team.

You should still be able to get the fields you mentioned, though. If you open the Taxonomy Manager, you’ll see that the invoice taxonomy has some, if not all, of those fields.

If you want to customize the model used by the AI server, you can deploy a local machine learning server, which will be more complex.

You can also take a totally different approach and digitize the entire document with OCR, then use regular expressions to parse out parts of the invoice. That will not involve the UiPath machine learning server.
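To give a rough idea of the regex route, here is a minimal Python sketch. It assumes the OCR step has already run and handed you the invoice as plain text. The field names, the patterns, and the sample text are all made up for illustration; real invoices differ by vendor and OCR quality, so expect to tune the patterns against your own documents.

```python
import re
from typing import Dict, Optional

# Hypothetical patterns for three common invoice fields.
# The leading \b keeps "Total" from matching inside "Subtotal".
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"\bInvoice\s*(?:No\.?|Number|#)\s*[:\-]?\s*([A-Z0-9\-]+)", re.IGNORECASE
    ),
    "invoice_date": re.compile(
        r"\bDate\s*[:\-]?\s*(\d{1,2}[/.\-]\d{1,2}[/.\-]\d{2,4})", re.IGNORECASE
    ),
    "total": re.compile(
        r"\bTotal\s*(?:Due|Amount)?\s*[:\-]?\s*[$€£]?\s*([\d,]+\.\d{2})", re.IGNORECASE
    ),
}


def parse_invoice_text(text: str) -> Dict[str, Optional[str]]:
    """Return the first match for each field, or None when a field is not found."""
    result: Dict[str, Optional[str]] = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        result[name] = match.group(1) if match else None
    return result


# Made-up text standing in for what an OCR engine might return.
SAMPLE_TEXT = """ACME Corp
Invoice No: INV-2020-001
Date: 03/01/2020
Subtotal: 100.00
Total: $1,234.56
"""

if __name__ == "__main__":
    print(parse_invoice_text(SAMPLE_TEXT))
```

Returning None for a missing field, rather than raising, makes it easy to flag invoices that need manual review. The values come back as strings, so converting dates and amounts to proper types is a separate step.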

Another approach is to use an OCR engine in conjunction with custom extractors, but that’s something I can’t yet help with – I’m still trying to understand how this works myself haha.

