Machine Learning Extractor- Internal taxonomy fields

Prathyu · September 30, 2020, 6:38am

Hi,

am using machine learning extractor to extract the fields from pdf. i have SGST, CGST and IGST in my pdf. am not able to configure the all three tax fields with internal taxonomy tax fields. and also not able to extract the address and total amount fields fully (only 2 lines in address and first two numbers in total amount am getting).

can anyone please help me on this, howw can i extract all those things.

Thanks in advance

JosephNehl · September 30, 2020, 3:39pm

If you are using an on prem orchestrator then address and total amount can be trained using data manager and the extractor trainer scope. If not, unfortunately there is no way at this time to train extractors in the community or cloud editions, but from my discussions in the past, UiPath is planning on implementing this hopefully in the next couple months for the cloud accounts.

Also, unfortunately the machine learning extractor cannot extract values that it doesn’t not have built in. If you want a machine extractor to pull these, you will need up upload your own custom extractor.

The easier option for both of these scenarios is to use multiple different extractors. You could try using one of the other extractors in tandem and extract only those values with it. I do understand that this is limited as the values must follow a pattern, but you could try using a regex extractor if the locations of these variables change, or a form extractor if they do not change locations.

Prathyu · October 1, 2020, 12:29pm

Hi @JosephNehl Thanks for the information

Prathyu · October 2, 2020, 11:36am

@JosephNehl, can you please tell me how can can i upload my own custom extractor. it would be very helpful for me.

Thanks
Prathyu

JosephNehl · October 2, 2020, 3:04pm

Creating your own custom extractor requires python programming knowledge because you are creating your own Machine Learning tool. If you wish to know more about this, documentation on the structure and requirements for the python files can be found here:

Topic		Replies	Views
Training the models Document Understanding activities , question	3	2310	June 29, 2019
Data Extraction (Data extraction scope --> Machine learning extorter --> configure Extractors) Studio studio , question , tools	8	456	April 21, 2023
How to train ML (Machine Learning) Extractor to take the correct invoice field? AI Center activities	1	1038	January 5, 2022
Exporting the output of Machine learning extractor Document Understanding	4	1600	June 21, 2022
Document Understanding Issue Studio studio , question	1	720	November 9, 2020

Machine Learning Extractor- Internal taxonomy fields

Related topics