Training the models

Anupam_Mittal · June 28, 2019, 4:19am

First of all, congratulations on launching a great feature. I tried what you said and it worked in a jiffy. I am using Taxonomy Manager to define the fields that I want to use. However, I am ignoring the FieldID generated for these fields in Taxonomy Manager. Instead, as you advised, I am now using taxonomy strings from the list of available field names. And it works (like magic).
My next step is to train the extractor further. I have added a Train Extractor Scope activity to the workflow, but I cannot find any trainable activity that goes into it. I would like to improve the parser by using the train extractor to capture fields that are not available in the above list (such as the GST number).

alexcabuz · June 28, 2019, 9:33pm

Hi @Anupam_Mittal ,

Training is not possible for now; the models are available as is. Training the models on premises on arbitrary documents is a capability we are planning for Q1 2020. In the meantime, please let us know which fields you need added, and we will do our best to add them.

Regarding the GST number: it has a very specific format, so it can probably be extracted much more effectively using a regex.
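As a rough illustration of the regex approach, here is a minimal Python sketch. It assumes the Indian GSTIN layout of 15 characters (a 2-digit state code, a 10-character PAN, one entity character, the letter Z, and one check character). The names `GSTIN_PATTERN` and `find_gstins` are made up for this example, and the pattern checks only the shape of the number, not its checksum.

```python
import re
from typing import List

# Assumed GSTIN layout (15 characters): 2-digit state code, 10-character PAN
# (5 letters, 4 digits, 1 letter), 1 entity character, the letter Z, 1 check character.
GSTIN_PATTERN = re.compile(r"\b\d{2}[A-Z]{5}\d{4}[A-Z][1-9A-Z]Z[0-9A-Z]\b")


def find_gstins(text: str) -> List[str]:
    """Return every GSTIN-shaped token in the text, in order of appearance."""
    # Upper-case first so that lower-case OCR output still matches.
    return GSTIN_PATTERN.findall(text.upper())


# Illustrative sample text, not taken from a real invoice.
sample = "Vendor GSTIN: 27AAPFU0939F1ZV  Bill To GSTIN: 29AAGCB7383J1Z4"
print(find_gstins(sample))
```

In a workflow, the same pattern could be applied to the document text after digitization; the exact activity to use for that is not covered here.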

Cheers,
Alex.

Anupam_Mittal · June 29, 2019, 5:24am

Thanks for your response. As of now, these are the additional fields that I could use:
In items:
Serial number
Tax amount
Tax/SAC code

In the invoice:
Amount in words (this can help validate the total invoice amount)
Payment terms
Vendor name
Billing name

Anupam_Mittal · June 29, 2019, 5:27am

Regarding GST, I agree that identifying a GST number is easy with a regular expression. However, GST/VAT numbers usually appear twice on an invoice: once for the billing party and once for the vendor. If machine learning can distinguish between the two, that would be useful.
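Until a model can make that distinction, a rule-based fallback is one option. The sketch below is not machine learning: it labels each GSTIN-shaped match by the closest keyword in the text just before it. The keyword lists, the 40-character window, and the function name `label_gstins` are all assumptions for illustration, and the approach will fail on layouts where the label does not precede the number.

```python
import re
from typing import Dict

# Same assumed GSTIN shape as a plain regex extraction would use.
GSTIN_PATTERN = re.compile(r"\b\d{2}[A-Z]{5}\d{4}[A-Z][1-9A-Z]Z[0-9A-Z]\b")

# Hypothetical label keywords; real invoices may use different wording.
VENDOR_HINTS = ("vendor", "supplier", "seller", "sold by")
BILLING_HINTS = ("bill to", "billed to", "buyer", "customer")


def label_gstins(text: str, window: int = 40) -> Dict[str, str]:
    """Map each GSTIN to 'vendor', 'billing', or 'unknown' using nearby keywords."""
    roles: Dict[str, str] = {}
    for match in GSTIN_PATTERN.finditer(text):
        # Look only at the text immediately before the number.
        context = text[max(0, match.start() - window):match.start()].lower()
        # rfind returns -1 when a keyword is absent; a larger index means closer.
        vendor_pos = max(context.rfind(hint) for hint in VENDOR_HINTS)
        billing_pos = max(context.rfind(hint) for hint in BILLING_HINTS)
        if vendor_pos > billing_pos:
            roles[match.group()] = "vendor"
        elif billing_pos > vendor_pos:
            roles[match.group()] = "billing"
        else:
            roles[match.group()] = "unknown"
    return roles


# Illustrative sample text, not taken from a real invoice.
invoice_text = "Vendor GSTIN: 27AAPFU0939F1ZV\nBill To GSTIN: 29AAGCB7383J1Z4"
print(label_gstins(invoice_text))
```

A learned model would be more robust than this on varied layouts, which is the point of the request above.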
