What is the algorithm of the keyword based classifier and for the trainer?

yrobert · May 12, 2021, 9:41am

hey there,

I was looking through the documentation of keyword based classifier and its trainer:

Keyword Based Classifier
https://docs.uipath.com/activities/docs/keyword-based-classifier-trainer
and so on…
But I did not find what algorithms are used to classify and to train. It is just an overview explanation.

I would like to understand the algorithms in order to be able to foresee the risks of error and stability.
Multiple questions rise like:

how is each keyword weighted?
how do these weights change after 1 training?
how many documents do need to cycle through to have sufficient training?
if we make a general category of document (like invoice) the titles / selected keywords can change a lot (depending on the invoice provider), how does the higher volume (hence more frequent training) of 1 format affect the categorization confidence of the other formats?

it would be great to have more details on how this keyword classifier and trainer work. more questions will probably arise. This topic is to start to deepen the understanding.
Thanks!

Topic		Replies	Views
Document understanding classifiers Activities activities , question , document_understanding	3	149	July 3, 2024
Document Understanding classifer Question Activities database , question	8	525	August 18, 2023
Keyword Based Classifier for OCR & ML Document Understanding question , document_understanding	3	1895	September 13, 2020
Intelligent Keyword classifier- Document Understanding Studio orchestrator , robot , studio , question , tools	1	1191	December 31, 2021
How to use Keyword based classifiers - Invoice and receipt AI processing Help	3	3607	July 18, 2020

What is the algorithm of the keyword based classifier and for the trainer?

Related topics