General Questions regarding Classify Document Scope

potchak · August 6, 2020, 7:04pm

Background: We are in the process of building a POC for a healthcare company that wants a robot to help determine additional diagnoses based on a current diagnosis. Most of this is done post mortem for insurance purposes, so we decided to use document classification to determine these diagnoses.

Since I am very new to Document Understanding, I was hoping to have some questions answered regarding Classify Document Scope.
– What is the best way to use the classifier's Manage Learning? A single line? Multiple lines?
– What is the difference between using one line and using multiple lines?
– What is the difference between putting multiple words on one line and putting each word on a separate line?

TIA!

@Ioana_Gligan

tudor.serban · August 6, 2020, 7:25pm

I assume you are using the Keyword Based Classifier? In this case, for each document type you can define one or more keyword sets (I assume this is what you mean by lines). A document type is a valid candidate if ANY of its keyword sets matches the document, and a keyword set matches only if ALL of its keywords are found within the document. Of all the candidate types, the one with the highest confidence is selected.

Example: My document is “The quick brown fox jumps over the lazy dog” and my possible document types are “document about fox” and “document about bear”.

For document type “document about fox” I define the following keyword sets:

1. “fox”
2. “brown”, “fox”
3. “red”, “fox”

For document type “document about bear” I define the following keyword sets:

1. “agile”, “bear”
2. “bear”

When I classify the document, keyword sets 1 and 2 for “document about fox” match the document, but set 3 does not. Neither of the keyword sets for “document about bear” matches the document, so the type reported is “document about fox”.
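To make the ANY/ALL rule concrete, here is a minimal Python sketch of the matching logic applied to the fox/bear example. It is not the Keyword Based Classifier's actual implementation: the function names are made up for illustration, whole-word case-insensitive matching is an assumption, and the confidence scoring used to pick between several candidate types is not modeled at all.

```python
import re

# Hypothetical data layout: document type -> list of keyword sets.
TYPES = {
    "document about fox": [["fox"], ["brown", "fox"], ["red", "fox"]],
    "document about bear": [["agile", "bear"], ["bear"]],
}


def matching_sets(document, keyword_sets):
    """Return the 1-based positions of the keyword sets that match.

    A set matches only if ALL of its keywords appear in the document.
    Assumption: keywords are single words, compared case-insensitively.
    """
    words = set(re.findall(r"\w+", document.lower()))
    return [
        position
        for position, keywords in enumerate(keyword_sets, start=1)
        if all(keyword.lower() in words for keyword in keywords)
    ]


def candidate_types(document, types):
    """Return the types for which ANY keyword set matches the document."""
    candidates = {}
    for type_name, keyword_sets in types.items():
        matched = matching_sets(document, keyword_sets)
        if matched:
            candidates[type_name] = matched
    return candidates


doc = "The quick brown fox jumps over the lazy dog"
print(candidate_types(doc, TYPES))
```

With the sets above, only “document about fox” comes back as a candidate, via its sets 1 and 2. This also shows the difference between one line and several: putting “brown” and “fox” on the same line requires both words, while putting them on separate lines would accept either one.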

PS: The above is specific to the Keyword Based Classifier. Future classifiers (one of which, the Intelligent Keyword Classifier, is now available to test in preview) will have completely different setups.

potchak · August 6, 2020, 8:09pm

Yes, I was using Keyword Classifier. My apologies.
