Best way to share Taxonomy files?

MichaelFray · July 29, 2020, 10:48am

Hi all,

I have several processes where I use IntelligentOCR, Classification and Extraction of data. I use the same 'Keywordlearning.json" and “taxonomy.json” for all projects.

What is the best way for all projects to share / use the same files?

Is there a way to store taxonomies in Orchestrator? It would be great with an activity like “Get Asset” - could be “Get Taxonomy”

Thoughts / input?

Cheers,
Michael Fray

supermanPunch · July 29, 2020, 11:07am

@MichaelFray One way maybe to use Serialization and Deserialization. You can Serialize the Taxonomy to a String Type and Store it in an Asset or maybe in a Queue. You can then retrieve it using a Get Transaction Item or Get Asset and then Deserialize the text to a Taxonomy Type. I have not tried Storing and retrieval but I just tried to Convert it by Serializing and it works. It might work for your case as well.

MichaelFray · July 29, 2020, 11:37am

That could work Thanks for the suggestion!

MichaelFray · July 29, 2020, 11:44am

Any chance you can share your sample? Thanks

supermanPunch · July 29, 2020, 11:52am

@MichaelFray Check this post :

The Conversion I think is From a Dictionary to a String. You can use the Taxonomy instead of the Dictionary. You can change the Type in Deserialize Json Activity to the Taxonomy Type.

Although there is no Set Asset or Add Queue Item used, you can add the Activity and then get the Item From the Orchestrator and then use Deserialize Json Activity on that Item.

Let me know if you face any issues.

Ioana_Gligan · July 29, 2020, 12:04pm

hello @supermanPunch and @MichaelFray,

You are right, this is the way to do it

To serialize, just use objTaxonomy.Serialize (or just grab the content of the taxonomy file), and you can use the class method DocumentTaxonomy.Deserialize(strTaxo) to obtain an objTaxonomy.

Same applies to DOM and extractionResults.

For Keyword Learning content: you can grab the content of the learning file, put that content as string wherever it suits you (database, queue, asset , bucket storage etc), then just retrieve the contents, and use that instead of the LearningFilePath, within the LearningData variable.
Even for training the keyword / intelligent keyword / classifiers, the LearningData is In/Out - so it gets the LearningData at the state before training, and when the training finishes, the same variable contains the new, modified LearningData.

Hope this helps!

Ioana

MichaelFray · July 29, 2020, 12:13pm

Thanks again!

MichaelFray · July 29, 2020, 12:13pm

Many thanks @Ioana_Gligan - very useful!

system · August 1, 2020, 12:13pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How can I effectively store taxonomy within Orchestrator and subsequently retrieve it for various document understanding projects? Document Understanding orchestrator , question , taxonomy	3	646	October 12, 2023
Work in same taxonomy Activities uiautomation , activities , question	1	324	October 9, 2023
LearningFilePath in Intelligent Keyword Classifier Trainer Activities activities , question , document_understanding	2	255	December 16, 2023
UseCase Taxonomy Manager Help activities , question	6	1715	April 28, 2020
Document Classification Help studio , question , c_sharp	3	3243	July 22, 2020

Most Active Users - Yesterday
ashokkarale
Yoichi
Julian_Muhlbauer
miwa_yamamoto
Anil_G
A_Learner
BharathKamalapur
madhabhazra0
Lalasa_Mulakaluri
bayu.herlambang
More details...

Best way to share Taxonomy files?

Related topics