Data Extraction (Data extraction scope --> Machine learning extorter --> configure Extractors)

krishna_priya · April 21, 2023, 11:17am

I try to Extract Details From the PDF I use the Document understanding for the extraction in the taxonomy (Data extraction scope → Machine learning extorter → configure Extractors)

in the Configure Extractor there are some fields only there
I want to extract the(Supplier Code, Item code and purchase ID) But those fields are Not available in the Data extraction scope (Configure Extraction )

how to extract the details of Supplier Code / Item Code / Purchase ID

From Extractor not applicable for my process.

Anil_G · April 21, 2023, 11:19am

@krishna_priya

Try using po number or item po number…they might be linked to the required ids

Cheers

arjunshenoy · April 21, 2023, 11:24am

Hi @krishna_priya

You might have to use data labeling session and train a custom ML model in order to extract the specified custom fields such as Supplier Code from your invoices, as the Out-of-the-Box Invoice model may not contain the fields what you want to extract.

Best Regards.

krishna_priya · April 21, 2023, 11:25am

yeah I tried bro
But Data can’t extracted

this is any is there for extract the details of (Supplier Code / Item Code / Purchase ID)

krishna_priya · April 21, 2023, 11:27am

want to extract the details of the Supplier Code / Item Code / Purchase ID in the PDF

Anil_G · April 21, 2023, 11:32am

@krishna_priya

Then you need to retrain the model with the columns you need deploy a new skill

cheers

arjunshenoy · April 21, 2023, 11:34am

@krishna_priya

Yes. Already available Out-of-the-Box model may not contain such fields in their schema already. You need to use a feature called ‘Transfer Learning’ in order to train your model to extract such custom fields.

In this you will take an existing model & add the fields that you want to extract. Then train the model with sufficient data, train & transfer the model, which can be used to extract the fields.

In order to do this, you need to create a dataset in the action center, label the data field that you want to extract, export the labelled data and train it in the pipeline. Deploy the ML skill on the trained pipenline package and use the same in the project.

Data labeling would look something like this:

Hope this helps,
Best Regards.

krishna_priya · April 21, 2023, 11:34am

I’m using the Endpoint and Document Understating ApiKey in the machine learning extraction

system · April 24, 2023, 11:35am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Document Understanding : How to Extract Invoice Fields not Supported by Builtin ML Model AI Center question , document_understanding , ai_center	9	2601	March 1, 2021
AI Center \| Data Extraction Scope Error \| Custom ML Model AI Center question , ai_center	29	1715	April 20, 2022
Machine learning extractor Help activities , question , ai	1	1383	March 23, 2020
ML Extractor Unable to Extract the details using the Extractor Studio studio , question , activities_panel	5	66	December 10, 2024
PDF - Invoice Data extraction only of product name and Quantity Studio	14	3070	April 30, 2020

Data Extraction (Data extraction scope --> Machine learning extorter --> configure Extractors)

Related topics