Want to extract different forms of data from PDF

Reshmita_Vemulapalli · February 28, 2024, 3:20pm

I want to extract tables from PDFs, but the format differs from one PDF to another. I tried an ML model in AI Center with a taxonomy, but it worked based on position. My client's requirement is that values be extracted based on the name of the table or field, rather than by developing a position-based ML model (for example, from drawings we need to extract the latest revision number, date, and title).

Anil_G · February 28, 2024, 5:12pm

@Reshmita_Vemulapalli

That model does not work on position.

It takes the column names etc. into consideration.

Cheers

Reshmita_Vemulapalli · February 29, 2024, 4:52am

Thank you for your reply.

But it will not take the column names, right? In AI Center we mark the position and only indicate which value has to be taken. What I need is that, given a field name (e.g. stdrollno), it returns the corresponding number.
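To make the requirement concrete, here is a minimal Python sketch of a lookup by label name instead of by position. It assumes the PDF text has already been extracted to a plain string by some other step, and that the value sits on the same line as its label. The function name `extract_by_label` and the sample documents are hypothetical, and this is not how AI Center works internally.

```python
import re

def extract_by_label(text, label):
    """Return the value that follows `label` on the same line, or None."""
    # Optional separator (":", "=", "-") between the label and its value.
    pattern = re.compile(
        r"^[ \t]*" + re.escape(label) + r"\b[ \t]*[:=\-]?[ \t]*(\S.*?)[ \t]*$",
        re.IGNORECASE | re.MULTILINE,
    )
    match = pattern.search(text)
    return match.group(1) if match else None

# Two made-up documents: same labels, different positions and separators.
doc_a = "Name: Asha\nStdRollNo: 4512\nDate: 2024-02-28\n"
doc_b = "Date = 2024-02-29\nTitle - Pump layout\nstdrollno 7781\n"

print(extract_by_label(doc_a, "stdrollno"))
print(extract_by_label(doc_b, "stdrollno"))
```

The lookup depends only on the label text, so the line the label appears on does not matter. It does break as soon as the value is on a different line from its label.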

Anil_G · February 29, 2024, 4:54am

@Reshmita_Vemulapalli

So you are saying the column names will change?

It does use the specified column to identify the value, but if the column names keep changing then you might need to use an NER model or similar. There is no direct way.

cheers

Reshmita_Vemulapalli · February 29, 2024, 4:58am

The column names will be the same, but they will be in different positions in each PDF (e.g. 100 PDF layouts). We cannot train the machine learning model on every document, right? That would be very difficult.

Could you please guide me on using an NER model, as you mentioned?
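For drawings, where the value may sit to the right of its label in one layout and below it in another, the same idea can be sketched on OCR word boxes. This is only an illustration under assumptions: the `Word` structure, the `value_near_label` helper, the tolerances, and the coordinates are all made up, and it only handles single-word values.

```python
from dataclasses import dataclass

@dataclass
class Word:
    text: str
    x: float  # left edge
    y: float  # top edge (grows downward)

def value_near_label(words, label, labels, row_tol=5.0, col_tol=20.0):
    """Return the nearest non-label word right of, or else below, `label`."""
    known = {name.lower() for name in labels}
    anchor = next((w for w in words if w.text.lower() == label.lower()), None)
    if anchor is None:
        return None
    # Other labels are never treated as values.
    values = [w for w in words if w.text.lower() not in known]
    # First choice: same row, to the right of the label.
    right = [w for w in values if abs(w.y - anchor.y) <= row_tol and w.x > anchor.x]
    if right:
        return min(right, key=lambda w: w.x - anchor.x).text
    # Fallback: same column, below the label.
    below = [w for w in values if abs(w.x - anchor.x) <= col_tol and w.y > anchor.y]
    if below:
        return min(below, key=lambda w: w.y - anchor.y).text
    return None

LABELS = ["REV", "DATE"]

# Layout A: values to the right of the labels, near the bottom of the page.
drawing_a = [Word("REV", 500, 700), Word("C", 540, 700),
             Word("DATE", 500, 720), Word("2024-02-28", 540, 720)]
# Layout B: values below the labels, near the top of the page.
drawing_b = [Word("REV", 50, 40), Word("DATE", 120, 40),
             Word("B", 50, 60), Word("2024-02-29", 120, 60)]

print(value_near_label(drawing_a, "REV", LABELS))
print(value_near_label(drawing_b, "DATE", LABELS))
```

Both layouts are handled by the same rule because the search starts from wherever the label is found, not from fixed page coordinates.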

Anil_G · February 29, 2024, 5:20am

@Reshmita_Vemulapalli

You need not train on every document. Train on similar documents, and the model should then be able to pick up the columns as needed. AI Center can handle those changes.

Some insight on NER

cheers
