Extracting Datas from pdfs

Erradish · July 29, 2024, 10:45am

Hi,
I’ve a different type of pdf with different template but they have the same labels for field.
Which is the best way to extract some specifics data using label?
Example: In all the file I have those labels: Number, Name, Surname.
How do I create a workflow that exctract this data wherever thei position?

singh_sumit · July 29, 2024, 11:03am

hie @Erradish For this use Document Understanding if you are working with multiple pdf with same data type .

cheers
Happy Automation

Erradish · July 29, 2024, 12:28pm

Is there any example of this process or something that will help me to practice with the ML?

singh_sumit · July 29, 2024, 12:33pm

Sure i"ll sending you the link how document understanding work
there is a full playlist for Document Understanding will each activity Detail.go and check the full playlist

cheers Happy Automation

singh_sumit · July 29, 2024, 12:35pm

@Erradish and there is one more trick you can use string manipulation to extract the details if multiple pdf have same pattern

Erradish · July 30, 2024, 12:16pm

Thank you,
I’ve been following his guide but I have an errore while retriving capabilities for Machine Learning. Can u help me with this error?

singh_sumit · July 30, 2024, 12:25pm

Hie @Erradish can you elaborate on the error that you are getting while working or share some screenshot of the error…

Erradish · July 30, 2024, 12:35pm

Sure, I’ll attach the screen.
I watched and replicated every video and now I’m getting this error but can’t solve it
Cattura

singh_sumit · July 30, 2024, 12:39pm

@Erradish this website or portal is likely corrupt you have to look for the new endpoint…

Erradish · July 30, 2024, 12:40pm

I took it from the forum.
At this article Receipt and Invoice AI - Now available in Public Preview!

singh_sumit · July 30, 2024, 12:44pm

@Erradish yes that time this end point is working but as for now its not working now

Erradish · July 30, 2024, 12:45pm

Ok, so what should I do now?

singh_sumit · July 30, 2024, 12:49pm

@Erradish look for the other ML end point on google
or i"m attach a new youtube link hope it will help you …

cheers Happy Automation

Anil_G · July 30, 2024, 5:45pm

@Erradish

These are all public endpoints try them

api key will be oresnet in admin-> trnant → under document understanding

Cheers

system · August 2, 2024, 5:46pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What pdf extraction approach would be for getting different product details from a PDF having multivalued fields? Studio uiautomation	8	366	May 30, 2023
How can you extract data from many pdfs into a excel Activities excel , pdf , question , document_understanding	2	1394	March 7, 2022
Not extracting consistent data PDF Studio studio , question , activities_panel	1	125	April 29, 2024
Extract data from PDFs with varying structures Studio pdf , studio , data_scraping , question	4	566	October 18, 2023
Having Issues Extracting Data from semi structured pdf Document Understanding	2	1096	June 21, 2020

Extracting Datas from pdfs

Related topics