Extracting Data from PDFs

Hi,
I have several types of PDF with different templates, but they all use the same field labels.
What is the best way to extract specific data using the labels?
Example: all the files contain these labels: Number, Name, Surname.
How do I create a workflow that extracts this data wherever it is positioned?

Hi @Erradish, for this, use Document Understanding if you are working with multiple PDFs that contain the same type of data.

cheers
Happy Automation :grinning:

Is there an example of this process, or something that will help me practice with the ML?

Sure, I'll send you a link showing how Document Understanding works.
There is a full playlist for Document Understanding with details on each activity. Go and check the full playlist.

cheers Happy Automation

@Erradish there is one more trick: you can use string manipulation to extract the details if the PDFs share the same pattern.
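To illustrate the string-manipulation idea, here is a minimal sketch in Python rather than a UiPath workflow. It assumes the PDF text has already been read into a plain string and that each label is followed by its value on the same line, optionally separated by a colon or dash. The function name `extract_fields` and the sample texts are hypothetical, made up for this example.

```python
import re

# Labels from the question; every template is assumed to use these exact words.
LABELS = ("Number", "Name", "Surname")


def extract_fields(text, labels=LABELS):
    """Return {label: value} for each label found in the text.

    Assumes a 'Label: value' or 'Label - value' layout (an assumption,
    not something the original PDFs are known to follow).
    """
    fields = {}
    for label in labels:
        # \b keeps "Name" from matching inside "Surname".
        pattern = rf"\b{re.escape(label)}\b\s*[:\-]?\s*(.+)"
        match = re.search(pattern, text)
        if match:
            fields[label] = match.group(1).strip()
    return fields


# Two hypothetical templates with the same labels in different positions.
sample_a = "Number: 12345\nName: Mario\nSurname: Rossi\n"
sample_b = "Surname - Bianchi\nSome other text\nName - Anna\nNumber - 987\n"

print(extract_fields(sample_a))
print(extract_fields(sample_b))
```

Because the lookup is driven by the label and not by the position, the same function handles both layouts. It will break down if the value sits in a different column or table cell from its label, which is where Document Understanding is the better fit.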

Thank you,
I've been following his guide, but I get an error while retrieving capabilities for Machine Learning. Can you help me with this error?

Hi @Erradish, can you elaborate on the error you are getting, or share a screenshot of it?

Sure, I'll attach the screenshot.
I watched and replicated every video, and now I'm getting this error but can't solve it.
[Screenshot: Cattura]

@Erradish this website or portal endpoint is likely broken or no longer valid; you have to look for the new endpoint…

I took it from the forum, from this article: Receipt and Invoice AI - Now available in Public Preview!

@Erradish yes, that endpoint was working at the time, but it is not working now.

Ok, so what should I do now?

@Erradish look for the other ML endpoints on Google,
or I'll attach a new YouTube link; I hope it will help you …

cheers Happy Automation

@Erradish

These are all public endpoints; try them.

The API key will be present in Admin -> Tenant -> under Document Understanding.

Cheers
