I want to extract data from a prescription. The prescription is in Hindi.
The fields I need to extract are: Doctor Name, Mobile Number, Email, Address.
If you are trying to extract letterhead details that are printed rather than handwritten by the doctor, you can use OCR extraction or Document Understanding for it.
Thanks,
Ashok
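As a rough illustration of the OCR route for the printed letterhead fields: once an OCR engine has returned plain text, the mobile number and email can often be picked out with regular expressions. The sketch below is in Python rather than a UiPath workflow. The function name `extract_contact_fields`, the sample text, and the assumption of 10-digit Indian mobile numbers written without internal spaces are all illustrative and not taken from this thread. Doctor name and address have no fixed pattern, so they would still need a trained extractor or a template.

```python
import re

# Map Devanagari digits to ASCII so a number printed in Hindi numerals still matches.
DEVANAGARI_TO_ASCII = str.maketrans("०१२३४५६७८९", "0123456789")

# Simple email pattern; good enough for letterhead text, not a full RFC validator.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")

# Assumption: Indian mobile number, 10 digits starting with 6-9,
# optionally prefixed by +91, 91, or 0, with no spaces inside the number.
MOBILE_RE = re.compile(r"(?<![0-9])(?:\+?91[\s-]?|0)?([6-9][0-9]{9})(?![0-9])")


def extract_contact_fields(ocr_text):
    """Return the first email and mobile number found in OCR text (None if absent)."""
    text = ocr_text.translate(DEVANAGARI_TO_ASCII)
    email = EMAIL_RE.search(text)
    mobile = MOBILE_RE.search(text)
    return {
        "email": email.group(0) if email else None,
        "mobile": mobile.group(1) if mobile else None,
    }


if __name__ == "__main__":
    # Hypothetical OCR output from a letterhead.
    sample = "मोबाइल: ९८७६५४३२१०\nEmail: clinic@example.com"
    print(extract_contact_fields(sample))
```

Numbers that OCR splits with spaces (for example "98765 43210") would not match this pattern and would need extra cleanup first.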
Hindi is among the supported languages for OCR. Did you try extracting with a Document Understanding (DU) template?
Cheers
@Anil_G and @ashokkarale Thanks for replying.
My requirement is to extract data from prescriptions in JPG and JPEG format. Some of the data is in the letterhead, and sometimes the doctor's name is handwritten.
I used Document Understanding in AI Center. While labeling the document, it cannot recognize the correct text from the image. For your reference, I have attached the document below.
I used this ML package: Out of the box Packages → UiPath Document Understanding → UiPath Document Understanding.
Pipeline Type - Train run
In UiPath Studio, I used OmniPage OCR to digitize the document.
Extractor - ML Extractor.
Please let me know what I need to correct.
In the Document Understanding project settings, did you choose the OCR engine as shown below?
cheers
Yes, I have selected UiPath Extended language, and the OCR URL is auto-generated, but I cannot find the OCR API key. Can you tell me where to find it?
If you look, the OCR key is marked as optional and is auto-populated.
Just set Apply OCR to Yes.
cheers
If you select everything properly, it should work.
I just tried it, and it is extracting.
Steps:
- Create an AI Center project
- Upload the documents to a dataset
- Create a labeling session
- In the labeling session, first select the OCR engine as Extended and set Apply OCR to Yes
- Then import the documents and label them
cheers
Thanks @Anil_G. It's working now for newly uploaded files, but not for the existing files.