Document Understanding: Unexpected page extraction limitations

Hello there.

I am having an issue with data extraction from my documents while using the UiPath Extended Language OCR. When I run the process with a 20-page document as input, only the first 2 pages are extracted. I saw in another forum thread that extraction on Community is limited to 2 pages, but in my case I am on Enterprise.

I also tried another document with 4 pages, and all of its pages were extracted. The OCR engine and extractor were the same as before.

Before using this OCR, I tried the UiPath Document OCR and I didn’t have this issue. The reason I want to use UiPath Extended Language OCR is because of the multilingual documents I have.

I also debugged the process and DocumentText and ExtractionResults include all the text of the document I provided. When I convert the ExtractionResults to a DataSet and write it to a workbook I get only the first 2 pages.

Do you have any idea how I could solve this issue?

@aikaterini.karakasidi

That might also be because of the training you have done. If the pages are not recognized as valid, they are left out of the final results.

If you can share a sample PDF, that would help as well.

Also, UiPath Document OCR supports multiple languages as well. The extended one is meant to be used when the normal one does not work. Is that the case?

cheers

Hey @Anil_G thank you for your response.

During the training of the extractor I included documents with more than 1 or 2 pages (like 10 pages). The document type is bank statements and I uploaded bank statements from only 2 banks.

Unfortunately I can’t share a sample because the documents are confidential.

@aikaterini.karakasidi

As mentioned, please check the language support of UiPath Document OCR. If your languages are supported, there is no need for the extended one.

cheers

I solved it. I was using the Intelligent Keyword Classifier, and its PerformDocumentSplitting checkbox was checked. I unchecked it and now it works.
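
For readers hitting the same symptom, here is a rough toy model in Python of how document splitting could lead to a partial extraction. It is not UiPath code and does not call any UiPath API. The `PageRange` class, the `pages_extracted` helper, and the split points (2 pages, then 18 pages) are all hypothetical, chosen only to mirror the 20-page case described above. The assumption is that a splitting classifier returns several page ranges and that the workflow extracts from only the first one.

```python
from dataclasses import dataclass


@dataclass
class PageRange:
    """Hypothetical stand-in for one classified sub-document."""
    start_page: int   # zero-based index of the first page in the range
    page_count: int   # number of pages in the range


def pages_extracted(ranges):
    """Return the sorted page indices covered by the ranges sent to extraction."""
    covered = set()
    for r in ranges:
        covered.update(range(r.start_page, r.start_page + r.page_count))
    return sorted(covered)


# Assumption: with splitting on, the 20-page file comes back as two sub-documents.
split_ranges = [PageRange(0, 2), PageRange(2, 18)]

# Assumption: the workflow passes only the first classification result onward.
only_first = pages_extracted(split_ranges[:1])

# With splitting off, the whole file is one range and every page is covered.
whole = pages_extracted([PageRange(0, 20)])

print(len(only_first), len(whole))
```

If this matches what is happening in your workflow, the alternatives are to turn splitting off, as done here, or to loop over every classification result rather than only the first.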

Thank you anyway.
