I am currently working on creating a Document understanding classic project and would like to know how I can split a document. I currently have 1 pdf and it contains 4 document types that i would like to separate.
hi @Anil_G,
- does this mean I would have to annotate in document manager for those documents and then split in the studio process for that specific document type?
- the reason I as is becuase within the 1 pdf it contains 4 different document types, but there is no way of knowing where those document types are located in the 1 pdf
Basically you would have trained with 4 doc types…so you would split the pdf into 1 page each and send each file separately to classify and depending on which type it is it would be clasified
Cheers
@Anil_G but how will i annotate…wouldnt that make the model misleading?
If each page is a different document type then you would annotate each type itself right…that is what classification does
cheers