Hi all,
I have a PDF of 165 pages and I want to scrape page one table data using document understanding. But when I went to make a template I noticed that I have to give every 165 pages matching info, which is not possible.
Is there any way to scrape data just from one page without giving 165-page matching info?
I have doubt that if you pass the 165 Page’s in du I mean in digitization it will take more time right while digitizing… I have worked on it soo i have 400 pages so it will taken lots time based on data we spillted soo that’s why I’m asking to you
If you need to split pages let me know i will guide that you
Then I would recommend to split and extract the pdf file first and then try to digitise it
You can use the Extract PDF Page Range activity to split the PDF file into individual pages. Then, you can use the Extract Document Data activity to extract the data from the first page.
I have created a taxonomy, three categories for three different page ranges(ex. 1,5-10,50-55pages)
I wanted to convert the 3 tables of pdf into 3 different datatables.
Can I make 3 “manage templates” from “Form Extractor” activity for 3 different page ranges?
Can you say that for pages 1,5-10,50-55 all are come always or 1 or 5-10 or 55-10 any one prasent in the pdf
Coming to manage templates, in hole pdf if it has same thing to extract… You can create the 3 template s in single form extractor that will ok
To create
But if extracting is different for your all range of pages you can take multiple of form extractors nothing will happen soo it will work
Note : Creating a multiple templates is in the scenario of for example you have one pdf and you need to extract data but the data slightly different like adding extra space at the time you can add multiple templates to that extractor… But extracting fields is different means you need to take the multiple extractor @Abhoy_Kumar_Mondal