I have a Word document of around 75 pages, and I need to extract specific data from it. For example, the document has an index of topics such as "Management discussion", which starts on page 31 and ends on page 44; on page 45 another topic, "Quality and materials", begins. The page numbers may vary from one document to another, but the headings will be the same. Can you please guide me on how to scrape the data from pages 31-44?
I got the topics and the page numbers, but I didn't get the content under the topic.
I have attached my flowchart. Please check.
Thanks
Flowchart.xaml (16.9 KB)
For both the start index and the end index, you are using the same topic:
intStartIndex = StrWord.IndexOf("Management’s Discussion and Analysis of Financial Condition and Results of Operations")
intEndIndex = StrWord.IndexOf("Management’s Discussion and Analysis of Financial Condition and Results of Operations") => This should be "Quality and materials"
And in step 4:
strManagementContent = strWordContent.Substring(intStartIndex, intEndIndex - intStartIndex)
Also, the intStartIndex and intEndIndex variables should be of type Integer.
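To make the start/end index idea concrete, here is a minimal sketch of the same logic in Python rather than in workflow expressions. The function name `extract_section`, the `search_from` parameter, and the sample text are hypothetical and only for illustration; they are not taken from the attached workflow.

```python
def extract_section(text: str, start_heading: str, end_heading: str,
                    search_from: int = 0) -> str:
    """Return the text from start_heading up to (not including) end_heading."""
    # Position of the first occurrence of the start heading at or after
    # search_from; str.find returns -1 when the heading is absent.
    start = text.find(start_heading, search_from)
    if start == -1:
        raise ValueError(f"start heading not found: {start_heading!r}")

    # Look for the end heading only after the start heading, so an earlier
    # occurrence cannot give a negative length.
    end = text.find(end_heading, start + len(start_heading))
    if end == -1:
        raise ValueError(f"end heading not found: {end_heading!r}")

    return text[start:end]


# Hypothetical sample standing in for the text read from the Word document.
sample = (
    "Intro text. "
    "Management discussion Revenue grew this quarter. "
    "Quality and materials Supplier audits were completed."
)
section = extract_section(sample, "Management discussion", "Quality and materials")
print(section)
```

One thing to watch for: if the same headings are also listed in the document's index, the first match will be the index entry rather than the section itself. In that case the search probably needs to start past the index, which is what the assumed `search_from` parameter is for.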
Hi,
Thanks for your help.
Sorry for the late reply. Yes, I made the changes you suggested…
Please find attached both the xaml file and the docx: Flowchart.xaml (12.0 KB) MSFT_FY20Q2_10Q.docx (893.3 KB)
Hi,
Thank you so much for your help; your code was really helpful.
I am able to scrape the page number for the start index. With reference to your code, I scraped the end index as well.
With the start index and end index, I have copied the content of the document.