I am back again.
This time I think I found a solid workaround for the problem I have been facing with Data Scraping.
I feel that pulling all the HTML and then parsing through it to get the required information is the most consistent way to do this with this web application (if anyone has an alternative, please let me know).
Well, let me walk you through what I am trying to do.
I am pulling the HTML located inside a DIV tag, then parsing through its nested tags to get the required information (Name, Address, etc.).
However, in some instances the data spans multiple pages. I have attached a snapshot showing the HTML code for the button used to move to the next page.
Whenever I reach the last page, the ‘button’ tag has a ‘disabled’ attribute equal to “disabled”. This can be the indicator that I no longer need to move forward and can finish the automation.
However, instead of parsing through each tag one at a time, which will use up memory, I am wondering if there is a quicker way to parse the HTML to get both the required info (Name, Address, etc.) AND the button tag status (disabled or not). Has anyone done something similar and can provide some helpful hints?
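To make the idea concrete, here is a minimal sketch of a single pass that collects both the fields and the button status. It is written in Python with the standard library only for illustration, not as the tool's own method. The class names "name" and "address" and the `PageParser` class are assumptions, since the real markup is only in the snapshot, and it assumes the forward button is the only button inside the DIV.

```python
from html.parser import HTMLParser


class PageParser(HTMLParser):
    """Collects record fields and the button state in one pass over the HTML."""

    # Hypothetical class names; replace with the ones in the real markup.
    FIELD_CLASSES = {"name", "address"}

    def __init__(self):
        super().__init__()
        self.records = []        # one dict per record found on the page
        self.last_page = False   # set to True when a disabled button is seen
        self._field = None       # field whose text is currently being read

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # Matches both <button disabled> and <button disabled="disabled">.
        # Assumes the forward button is the only button in the fragment.
        if tag == "button" and "disabled" in attrs:
            self.last_page = True
        hit = set((attrs.get("class") or "").split()) & self.FIELD_CLASSES
        if hit:
            self._field = hit.pop()
            # Assumed layout: each "name" element starts a new record.
            if self._field == "name" or not self.records:
                self.records.append({})
            self.records[-1][self._field] = ""

    def handle_data(self, data):
        if self._field:
            self.records[-1][self._field] += data

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current field,
        # so fields with nested markup would need extra handling.
        if self._field:
            value = self.records[-1][self._field]
            self.records[-1][self._field] = value.strip()
            self._field = None
```

After `parser = PageParser()` and `parser.feed(html)`, `parser.records` holds the extracted rows and `parser.last_page` says whether to stop. If the DIV also contains a "previous" button, that one may be disabled on the first page, so the check would need to match the forward button by its id or class instead.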
Does this seem like a reasonable approach?
Thanks so much.