Trying to capture text data from a webpage - Cant seem to make it work

richygeorge · May 17, 2020, 11:07pm

i am trying to capture the indicated data (See screen shot) from the below links. I have tried: screen scraping, get text full/hidden, CV get text. nothing seems to be working.

I am trying to scrape data from multiple pages. The data is not structured (missing fields and different positions). All the methods i have tried is giving me different errors. I am new to uipath. i am sure my lack of knowledge is the reason. any help would be much appreciated.

https://www.mlsli.com/homes-for-sale/389-Wellington-Road-East-Meadow-NY-11554-291890211
https://www.mlsli.com/homes-for-sale/2694-Forest-Avenue-East-Meadow-Ny-11554-295840218
https://mlsli.com/homes-for-sale/15-Meadow-Road-Old-Westbury-NY-11568-276979233
https://mlsli.com/homes-for-sale/2020-1995-Meadow-Lane-Southampton-NY-11968-283256627

gulshiyaa · May 17, 2020, 11:22pm

Hi @richygeorge

Use Anchor base activity inside that first left side use Find elements and indicate school: then in right side use get text and indicate East Meadow then use message box and pass get text output

Regards,
Gulshiyaa

richygeorge · May 17, 2020, 11:38pm

@gulshiyaa Thank you. So i tried that.
But i am getting the next line below instead. please see screen shot.
Also, anyway to get the the text after the “:” - “4 Full/1 Half”

Thanks for your help.

richygeorge · May 18, 2020, 1:11am

@Leon_Petrou

gulshiyaa · May 18, 2020, 7:39am

Change the selectors and try

richygeorge · May 18, 2020, 11:08am

I am not that familiar with selectors. how/what do i need to change to capture the right items?
i need the highlighted items extracted from the 1st screen shot

richygeorge · May 18, 2020, 11:09am

@AndersJensen

gulshiyaa · May 18, 2020, 2:31pm

Can you send me the link

ppr · May 18, 2020, 2:42pm

@richygeorge
you will combine some approaches but a 1 minute test with datscrapping looked not bad:
grafik

richygeorge · May 18, 2020, 10:10pm

@ppr Peter, can you please tell me how you did that? can you please screen shot the sequence or share the .xaml please? Thank you much appreciated. thanks.

richygeorge · May 18, 2020, 10:11pm

https://www.mlsli.com/homes-for-sale/389-Wellington-Road-East-Meadow-NY-11554-291890211
https://www.mlsli.com/homes-for-sale/2694-Forest-Avenue-East-Meadow-Ny-11554-295840218
https://mlsli.com/homes-for-sale/15-Meadow-Road-Old-Westbury-NY-11568-276979233
https://mlsli.com/homes-for-sale/2020-1995-Meadow-Lane-Southampton-NY-11968-283256627

richygeorge · May 18, 2020, 10:12pm

i think since different pages have different structure and data based on the city i am scraping it keeps giving me error.

gulshiyaa · May 20, 2020, 3:11pm

In Find element indicate before data

for example if you want to take bedrooms indicate the find element befor that particular data which means property type…

but in get text indicate bedrooms

problem sloved

then you will get bedrooms: 4 so you just split like this

out variable of gettext.split(":"c)(1)
so you will get 4

regards ,
gulshiyaa

sorry for the late reply

ppr · May 20, 2020, 5:05pm

@richygeorge
Sorry für delayed answer. I configured the Data scrapper on the sequence of the items from above Screenshot. As summary doesnt have a Second element, so IT can Not be included into the Set of correlated Data.

An Initial Look on the Elements structure found Out a series of divs with some particular Markers in class or Id.

All in all a more deep Analysis of the structure is recommended to find Out the best retrieval strategy. IT seems that by combining different Methods IT should be possible.l

Topic		Replies	Views
Data Scraping from Multiple web application input fields Help	47	4546	October 14, 2019
Help with extracting data Learn	0	753	May 6, 2020
If data scraping is not possible Help question	3	725	February 6, 2020
Extraction of info from web page into excel file Help	8	4601	August 26, 2019
Scrape text from web page Studio uiautomation , activities , question	4	812	February 20, 2020

Most Active Users - Yesterday
mkankatala
ashokkarale
ppr
Guru_Koli21
supermanPunch
Anil_G
akashjariwala
mojo001
Ramya_25
tharani.natarajan
More details...

Trying to capture text data from a webpage - Cant seem to make it work

Related Topics