'Can this infomation be scraped from this site - if so, what I am not seeing
I am not new to Python, but new to Scrapy and Splash.
Using Scrapy, I have successfully scraped static pages with tables, css and created .json files that were perfect. I have installed and been able to get Splash to work on two basic javascript sites.
I am not been able to get this site to work and would like some gentle guidance as to why. -- I put the URL into the Spash render page, and received HTML that does not include the data I am trying to get
These are three examples with the single value that I am trying to obtain.
https://www.mtonline.gov.bc.ca/mtov/map/mto/cwm.jsp?site=mem_mto_pla-view-title&titleId=1072148 The result I am looking for -- 092I04I037A
https://www.mtonline.gov.bc.ca/mtov/map/mto/cwm.jsp?site=mem_mto_pla-view-title&titleId=1072146 The result I am looking for -- 092I04I037B
https://www.mtonline.gov.bc.ca/mtov/map/mto/cwm.jsp?site=mem_mto_pla-view-title&titleId=1031671 The result I am looking for -- 092I04I037C
https://www.mtonline.gov.bc.ca/mtov/map/mto/cwm.jsp?site=mem_mto_pla-view-title&titleId=1026824 The result I am looking for -- 092I04I047A
Splash loads the page (via the render home page) and the returned Splash image does not include the map - I am lost how to load the map. I think I need to target specific Javascript -- Not quite sure.?
After it loads the map, I think I will need to 'click' on the information icon near the top left, which will load the data, then figure out how to scrap that - but that is after I get the map to load.
Again, I want to figure out the problem, but I am stuck at the point of it not loading the Map
Any suggestion or guidance would be appreciated
David
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|