Category "web-scraping"

Python & Selenium: How to get values generated by JavaScript

I use Selenium in Python for scraping. I can't get values though these values are displayed on the browser. So I checked the HTML source code, then I found that

Scrape Website that is running meteor, using python requests

You see, there is this website called edabit. All you need to know is that I want to scrape the website without using selenium. I want to learn how. What does s

Why does my Selenium script work only every other time?

I think I should start off by mentioning I have Mac OS and am using Safari browser. This script is a web-scraper that is supposed to search for a given type of

How to select and scrape specific texts out of a bunch <ul> and <li>?

I need to scrape "2015" and "09/09/2015" from the below link: lacentrale.fr/auto-occasion-annonce-87102353714.html But since there are many li and ul, I cant sc

Selenium won't load specific webpages on bet365

I am trying to scrape data from bet365 for basketball odds however I am encountering where certain leagues webpages wont load even when simply just loading the

How to scrape sofifa website positions. Text inside of span beautiful soup

So I am webs scraping the sofifa website into a workable csv. Each player gets a column. My main problem is the position section of the website is only exportin

Scrapyd-client schedule.json produces an AttributeError

I am trying to deploy my Scrapy spider using Scrapyd and Scrapyd-client. I have managed to successfully create a project containing my spider, but when i try to

Webscrap product price with javascript

im making a blog were i will show a product and all of it's prices on all the websites that sell this product, but i whant the prices to be up to date so i deci

Selenium cant find an element unless the page is inspected

I am automating with selenium and noticed something weird, on a specific webpage, there is a ul list containing some li roles that are buttons. For some reason,

Selenium Webhook equivalent / listen to new data from telegram without bot

I have a web scraping project that requires the scraper to wait for when there is new data. I thought about reverse engineering the websocket but that might tak

Scraping webpage with tabs that do not change url

I am trying to scrape Nasdaq webpage and have some issue with locating elements: My code: from selenium import webdriver import time import pandas as pd driver

How to extract the price in selenium if they have same tag?

I would like to get first prices ($916.65) in pyhton but I do not know which method do I need to apply ? Each price has the same tag (div class="price"). Shoul

Arbitrary Value JsonDiffPatch.net diff function output

I am trying to find the changed values on website. For that I have written a iterator which will hit the website and get the html output. I have saved that html

how to get the value of a specific column of each row in a html table?

I need to get the sensitivity (7th column) of each setting (row) site: https://prosettings.net/cs-go-pro-settings-gear-list/ table id: "table_1" 2 rows class: "

How to fix [WARNING]: Timed out connecting to Chrome, retrying

I access Websites in a loop via selenium Java based. Some of the sites crash imediately so that i get the error [1618982990.911][WARNING]: Timed out connecting

Can´t access pseudo ::before ::after element in Selenium

I want to get the text within a var tag which lies within a pseudo class: ::before ... <var class="added">Element</var> .. ::after I tried How locat

Web Scraping Google Scholar Author profiles

I have used scholarly package and parsed the author names generated in the 3 question its method search by author name to get the author profiles including all

Working code gives error when run on any other PC

I have working code that requests information from a website. When I send the file to another PC and run the code, I get: "Run-time error'91': Object variable

Contradictory rules in robots.txt

I'm attempting to scrape a website and these two rules seem to be contradictory in robots.txt User-agent: * Disallow: * Allow: / Does Allow: / mean that I can

Problem scraping Bet365 with headless mode

A few days ago I started the development of a bot to capture data/results from virtual sports (specifically football) at Bet365 (note: I know this is not allowe