Category "web-scraping"

Contradictory rules in robots.txt

I'm attempting to scrape a website and these two rules seem to be contradictory in robots.txt User-agent: * Disallow: * Allow: / Does Allow: / mean that I can

Problem scraping Bet365 with headless mode

A few days ago I started the development of a bot to capture data/results from virtual sports (specifically football) at Bet365 (note: I know this is not allowe

Web scraping returns empty character

I am attempting to put together a real estate dataset, however when I try and extract the text from the website, for example the names of the places, it returns

Browser crashes on auto-scroll puppeteer

I have a scraper that scrapes the video urls of all the posts of a user. I have implemented auto-scroll so that all the videos are loaded for me to scrape. I te

scrapy post request not updating data in airtable

I need to create records in an airtable base and have the following code in scrapy: url = "https://api.airtable.com/v0/appuhKmlhLIIEszLm/Table%201" payload = j

Scraping newspaper article titles with google news

Below is my code for scraping news about domestic violence. This code worked perfectly the first time I used it. But back then I covered only 2-3 months period,

=importhtml Google Sheets table

Unfortunately I don't know how to source this and was wondering if someone could show me how. I am trying to learn hear so the correct answer is great, but the

How hunter.io works (find all email address from domain URL)?

I'm working on a tool to find all emails address from a domain name, as snov.io/hunter.io does. Do you have any idea about how to find all email address from th

Why is web scraping stock prices through beautiful soup returning a different price than the one on the Yahoo Finance page?

I am trying to write a program that will give me the stock price for a few different stocks, but when I run my program, it returns 116.71, while Yahoo Finance h

python selenium scraping a betting site

I recently started coding a program with python selenium. The goal of the project is to calculate arbitrage possibilitys between two sport bookies (but that doe

scraping with rvest with different number of rows in r

https://www.transfermarkt.de/alexander-bade/profil/spieler/31 Is it possible to scrape the whole table in one scrape?

=importxml, Website to Google Sheets - getting #N/A every time

Website Link https://redacted xml options I have tried so far <span aria-labelledby="amount">722</span> //*[@id="amount"]/h3/span[2] /html/body/di

node js puppeteer How do I scrape values ​from duplicate selector?

I want to scrape the value, but its selector is duplicate, I don't know how to solve. It will always scrape the value of the one above. here my code: const pupp

puppeteer bypass cloudflare by enable cookies and Javascript

(In nodeJs -> server side only). I'm doing some webscraping and some pages are protected by the cloudflare anti-ddos page. I'm trying to bypasse this page. B

How to publish a .ics file into my iPhone Calendar?

I have an iPhone and I recently discovered about exportable .ics files as a method to add events/calendars. I have searched Internet for a while and I didn't fi

how to use selenium on web apps built with gradio?

I am trying to click a button on selenium on a web app built using selenium. But ending up with error "element not found". I want to click the submit button usi

how do i press enter on a cr-button with selenium python

# import the webdriver from selenium import webdriver from selenium.webdriver.common.by import By from selenium.webdriver.common.keys import Keys import time im

Scraping a simple website with colly in golang does not return any data

I'm trying to scrape a simple website that looks like this: <html> <head> </head> <body> <pre> "Name Surname 1 Name Su

AttributeError: 'NoneType' object has no attribute 'strip' - Scrapy doesn't crawl all the elements

My spider doesn't crawl all the elements. As I can see now, one of the errors is an attribute error which I don't know how to fix it. This is a non-English webs

Facebook Group Post Scraping Using Selenium Only Returns One Post

I'm in the process of building a Facebook Group Scraper, I have managed to write the code to log-in + scrape the name of the, but for some reason, my code is on