Category "scrapy"

How can I handle pagination with Scrapy and Splash, if the href of the button is javascript:void(0)

I am trying to scrape the names and links of universities from this website: https://www.topuniversities.com/university-rankings/world-university-rankings/2021,

How can I handle pagination with Scrapy and Splash, if the href of the button is javascript:void(0)

I am trying to scrape the names and links of universities from this website: https://www.topuniversities.com/university-rankings/world-university-rankings/2021,

Is it possible to speed up move_to_element() in Selenium or what are other alternatives?

What is the fastest way to trigger an onmouseover event when scraping a webpage? So I want to move the mouse over a div element, which is then calling a javasc

Scrapy - ReactorAlreadyInstalledError when using TwistedScheduler

I have the following Python code to start APScheduler/TwistedScheduler cronjob to start the spider. Using one spider was not a problem and worked great. However

How to bypass a 'cookiewall' when using scrapy?

I'm a new user to Scrapy. After following the tutorials for extracting data from websites, I am trying to accomplish something similar on forums. What I want

How to Run Python Script (Scrapy) From Ktor

What I'm trying to do: Android Application (ADMIN) that gets job Title from user and fetches all the jobs related to it using Scrapy (Python) which are saved to

Python Scrapy Web Scraping : problem with getting URL inside the onclick element which has ajax content

I am beginner for the web scraping with scrapy . I try to scrape user reviews for specific book from goodreads.com . I want to scrape all of the reviews about b

Scrapy: command to overwrite previous export file

Set-up I export my data to a .csv file by the standard command in Terminal (Mac OS), e.g. scrapy crawl spider -o spider_ouput.csv Problem When exporting a

How do I scrape dynamic search results page with scrapy? [duplicate]

I'm trying to scrape the results from the website https://howlongtobeat.com/#search. However, when I scrape, only the first 6 results only out

Why does scraping a Persian website with a non-English URL generate errors?

I am attempting to scrape a Persian website with the following code: import urlparse, urllib parts = urlparse.urlsplit(u'http://fa.wikipedia.org/wiki/ص&#

pyconfig.h missing during "pip install cryptography"

I wanna set up scrapy cluster follow this link scrapy-cluster,Everything is ok before I run this command: pip install -r requirements.txt The requirements.tx

How do I call scrapy from airflow dag?

My scrapy project runs perfectly well with 'scrapy crawl spider_1' command. How to trigger it (or call the scrappy command) from airflow dag? with DAG(<args&

Scrapy Import Error: No Module named Scrapy

I installed scrapy using the command line pip install git+https://github.com/scrapy/scrapy@master --no-dependencies --upgrade once installed, I tried importi

How to Load more/show more pagination with scrapy-selenium

Getting response but scraping nothing! import scrapy from scrapy.selector import Selector from selenium import webdriver from selenium.webdriver.common.action_c

Nested for loop stop if null value using Scrapy

I use a nested for loop to get data of Weekdays. If one of the days is 'null' the loop stops at that day and doesn't get the rest of the days. I believe that I

How to use scrapy to scrape google play reviews of applications?

I wrote this spider to scrape reviews of apps from google play. I am partially successful in this. I am able to extract the name, date, and review only. My ques

How to do Scrapy historical output comparison using Spidermon

So Scrapinghub is releasing a new feature for Scrapy quality insurance. It says it has historical comparison features where it can detect if the current scrape

Scrapy: No module named 'scrapy.contrib'

I've looked everywhere for a solution to this. I didn't used to have a problem calling "from scrapy.contrib..." but now it throws this error. File "<frozen

Pyinstaller error on scrapy?

I am using scrapy importing it. I built the python file using pyinstaller. After building it I ran the file ./new.py. But the error pops: FileNotFoundError: [

No such file or directory error using pyinstaller and scrapy

I have a python script that uses scrapy and I want to make it into an exe file using pyinstaller. The exe file is generated without any error but when I open it