I'm trying to get all information from this website using Python/Selenium: https://bitinfocharts.com/top-100-richest-bitcoin-addresses.html I have successfully
The task is to get data from the site. I have 800 URLs to request. But it takes a long time .I use aiohttp. At this stage, I have received links, by clicking on
Below is the xml that I'm trying to parse. <url> <loc>https://www.houseofindya.com/aqua-chanderi-pleated-sharara-pants-177/iprdt</loc> &
I would appreciate your help on this scraping problem. I would like to scrape this site: https://www.trustedshops.de/bewertung/info_XDAD39B874C275A0751198C2510C
the article passages are divided into different divs tags like in the image you can see it is written data-page-number="2" just like that the data is divided i
I have a lxml file and I need content from there. The file structure looks like this: <span class="ocr_line" id="line_1_1" title="bbox 394 185 1993 247">
I'm a web-scraping beginner and am trying to scrape this webpage: https://profiles.doe.mass.edu/statereport/ap.aspx I'd like to be able to put in some settings
import pandas as pd import glob import csv import re from bs4 import BeautifulSoup links_with_text = [] textfile = open("a_file.txt", "w") for filename in glob
import pandas as pd import glob import csv import re from bs4 import BeautifulSoup links_with_text = [] textfile = open("a_file.txt", "w") for filename in glob
I have the following code to scrape Reddit usernames: from bs4 import BeautifulSoup from requests import get from fake_useragent import UserAgent
I am trying to scrape the name of every favorites on the page of a user of our choice. but with this code I get the error "ResultSet object has no attribute 'fi
I am new to all of this, and this is my first real coding project so forgive me if the answer is obvious :) I am trying to extract sale items from [my grocery s
I'm trying to scrape my NYT mini crossword stats to then update a google sheet. But I'm having trouble with the login portion of the code. Here's my code so far
I am looking for a way to parse the images on a web page. Many posts already exist on the subject, and I was inspired by many of them, in particular : How Can I
I'm trying to get None if the class is not found in web scraping. For example, in some cases stage-codes.html#10_99 doesn't exist in HTML. for st in soup.find_a
I am new to python and am learning things slowly. I have earlier performed API calls from databases to extract infromation. However, I was dealing with a partic
I'm attempting to extract information from this website. I can't get the text in the three fields marked in the image (in green, blue, and red rectangles) no ma
I'm trying to get the text "PDF file" under <i class="fa fa-file-pdf-o">. I'm using BeautifulSoup and tried the following, but it didn't work: from bs4 im
I'm trying to get the text "PDF file" under <i class="fa fa-file-pdf-o">. I'm using BeautifulSoup and tried the following, but it didn't work: from bs4 im
I have been trying to scrape twitter usernames by going inside the followers page but the issue is if I leave my pc there after some time I get this exception a