Category "web-scraping"

How do I continue a for loop when no data is present which causes an index error?

I am learning python and I'm guessing something very obvious is at fault here. I have a web scraping program that, I suspect, upon finding an empty data field t

Trouble scrapping sports table - python

I'm having a lot of trouble scrapping results off a sporting table in Python. I am new to scrapping and have tried everything I can find online. The website is

How to get table and it's element with Python/Selenium

I'm trying to get all the price in the table at this URL: https://www.skyscanner.it/trasporti/voli/bud/rome/?adults=1&adultsv2=1&cabinclass=economy&

if class not found return none in beatifulSoup

I'm trying to get None if the class is not found in web scraping. For example, in some cases stage-codes.html#10_99 doesn't exist in HTML. for st in soup.find_a

Web scraping from html code of a database using python

I am new to python and am learning things slowly. I have earlier performed API calls from databases to extract infromation. However, I was dealing with a partic

Extracting information from website with BeautifulSoup and Python

I'm attempting to extract information from this website. I can't get the text in the three fields marked in the image (in green, blue, and red rectangles) no ma

Element is not getting selected in Selenium Python Firefox Web Scrapping

I have problem with element selection checkbox with one of the list item in below code snippet. I'm able to select 1st 2nd & 4th Element in below code, howe

How to web scrape the text under <i class>?

I'm trying to get the text "PDF file" under <i class="fa fa-file-pdf-o">. I'm using BeautifulSoup and tried the following, but it didn't work: from bs4 im

How to web scrape the text under <i class>?

I'm trying to get the text "PDF file" under <i class="fa fa-file-pdf-o">. I'm using BeautifulSoup and tried the following, but it didn't work: from bs4 im

getting NoSuchWindowException while scrapping twitter usernames using Selenium

I have been trying to scrape twitter usernames by going inside the followers page but the issue is if I leave my pc there after some time I get this exception a

How will I get the content ie the title of a tag while web scrapping with puppeteer?

I am following this tutorial. https://www.digitalocean.com/community/tutorials/how-to-scrape-a-website-using-node-js-and-puppeteer I am trying to learn how to u

Extract div class text and sub tables in rvest

I am trying to recreate a table from this website under "Battle Pass Rewards." The final result is a data.frame with each of the following areas as different co

Scroll inside div doesn't wrk with Puppeteer

I try to scroll area inside div using Puppeteer. I tried to follow these answers: https://stackoverflow.com/a/67490337 and https://stackoverflow.com/a/52031392

I've been trying to scrape profile pictures in Instagram using this code but i keep getting TypeError: 'NoneType' object is not subscriptable

import requests from bs4 import BeautifulSoup as bs User = input("input the username of the user "); url = 'https://instagram.com/' + User +'/' r = requests.get

Handling this website which is redirecting to the same url with BeautifulSoup

So I'm scrapping this website: https://www.hepsiburada.com/apple-macbook-pro-m1-cip-8gb-256gb-ssd-macos-13-qhd-tasinabilir-bilgisayar-uzay-grisi-myd82tu-a-p-HBV

Multiple possible values when searching item by XPath | Scrapy

I would like to find title bar icon with rel = 'icon' or 'shortcut icon'. So I'm trying to do something like this: response.xpath("head/link[@rel='icon' or 'sho

BeautifulSoup request is returning an empty list from LinkedIn.com/jobs

I'm new to BeautifulSoup and web scraping so please bare with me. I'm using Beautiful soup to pull all job post cards from LinkedIn with the title "Security Eng

Click on hyperlink when scraping data from a table

I am trying to scape data from a web table. I login to a website, which gives me access to a web table. That table contains a hyperlink column (Id) like the exa

Python & Selenium: How to get Elements in DevTools with CDP (Chrome DevTools Protocol)

I'd like to get all source code in Elements with Chrome DevTools. Although I tried the following code, these values are not match with the above code. body = d

i want to scrape another class if first class is not found (n/a) in beautifulsoup, how to code this?

I'm scraping Indiegogo to see how many backers there are. However, because there are two different formats, it first scrapes the content for the first layout, b