Category "beautifulsoup"

Python terminal closes when importing BeautifulSoup

I have a simple Python program that is supposed to scrape some information from the internet and do stuff with it. When I run the code in PyCharm (IDE) it work

Why I'm getting "UnicodeEncodeError: 'charmap' codec can't encode character '\u25b2' in position 84811: character maps to <undefined>" error?

I'm getting "UnicodeEncodeError: 'charmap' codec can't encode character '\u200b' in position 756: character maps to <undefined>" while running this code: from bs4 imp
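
A 'charmap' encode error of this kind generally means the text contains characters that the target encoding cannot represent. The following is a minimal sketch, not taken from either question: it assumes the failing codec is cp1252 (a common Windows code page; the excerpts do not say which one was in use), and the sample string is hypothetical.

```python
# Hypothetical sample text containing the two characters named in the errors above.
text = "price \u25b2 up\u200b"

# Assumption: the failing codec is cp1252, which has no mapping for either character.
try:
    text.encode("cp1252")
    strict_failed = False
except UnicodeEncodeError:
    strict_failed = True

# Option 1: substitute "?" for anything the codec cannot represent.
replaced = text.encode("cp1252", errors="replace").decode("cp1252")

# Option 2: keep every character by encoding as UTF-8 instead.
utf8_bytes = text.encode("utf-8")

print(strict_failed)
print(replaced)
```

When the output goes to a file rather than the console, passing `encoding="utf-8"` to `open()` is usually the simpler route, since UTF-8 can represent every Unicode character.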

How to ignore infobox when scraping title from Wikipedia anchor text?

I am trying to scrape the first 20 links on a Wikipedia page but I want to ignore the infobox on the right side. It has a 'table' tag. Here is what I have so fa

Webscraping Google Search Results Using Google API - Returns same result over and over again

My problem: Hi everyone, I am attempting to develop my very first web scraper using the Google API and Beautiful Soup in Python. The aim is for the scraper to

Scraping google play reviews

I am new to programming and I have recently tried to scrape google play reviews with python using the following program: from bs4 import BeautifulSoup import u

Transfermarkt web scraping failed

I have created the code below to scrape data from Transfermarkt but it gives an error: IndexError Traceback (most recent call l

Not able to fetch <h3> tag from the below website using Beautiful Soup

I'm trying to fetch the top 100 movie names, but I'm not able to access the h3 tag. How can I fetch it from this link? Edit - Using below code to extract h3 - import request

How do I scrape location-dependent data on Amazon?

Whenever I try to scrape amazon.com, I fail, because product information changes according to location on amazon.com. This changing information is as follo

UnicodeDecodeError 'utf-8' codec can't decode byte 0x92 in position 2893: invalid start byte

I'm trying to open a series of HTML files in order to get the text from the body of those files using BeautifulSoup. I have about 435 files that I wanted to run
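
Byte 0x92 is not a valid start byte in UTF-8, but in Windows-1252 it is the right single quotation mark, so this error often points to files saved in that encoding. The sketch below assumes that is the case here (the excerpt does not confirm it); the sample bytes are hypothetical.

```python
# Hypothetical file content; 0x92 is a right single quote (U+2019) in cp1252.
raw = b"It\x92s a test"

# Decoding as UTF-8 fails because 0x92 cannot start a UTF-8 sequence.
try:
    raw.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False

# Assumption: the files are really Windows-1252 encoded.
decoded = raw.decode("cp1252")

# Fallback when the encoding is unknown: keep going, marking bad bytes with U+FFFD.
lossy = raw.decode("utf-8", errors="replace")

print(utf8_ok)
print(decoded)
```

The same choice can be made when reading from disk, for example `open(path, encoding="cp1252")` or `open(path, encoding="utf-8", errors="replace")`, where `path` stands for one of the HTML files.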

Scraping Wikipedia for information with Beautiful Soup

I managed to scrape wikipedia for names of US Presidents using Beautiful Soup. After which I converted them into dataframe. names=[all the president's name] wik

Extract business hours from Google using only beautiful soup

Goal: Extract the business hours and its closed status from the Google Search results. Screenshot with the highlighted working hours and closed status (example U

How to use re.sub to replace matches with a series of numbers

I'm trying to remove all HTML tags from a text file and, after some processing on the text, I have to put the HTML tags back in the text, so I thought maybe rep
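
One way to approach this is to pass a function to `re.sub` as the replacement, so each match can be swapped for a numbered placeholder and restored later. This is only a sketch under stated assumptions: the `[[n]]` placeholder format, the helper names `strip_tags` and `restore_tags`, and the simplistic tag pattern are all hypothetical and not from the question, and the placeholder must not otherwise occur in the text.

```python
import re

# Simplistic pattern for illustration; it does not handle ">" inside attribute values.
TAG_RE = re.compile(r"<[^>]+>")
PLACEHOLDER_RE = re.compile(r"\[\[(\d+)\]\]")


def strip_tags(text):
    """Replace each tag with a numbered placeholder and return (text, tags)."""
    tags = []

    def repl(match):
        tags.append(match.group(0))
        return f"[[{len(tags) - 1}]]"

    return TAG_RE.sub(repl, text), tags


def restore_tags(text, tags):
    """Put the saved tags back in place of their placeholders."""
    return PLACEHOLDER_RE.sub(lambda m: tags[int(m.group(1))], text)


html = "<p>Hello <b>world</b></p>"
stripped, saved = strip_tags(html)
print(stripped)
print(restore_tags(stripped, saved))
```

Because the placeholders carry an index, the intermediate processing can reorder or edit the surrounding text, provided it leaves the placeholders themselves intact.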

BeautifulSoup: How to find all href links in a div with a class?

On disboard.org/ I am trying to collect all href's within a div with a class of 'server-name'. Source-Code: def scrape(): url = 'https://disboard.org/search

Need help parsing link from iframe using BeautifulSoup and Python3

I have this url here, and I'm trying to get the video's source link, but it's located within an iframe. The video url is https://ndisk.cizgifilmlerizle.com... i

Download bing image search results using python (custom url)

I want to download bing search images using python code. Example URL: https://www.bing.com/images/search?q=sketch%2520using%20iphone%2520students My python co

How to speed up python data parsing?

I have the following task: I need to parse the site in the form of a taxonomy and save it to CSV, that is, upload 24,000 links. So far, I uploaded 800 links to a file,

Scrape and change data in date in BeautifulSoup

I am scraping data from different web pages and there are several dates in this data. The code allowing me to have the information that I want looks like this,

Unable to iterate through list using BeautifulSoup

I am doing some experiments with Python 3.6 on a Mac and BeautifulSoup. I am trying to build a simple program to scrape song lyrics from a URL and store them as pla

Pulling company name from webpage within <a> tag

I am trying to streamline my data collection by using Python 3.7 and BeautifulSoup to pull company name, if that company is approved or other, and if they are m