Category "pandas"

How to select all columns whose names start with X in a pandas DataFrame

I have a DataFrame: import pandas as pd import numpy as np df = pd.DataFrame({'foo.aa': [1, 2.1, np.nan, 4.7, 5.6, 6.8], 'foo.fighters': [0
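A minimal sketch of one way to do this, assuming the prefix of interest is `foo.`. The excerpt above is truncated, so the column values and the extra `bar.baz` column here are made up for illustration:

```python
import numpy as np
import pandas as pd

# Hypothetical data: only the column names 'foo.aa' and 'foo.fighters'
# come from the question; the values and 'bar.baz' are invented.
df = pd.DataFrame({
    "foo.aa": [1, 2.1, np.nan],
    "foo.fighters": [0, 1, np.nan],
    "bar.baz": [5, 5, 6],
})

# Boolean mask over the column labels, then select the matching columns.
foo_cols = df.loc[:, df.columns.str.startswith("foo.")]
print(list(foo_cols.columns))
```

`df.filter(regex=r"^foo\.")` is a possible alternative when a regular expression is more convenient than a plain prefix.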

How to display an error message when pd.read_csv fails

I use pd.read_csv to fetch GCS data. However, when the file size is too large or something, the python task force quit automatically at the line using pd.read_c

Get specific rows which match condition pandas [duplicate]

I have the following dataframe My current code is as follows: Outcome is to only show instances where ImageFileName is services.exe and the P

How to join two very large dataframes together with same columns?

I have two datasets that look like this: df1: Date City State Quantity 2019-01 Chicago IL 35 2019-01 Orlando FL 322 ... .... ... ... 2021-07 Chicago IL 334 202

Get records that are a time interval away from a given date and specific conditions on a pandas DataFrame

Given the following pandas DataFrame: | ID | date | direction | country_ID | |-----------|-------------------------|----

LabVIEW generated Excel files unable to be read by pandas.read_excel

I'm having an issue when I save LabVIEW outputs as .xlsx files. When I try to import with Pandas I get the following error: expected <class 'openpyxl.styles.

How to get only a single value from a DataFrame in Python

I have dataframe df_my that looks like this id name age major ---------------------------------------- 0 1 Mark 34 Engli

Python dataframe to Google Sheet: Freeze rows and sketch vertical lines around sets of columns

I have a dataframe df_final (it is a time series data) as shown in attached photo. I am able to write this dataframe to the Google Sheet using the code below. Q

How can I plot a line chart with two different companies monthly sales from 12 months before and 12 months after

The dataframe is a snapshot of what I'm working with. What I want to achieve is a line chart of the monthly sales of the different brands. Months with '-' befor

Get only those rows from a pandas data frame where a particular value is not present in a data frame column of type list

One of the columns in my dataframe is of type list. I want to get only those rows where color is not red. Output should give row 2 and 3. I tried def Removecolo
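A small sketch of one possible approach, assuming the list column is named `color`. The question's actual data is not shown in the excerpt, so the frame below is hypothetical, arranged so that the second and third rows are the ones kept:

```python
import pandas as pd

# Hypothetical data: the column names and values are assumptions.
df = pd.DataFrame({
    "item": ["a", "b", "c"],
    "color": [["red", "blue"], ["green"], ["blue", "yellow"]],
})

# Test membership per row, since each cell holds a Python list.
mask = df["color"].apply(lambda colors: "red" not in colors)
no_red = df[mask]
print(no_red)
```

The `apply` call runs a Python-level check per row, which is usually acceptable for list-valued columns because they cannot be compared in a vectorized way.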

Can I make my scraping (pandas read_html) script faster?

I have a very simple script, it just scrapes some tables off the internet and inserts it into a db. however: tickerlist contains about 8000 rows. and the script

Selecting a subset of a dataframe based on a list - pandas

I am working with a large dataframe (ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/bacteria/assembly_summary.txt) with pandas in Python 3, using PyCharm. The column

Iterate over a df and multiply the values by the values of another df

My df1 looks like this: It contains 3 unique project IDs. The date starts on 01-01-22 and ends on 01-12-28 id date p50 p90 apv1 01-01-22 1000 1000 apv2 01-01-22 1

Filling missing data in a pandas data set

I have annual stock data, and for some years data are missing in the columns at, ebit, lt, ni, re, wcap. How can I fill these missing slots with the dat

ValueError: shapes and not aligned: (dim 2) != 4 (dim 0)

I am currently working on a script that does some array manipulating and calculations for modeling. I am running into an error and unsure how to solve it. from

Python 3 does not show the barchart

I am trying to make this bargraph appear in the python run screen. But for some reason, it does not show the graph on the screen. However, if I put this code on

Cannot get a file to be read into a list of stock tickers and then get yfinance data for each

I am trying to read a csv file into a dataframe and then iterate over each ticker to get some yahoo finance data, but I struggle with matching the right data ty

How to get colored legend with seaborn barplot

There is a problem with plt.legend in seaborn and matplotlib. What is wrong with it? I can't see the legend color. plt.figure(figsize=(30,10)) plt.xticks(rotati

Find differences between a set of csv files in folder 1 against a set of csv files in folder 2?

There are a number of files that need to be compared for differences in their rows; difference not as in subtraction but as in what values are different for each

Calculating rolling XNPV in Python

I need to calculate XNPV of cash flows at different future dates. Is there any function to do this in numpy, pandas or plain python? Consider a dataframe like s
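For reference, XNPV discounts each cash flow by the actual time elapsed since the first date: the sum of `amount / (1 + rate) ** (days_since_first / 365)`. The sketch below computes a single XNPV, not the rolling variant asked about. The function name `xnpv`, the column names `date` and `amount`, and the sample cash flows are all assumptions, since the question's dataframe is not shown in the excerpt:

```python
import pandas as pd

def xnpv(rate, cashflows):
    """XNPV of a frame with hypothetical 'date' and 'amount' columns."""
    t0 = cashflows["date"].min()
    # Year fractions on an actual/365 basis, measured from the first date.
    years = (cashflows["date"] - t0).dt.days / 365.0
    return float((cashflows["amount"] / (1.0 + rate) ** years).sum())

# Made-up cash flows: invest 100, receive 110 exactly 365 days later.
flows = pd.DataFrame({
    "date": pd.to_datetime(["2021-01-01", "2022-01-01"]),
    "amount": [-100.0, 110.0],
})
value = xnpv(0.10, flows)
print(value)
```

A rolling version could call such a function on successive slices of the cash-flow frame, though the exact window definition depends on details the excerpt does not give.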
