Category "pandas"

pandas | list in column to binary column

I have the following dataframe: +------------+------------------+ | item | categories | +------------+------------------+ | blue_shirt | ['red', 'wh

How to find current upper Bollinger band in pandas-ta

I have a CSV file having columns Instrument, Date, Time, Open, High, Low, Close I want the rows having Current close greater than current upper Bollinger band(2

custom method for voting bewtween multiple csv files

I have 3 (or more) Dataframes with this structure: ID Percentage 00001 3 00002 15 00003 73 00004 90 ... ... each csv have unique predicted percentage values a

Pandas Dataframe Categorical data transformation

I am having pandas dataframe as follows: import pandas as pd # dictionary with list object in values # Item=[Item1, Item2, Item3] details = { 'Date' : [

Use Index of the Loadings (IL) method to calculate the optimal number of principal components

I want to use Index of the Loadings method to calculate the optimal number of principal components using the index_of_loadings_matrix function. My dataframe was

Exploding multiple dict columns and concatenating with original Pandas data frame

I export a Postgres SQL query to create a Pandas data frame df similar to the following: df = pd.DataFrame({ 'employee_id' : [123, 456, 789], 'country_c

Change values of column in df using conditional in two columns

I'm having the following problem: I'm working with a dataset that can be found at https://www.kaggle.com/datasets/ricardomattos05/jogos-do-campeonato-brasileiro

Efficiency of multiple chained str transformation and alternatives

I'm wanting to change a dataframe column so the values are lower case and also have their whitespace stripped. For this I used chained str transformations. df.l

Change NaN to None in Pandas dataframe

I try to replace Nan to None in pandas dataframe. It was working to use df.where(df.notnull(),None). Here is the thread for this method. Use None instead of np.

Change NaN to None in Pandas dataframe

I try to replace Nan to None in pandas dataframe. It was working to use df.where(df.notnull(),None). Here is the thread for this method. Use None instead of np.

Python Pandas - Lookup a variable column depending on another column's value

I'm trying to use the value of one cell to find the value of a cell in another column. The first cell value ('source') dictates which column to lookup. import p

Predicting with SMOTE

If I have a training dataset that has 1083 samples and a testing dataset that has 79871 samples, how do I go about making the samples equal? I have been using S

pandas read_csv throwing ValueError: Invalid file path or buffer object type: <class 'list'>

I want to read a csv file sent as a command line argument. Thought I could directly use FileType object of argsprase but I'm getting errors. from argparse impor

Repeat rows in a pandas DataFrame based on column value

I have the following df: code . role . persons 123 . Janitor . 3 123 . Analyst . 2 321 . Vallet . 2 321 . Auditor . 5 The first line means that I hav

Extracting a .7z File into a Pandas Data Frame

I am Using a Jupyter notebook (google colab) to try and extract data from a .7z file into a pandas dataframe, using linux commands. The data is from http://untr

Calculate Decay Rate in Python

I have dataset which somewhat follows an exponentional decay df_A Period Count 0 1600 1 894 2 959 3 773 4 509 5 206 I want

Create numpy array from function applied to (multiple) pandas columns

I have pd.DataFrame containing rows of values: import pandas as pd df = pd.DataFrame({"col1": [1, 2, 3, 4, 5, 6], "col2": [6, 5, 4, 3, 2, 1]}) I now want to f

pandas exlewriter.book does not read my excel file and even break the existed file

I want to stack a series of dataframe in one excel file and I wrote the code below. if os.path.isfile(result) is False: with pd.ExcelWriter(result, engine='

Functional Programming: How does one create a new column in a multi-index data frame that is a function of another column?

Suppose the below simplified dataframe. (The actual df is much, much bigger.) How does one assign values to a new column f such that f is a function of another

Pandas+Uncertainties producing AttributeError: type object 'dtype' has no attribute 'kind'

I want to use Pandas + Uncertainties. I am getting a strange error, below a MWE: from uncertainties import ufloat import pandas number_with_uncertainty = ufloa