Category "pandas"

How to create a dummy only if a column has non-zero values for certain dates but zero for other dates

Let's say, I want to identify traders who only traded during bull runs but did not trade (zero values) during downturns or stable periods. Let's say we have two

Retrieve name of column from its Index in Pandas

I have a pandas dataframe and a numpy array of values of that dataframe. I have the index of a specific column and I already have the row index of an important

Python, Pandas and intersection - not PIVOT

This isn't a straightforward pivot question. I don't want to create new named columns (or numbered ones). What I am looking for is to find a way to search for

how to replace the particular column value using python

Input Data Frame output data frame in output, some particular column should be replace and other value of other column should be repeated

How to save each output by iteration of a loop in separate csv files

I have a file in which i calculate a startdate and enddate, these dates change by iteration each time between a range of dates. the code works fine but i don't

Transform a dataset from wide to long pandas

I have a little problem with the transformation from wide to long on a dataset. I tried with melt but I didn't get a good result. I hope that someone could help

Pandas: how to filter out rows containing a string pattern within a list in a column?

I have a data frame that looks similar to the following: df = pd.DataFrame({ 'employee_id' : [123, 456, 789], 'country_code' : ['US', 'CAN', 'MEX'],

populating dataframe with youtube api

list2=['PewDiePie', 'jacksepticeye', 'iDubbbzTV', 'Markiplier','MarkiplierGAME', 'EminemMusic','EdSheeran', 'TaylorSwift', 'CNN', 'FoxNews', 'CBCNews', 'ABCNew

Remove rows in dataframe based on value counts of unique identifier

Identifier Value_1 Value_2 Value_3 123 20. 30. 1 123. 12. 14. 1 123. 18. 12. 1 124. 12. 10. 6 124.

Python Dash - add submit button at the end of multiple inputs. Also format output table nicely

Want to create a dash board, with two or more inputs, a submit button at the end of inputs that outputs a table. I am having trouble creating properly spaced in

How to add new column to pandas group? Pandas forgets the column

I have a pandas dataframe indexed by createdAt grouped by pid (participant identifiers). The created at column are unix timestamps. Now I would like to add a ne

Creating a summary statistics table in python

I am trying to recreate the 'SummarySE()' function from R in python but I am having trouble getting it to work. The function creates a summary stats table from

Why ais my output "nan", keras model prediction

I'm trying to make an AI attempting to predict numbers from prime number sequence, but my model outputs "[[nan]]". My csv file is formatted like this: number of

Select how much data the pd.mean() displays?

When returning the pd.mean(), how to show only specific information? Example Now it gives this: new_df = pd.read_excel('example.xls', usecols = ['weight'] ) pri

NumPy array value error from training in Auto-Keras with StratifiedKFold

Background My sentiment analysis research comes across a variety of datasets. Recently I've encountered one dataset that somehow I just cannot train successfull

Combine Columns in Pandas

Let's say I have the following Pandas dataframe. It is what it is and the input can't be changed. df1 = pd.DataFrame(np.array([['a', 1,'e', 5],

Why is the output of the sklearn.feature_selection chi2 nan - can a feature with no variation not be compared to a feature with variation?

I want to build a heat map that correlates whether a feature is present in each column, with whether the feature is present in every other column. I have this:

ID3 algorithm: RecursionError: maximum recursion depth exceeded in comparison

I have this function to build ID3 decision tree: def buildTree(df,tree=None): Class = df.keys()[-1] #To make the code generic, changing target vari

Strange Plotly behaviour with Choropleth Mapbox

I want to create a choropleth map out of a GeoJSON file that looks like this: {"type": "FeatureCollection", "features": [ {'type': 'Feature', 'geometry': {'type

how to retrieve data from pandas

I am trying to retrieve the most recent data from the below results. The code I am running is: import datetime from Historic_Crypto import HistoricalData # Tim