Category "dataframe"

Problem with websocket output into dataframe with pandas

I have a websocket connection to binance in my script. The websocket runs forever as usual. I got each pair's output as seperate outputs for my multiple stream

df.iloc causes error when used to perform second calculation

I am opening a .csv file and pulling it into a pandas dataframe (in this case there are 87 rows, 0-86). I want to perform separate calculations with the content

I am trying to merge two dataframes

I have this dataframe firm formtype Date_Filed GameStop Corp. 8-K 2021-04-01 I want to change the Date_Filed to 2021-04-01 00:00:00. I am using

Calculations on a pandas DataFrame column conditional on another column

I notice several 'set value of new column based on value of another'-type questions, but from what I gather, I have not found that they address dividing values

Subsetting dataframe with grep

I have following data Sample_ID<-c("a1_01_01","a2_03_03","a3_07_07","a4_09_09","a5_10_10","a6_21_21") Sex<-c(M, M, F, F, M, NM) DF1<-data.frame(Sample_

ValueError: row index exceeds matrix dimensions sparse coo max

I really have no idea what's the root cause! I have created below matrix and had tried increase the (M, N) size, or reduce the data size or the row size or colu

DataFrame challenge: mapping ID to value in different row. Preferably with Polars

Consider this example: import polars as pl df = pl.DataFrame({ 'ID': ['0', '1', '2', '3', '4', '5','6', '7', '8', '9', '10'], 'Name' : ['A','','','','B

How to evenly spread out date data (pandas)

I'm working on a project and I'm struggling with some formats of dataframes. I have two dataframes, each containing a different number of months. I want all the

How to create a dummy only if a column has non-zero values for certain dates but zero for other dates

Let's say, I want to identify traders who only traded during bull runs but did not trade (zero values) during downturns or stable periods. Let's say we have two

Python, Pandas and intersection - not PIVOT

This isn't a straightforward pivot question. I don't want to create new named columns (or numbered ones). What I am looking for is to find a way to search for

Calculate and return the average of positive, negative, and neutral

I have the following dataframe: enter image description here I am trying to have three additional columns in which they return sum of instances of 0, 1-, and 1

Transform a dataset from wide to long pandas

I have a little problem with the transformation from wide to long on a dataset. I tried with melt but I didn't get a good result. I hope that someone could help

populating dataframe with youtube api

list2=['PewDiePie', 'jacksepticeye', 'iDubbbzTV', 'Markiplier','MarkiplierGAME', 'EminemMusic','EdSheeran', 'TaylorSwift', 'CNN', 'FoxNews', 'CBCNews', 'ABCNew

Combine Columns in Pandas

Let's say I have the following Pandas dataframe. It is what it is and the input can't be changed. df1 = pd.DataFrame(np.array([['a', 1,'e', 5],

How to display dictionary with dataframes on a localhost (using Flask and Python)?

I have the following dict: {'id': 1, 'df': pd.DataFrame({'id': [1,2,3], 'col1': ['kuku', 'dudu', 'lulu'], 'col2': [8,9,10]}), 'df_size': 3} When I am trying to

Strange Plotly behaviour with Choropleth Mapbox

I want to create a choropleth map out of a GeoJSON file that looks like this: {"type": "FeatureCollection", "features": [ {'type': 'Feature', 'geometry': {'type

Finding percentage of rejection in pandas dataframe

I have a pandas data frame like given below Id1 YEAR CLAIM_STATUS no_of_claims 1 2019-01 4 1 1 2019-01 5 1

How to apply code to dataframe by condition?

I have the following dataframe: library(dplyr) library(tidyverse) library(concordance) Year <- c(2016,2016,2017,2019,2020,2020,2020,2013,2010,2010) Pf <-

Visualization random sample with displaCy

How can I visualize using displaCy in a dataframe? I have a data called taks_output and want to visualize a sample of the columm msg_lower? What I did: import p

Reshape wide to long for many columns with a common prefix

My frame has many pairs of identically named columns, with the only difference being the prefix. For example, player1.player.id and player2.player.id. Here's an