Category "dataframe"

Error - replacement has [x] rows, data has [y]

I have a numeric column ("value") in a dataframe ("df"), and I would like to generate a new column ("valueBin") based on "value." I have the following condition

convert float64 (from excel import) to str using pandas

Although the same question has been asked multiple times, I can't seem to make it work. I use Python 3.8 and I read an Excel file like this: df = pd.read_excel(r"

Deleting multiple rows under same App Name but with different number of reviews

I have a dataframe having many columns, 2 of them being 'App' and 'Reviews'. I discovered that for the same app there are multiple rows because they differ in t

Perform a merge by date field without creating an auxiliary column in the DataFrame

Given the following DataFrames in Python pandas: | date | counter | |-----------------------------|------------------| | 2022-01-0

Concat null columns data with actual data in pandas?

I have a set of columns that need to be merged into a single column, where some columns have data and some don't, and it should be joined with the data to single co

pandas, creating dataframes based on tuple

I have a tuple that has data for several categories. Now I want to extract small dataframes from this tuple for each category based on a list I created. I want

Count occurrences within a specific range

I have a data frame that looks like this: Tag 0 skip_1 1 run 2 skip_1 3 run 4 skip_1 5

Multiply without eliminating information

I have a dataframe and I would like to maintain information. My data frame is like: a <- c("a","b", "c", "d") b <- c("e","f", "g", "h") c <- c(1, 2, 1,

Pandas Value Error: Cannot set item on a Categorical with a new category, set the categories first

I've been looking for other similar issues on this ValueError, but none of them has the same code as I have. So here it is. As I am still very new at this, I am

How to find the number of seconds elapsed from the start of the day in pandas dataframe

I have a pandas dataframe df in which I have a column named time_column which consists of timestamp objects. I want to calculate the number of seconds elapsed f

Python calculate increment rows till a condition

How to obtain the below result. Sample Data with Output Time To default is the column which is to be calculated. We need to get the increment number as Time to

Pandas Group by index Hour and keeping observation for each hour

I have a pandas dataframe containing one column and a datetime index. I need to group the data by hour and keep each observation (record) for each of the grouped

ParseError: Error tokenizing data. C error: Buffer overflow caught - possible malformed input file. (read_csv)

I cannot use read_csv method of pandas properly on kaggle. Error that I get is: ParseError: Error tokenizing data. C error: Buffer overflow caught - possible ma

how to add columns and values in a dataframe in python

In the below JSON array { "data": [ { "name": "page_call_phone_clicks_logged_in_unique", "period": "lifetime", "values": [ {

Count all NaNs in a pandas DataFrame

I'm trying to count NaN elements (data type class 'numpy.float64') in a pandas Series to know how many there are, where the data type is class 'pandas.core.series.Seri

Review the \n (newline) values with proper representation in DataFrame

I have this: test = ['hey\nthere'] Output: ['hey\nthere'] And when I insert it into the DataFrame it stays the same way: test_pd = pd.DataFrame({'salute': test

Python code to return element value in dataframe based on another dataframe

I have a dataset similar to this generated from a file with yearly data d1 = pd.DataFrame({'category': ['A', 'B', 'C', 'D', 'E', 'F'], 'col

How to add new edges to the stellargraph dataset?

I need to add some extra edges to the Cora dataset using stellargraph. Is there any way to add edges to the current dataset in the stellargraph library? import stellarg

How to filter for variables in a column of one df from another df's column with unequal length in R?

I am trying to select variables in a column of a DF using the variables from a column in another DF with a different length. I am using dplyr to filter. DF1