Category "pandas"

How to find minimum of some variable with repeating row indexes and preserve all other variables in Python Pandas

Basically, I have multiple repeating dates and the indices (1/2/1990 many times followed by 1/3/1990 many more times, etc.) I want to find the minimum of a give

searching in range between columns using sqlite3 in pandas

I have found solution to my problem in one question Merge pandas dataframes where one value is between two others I tried to modify it for my situation but it d

What's the equivalent of `pandas.Series.map(json.loads)` in polars?

Based on the document of polars, one can use json_path_match to extract JSON fields into string series. But can we do something like pandas.Series.map(json.load

Python Pandas SUMIF excel equivalent

I can't figure out how to achieve a certain task in my python script. I have a dataframe that contains media coverage for a specific topic. One of my columns na

Iterate trough a converted datetime pandas dataframe with a external function

https://rhodesmill.org/skyfield/positions.html#azimuth-and-altitude-from-a-geographic-position Hi I have function that generates a sun-shot azimuth on a specifi

how to access a specific data in two columns using if and statement

My Data Frame My Code: a = 10001 b = "01.01.2001" if a == np.any(df["Token_ID"]) and b == np.any(df["Date_of_birth"]): print("yes") else: print("no")

How to use python to merge multiple sheets from an excel file and values from particular cells

I have an excel file with multiple sheets, the actual data I need from each sheet is from cell B7 to F38, how can I merge all the sheets' data into one by using

Convert date format from a 'yfinance' download

I have a yfinance download that is working fine, but I want the Date column to be in YYYY/MM/DD format when I write to disk. The Date column is the Index, so I

Add missing dates to pandas dataframe

My data can have multiple events on a given date or NO events on a date. I take these events, get a count by date and plot them. However, when I plot them, my

Find a pattern in middle of multiple sentences

I have a dataframe as below, data = [ [ 1, 'AR-123456' ], [ 1, '123456' ], [ 2, '345678' ], [ 3,'Application-12345678901'], [ 3, '1234567890

Trying to find a graph in matplotlib

I have data that show the difference of temperatures from 1955 to 2020 from an average. I want to make a graph in matplotlib that looks like this: It shows tem

How do I use pandas to compare a list of numbers to a column in an excel document?

I have this simple script that creates a list by taking the values out of the column of excel_1 and compares it to the values of a column in excel_2. I want it

Pandas group by one column and repeat the values of another column

I was trying to divide the month into two weeks. Basically for each month i am trying to create week numbers like 1,2,3,4 and repeat them. How to create the req

Append nanosecond to millisecond Python datetime object

I am trying to append nanoseconds to an already existing millisecond datetime pandas object. So, for instance, I already have 08:02:36.715647 which reports upti

Dataframe is Offset by -1 Days From Source Data

I am using a connector to query some tables in Dynamics 365 Business Central and when I view my dataframe all of my dates are offset by -1 days. I generated a l

Removing Non-English Words from CSV - NLTK

I am relatively new to Python and NLTK and have a hold of Flickr data stored in CSV and want to remove non-english words from the tags column. I keep getting er

How to obtain all gaps as start .. stop interval in pandas datetime index

I want to find all gaps in pandas DateTime index as a list of intervals. For example: '2022-05-06 00:01:00' '2022-05-06 00:02:00' <- Start of gap '2022-05

Split translation results with pandas in Google Colab

Hi everyone I'm doing a traslation of words in csv/excel files using Google Colab and Pandas here is my code: import pandas as pd from googletrans import Transl

Use rows values from a pandas dataframe as new columns label

If I have a pandas dataframe it's possible to get values from a row and use it as a label for a new column? I have something like this: | Team| DateTime| Score

How to count number of events in a dataframe before and after a given date?

I'm trying to identify individuals who have events before or after events of their first occurrence of an event of a specific type. For example, I'm interested