Category "data-science"

How to calculate values in Pandas Dataframe itself?

You can see my dataframe below. The x values differ, but the other values are the same as the values to their left; for example, column 15 and column 16 have the same value. I

Python can't find database if script is run from another file

I am struggling with a data-heavy project. I can run a file that uses a query file (all the queries and converters are in there) without problems, but whe

How to iterate over rows of each column in a dataframe

My current code works and produces a graph if there is only 1 sensor, i.e. if col2 and col3 are deleted in the example data provided below, leaving one col

Finding percentage of rejection in pandas dataframe

I have a pandas data frame like the one given below:
Id1  YEAR     CLAIM_STATUS  no_of_claims
1    2019-01  4             1
1    2019-01  5             1

Apply ta_py function to Cudf dataframe - RAPIDS

I am trying to create a new column on a cudf dataframe based on VWMA from ta_py: #creating df CJ_m30 = cudf.read_csv("/media/f333a/Data/CJ_m30.csv",

Installing CUDA Windows 10

I am trying to install the CUDA toolkit in order to be able to use Thundersvm on my personal computer. However, I keep getting the following message in the GUI i

Is there a scientific field dedicated to the quantification of intelligent behavior? [closed]

One of the biggest struggles in ML research is the creation of objective functions that capture the researcher's goals. Especially when talk

Is there a way to get rid of these loops in OR-Tools, and what other libraries do optimization in Python?

This code block is from OR-Tools docs, and I want to remove these for-loops. Is there a way to vectorize the code? The issue here is that I expect to have the n

KeyError: 'initialized_diffuse'

I'm getting a KeyError 'initialized_diffuse' while calling the following API, probably after joblib.load(). import joblib .......... @routes.route("/forecast",

Hardware for Machine Learning / Deep Learning

I want to build a new computer for Data Science purposes. What do you think about this hardware: https://www.ldlc.com/configurateur-pc/23fe088422141bb69274a13ca

How do I plot a bar chart with a column created by groupby

I'm an Environmental Engineer, trying to make the leap to data science, which interests me more. I'm new to Python, I work at a company that evalua

Getting an R Notebook to work in JetBrains DataSpell

So I am just starting a data science/stats class and I am trying to set up an R notebook within DataSpell. I am able to create a Jupyter notebook, but it only wants

Analyzing unevenly spaced timeseries

I have been tasked with analyzing the input flow in a water tank in relation to a number of weather parameters. In a narrower sense, I have to investigate any p

ParseError: Error tokenizing data. C error: Buffer overflow caught - possible malformed input file. (read_csv)

I cannot use the read_csv method of pandas properly on Kaggle. The error that I get is: ParseError: Error tokenizing data. C error: Buffer overflow caught - possible ma

How to add a new row after every unique entry in a pandas dataframe

I have to add a new row at the end of each person's information. In the new row, all the information will be the same as in the last row, like name, last_upd

Extract YouTube Channel Community Feed Data

I am trying to collect the community feed data from a channel for analytics. I couldn't find a way using the YouTube Data API v3. Is there a way to extract such

Creating custom colourmap for geopandas.explore plot

all code: def rgb2hex(r,g,b): return '#{:02x}{:02x}{:02x}'.format(r,g,b) def rg(num): num = int(np.round((num / 100) * 124)) r = (124 - num) g

Plot scikit-learn (sklearn) SVM decision boundary / surface

I am currently performing multi-class SVM with a linear kernel using Python's scikit-learn library. The sample training data and testing data are as given below: Mode

Find closest datapoint to a date in another dataframe

I have two data frames. One data frame is called Measurements and has 500 rows. The columns are PatientID, Value and M_Date. The other data frame is called Pati

Network Flow Dataframe - Merging Memory Error - Unable to allocate array with shape and data type

I have 3 big CSV files, and they all have the same 76 columns. The numbers of rows differ: 17809 rows, 124262 rows, and 108779 rows. I am trying to merge these 3 d
