I'm an Environmental Engineer, trying to make a leap change to the data science area which interests me more. I'm new to Python, I work at a company that evalua
So I am just starting a data science/stats class and I am trying to setup a R notebook within Dataspell I am able to create a Jupyter notbook but it only wants
I have been tasked with analyzing the input flow in a water tank in relation to a number of weather parameters. In a narrower sense, I have to investigate any p
I cannot use read_csv method of pandas properly on kaggle. Error that I get is: ParseError: Error tokenizing data. C error: Buffer overflow caught - possible ma
I have to add a new row at the end of each person information. In the new row which we will add all the information will be same as last row like name, last_upd
I am trying to collect the community feed data from a channel for analytics. I couldn't find a way using the YouTube Data API v3. Is there a way to extract such
all code: def rgb2hex(r,g,b): return '#{:02x}{:02x}{:02x}'.format(r,g,b) def rg(num): num = int(np.round((num / 100) * 124)) r = (124 - num) g
I am currently performing multi class SVM with linear kernel using python's scikit library. The sample training data and testing data are as given below: Mode
I have two data frames. One data frame is called Measurements and has 500 rows. The columns are PatientID, Value and M_Date. The other data frame is called Pati
I have big 3 CSV files and they are all 76 same columns. The number of rows are different 17809 rows - 124262 rows - 108779 rows I am trying to merge these 3 d
Hello I am struggling to find a solution to probably a very common problem. I want to merge two csv-files with soccer data. They basically store different data
I'm trying to implement Integer Programming for Nearest Neighbor Classifier in python using cvxpy. Short intro Given a dataset of n points with a color (red or
I'm trying to implement Integer Programming for Nearest Neighbor Classifier in python using cvxpy. Short intro Given a dataset of n points with a color (red or
I am trying to use SMOTE in python to handle highly imbalanced data set. After splitting the data set into train and test I generate synthetic samples using SMO
I have a dataframe with over 280 features. I ran correlation map to detect groups of features that are highly correlated: Now, I want to divide the features to
When I try some codes in pandas, the bash code wget is used in colab as the following: import pandas as pd !wget abc.com/sales.csv If I want
I merged 3 different CSV(D1,D2,D3) Netflow datasets and created one big dataset(df), and applied KMeans clustering to this dataset. To merge them I did not use
I have a dataset in which one of its columns is Ex-Showroom_Price, and I'm trying to convert its values to integers but I'm getting an error. import pandas as p
I am trying to manage the results of machine learning with mlflow and hydra. So I tried to run it using the multi-run feature of hydra. I used the following cod
My Data is in this format(Both Multiple and Multivariate Timeseries) I need to predict number of units sold is gonna be for every product across different st