Does anyone have any insight? New to using Weka KnowledgeFlow but I've followed the official documentation to a t. I'm trying to generate multiple ROC plots, wh
I'm running a toy model for learning, on Ubuntu 21.10, in a conda environment that comprises python 3.74, keras 2.4.3 and talos 1.0, among many other packages.
I have a code that basically takes in a csv, which can be uploaded from streamlit and then pushes out a classification prediction. Just as a context I use xgboo
my code is import streamlit as st import pickle import string from nltk.corpus import stopwords import nltk from nltk.stem.porter import PorterStemmer ps = Port
I have a dictionary of 100+ dataframes all have the same shape [9999 rows x 4 columns] with the following columns: ['time', 'response', 'arrival_1','arrival_2']
I have a dictionary of 100+ dataframes all have the same shape [9999 rows x 4 columns] with the following columns: ['time', 'response', 'arrival_1','arrival_2']
import pandas as pd import matplotlib.pyplot as plt from sklearn.model_selection import train_test_split,cross_val_score from sklearn.tree import DecisionTreeCl
I'm trying to solve the cannibals and missionaries problem in python (with some additional criteria, but the main idea is the classic one). So, in the class Gra
I'm trying to convert a predicted RasterFrameLayer in RasterFrames into a GeoTiff file after training a machine learning model. When using the demo data Elkton-
I am fairly new to coding and getting confused between average accuracy and overall accuracy. I have created a function to calculate accuracy, i then divide thi
I want to perform a random search, in classification problem, where the scoring method will be chosen as AUC instead of accuracy score. Have a look at my code f
I need you guys help to find a non linear decision boundary. I have 2 features with numerical data, I made a simple linear decision boundary (see picture below)
Problem statement: I have 150k points in a 3D space with their coordinates stored in a matrix with dimension [150k, 3] in mm. I want to find all the neighbors o
As an example, suppose there is a random forest and a logistic regression model that accept the same input data, and I want the inference result to be the avera
I'm following the book Hands-on Machichine Learning by Aurelien Geron, more specifically, where it begins to go into classifiers. I'm following the code from th
How can I use from scipy.stats import multivariate_normal to generate data? In specific, I want to create a GMM data that contains 3 columns (features) and a la
How to compute similarity(percentage) between two matrix/arrays. or find the closest array/matrix to a given array, on the basis of how similar their data value
I have defined few parameters in my config.yaml like as below. params: epochs: 10 batch_size: 128 num_classes: 10 loss_function: sparse_categorical_cros
I want to calculate shap values from a sklearn pipeline with a preprocessor and a model. When i do it with the code below I get 0 for all shape_values def creat
In scikit-learn, the GaussianMixture object has the method bic(X) that implements the Bayesian Information Criterion to choose the number of components that bet