Category "dplyr"

Create unique row values in new column based on matching criteria in R

I have a dataframe with one identifier column of unique values, and one column which contains specific criteria. I want to create a new identifier column of uni

Retain all columns after using group_by summarise, and mutatue dplyr on categorical variable and plot barplot with confidence intervals

I'm new to R. This is my dataset df <- tribble( ~Area_of_interst ,~Meds,~Response, "Internal Med", "asprin", "yes", "Inter

How to apply code to dataframe by condition?

I have the following dataframe: library(dplyr) library(tidyverse) library(concordance) Year <- c(2016,2016,2017,2019,2020,2020,2020,2013,2010,2010) Pf <-

How to use mutate() +across() only for specific rows

The data: df <- tribble( ~name, ~val.I, ~val.V, ~`val.%`, "Peter", 123, 12.4, 14, "Peter in %", 111, 532, 57, "Harald", 2222, 3333, 444, "Harald in

Summing across in a dataframe with condition coming from another column

this is not a very good title for the question. I want to sum across certain columns in a data frame for each group, excluding one column for each of my groups.

How to filter out a row if there are two consecutive instances of the same value?

I have a data frame with multiple similar sequences in which column Z has a string pattern containing "VALUE1" and "VALUE2" (only these two patterns matter) and

Multiples rows to one row in R [closed]

In R, I have a data frame with several values. I would like to have a data frame that transforms the data frame into a data frame with just on

Ways to select multiple columns in base R using the native pipe |>?

What are good ways to select multiple columns of a data frame in base R using the native pipe |>? (i.e., without the tidyverse/dplyr to reduce external depen

How to insert rows in specific indices of dataframe containing sum of few rows above only in R pipe dplyr

for dataframe below, df <- data.frame(id = c(rep(101, 4), rep(202, 3)), status = c("a","b","c","d", "a", "b", "c"), wt = c(10

How can I filter rows out if their start date is within 90 days from today and place them out until the 1st of the following month in R?

I am having difficulty finding the words to describe what I am searching for but will try. I would like to solve the following using R or Python (but preferably

How can I use a vector variable to avoid repeating references to non-adjacent columns in R?

Question In R, can I used a vector that holds the names of data frame columns to avoid repeated code? vec_columns <- c("col1", "col2", "col8", "col54") Bac

str_detect removing some but not all strings with specified ending

I'd like to remove any string that ends in either of 2 characters in a pipe. In this example it's ".o" or ".t". Some of them get removed, but not all of them, a

Problem with `mutate()` input `index_date` when performing time series xgboost in R

here my reproducible example mydat=structure(list(supplier = c("TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP",

Using Dplyr to calculate percent by group for every column without specifying the name?

This is similar to this. However what I'm interested is to calculate the percentage for every column. So for example when I do the below I can calculate column

How to remove pairs of rows corresponding to same value in R dataframe?

For the unique pair of ID, if both corresponding rows are 0, I need to remove them. In this case, remove row #5 and #6 but not row #7 and #8. tmt.pair <- c("

Organize columns with numerical info in colnames via dplyr relocate

I have a large amount of annual data in a data frame that will only get larger. I would like to organize it, grouping columns according to the year, which is

Multiply columns in dataframe by values given in another dataframe

I've been trying to accomplish this in dplyr but not been able to figure it out. In one data frame (df1) I have columns with values. |A |B | |23 |43 | |24 |11

How to count unique entries in a column across multiple columns in R

I have a data set that looks something like this data set example I am trying to find unique entries in each of the columns I managed to do it for 1 column util

How do you populate missing dates for lag?

Say that I have a dataset. date <- c("2004-02-01", "2004-03-05", "2004-08-09", "2004-08-13", "2004-10-20", "2004-11-02", "2008-01-05", "2008-02-03", "2008-08

In a dashboard made with R, how can I filter/unfilter based on selectInput?

I have created a simple dashboard (link here) to display some data in R using selectInput. How can I activate/deactivate a filter reactively, so that the output

Category "dplyr"

Other Categories