Category "r"

Counting observations by 30-day window

As I explained in previous posts, I'm trying to count observations over 30-day windows, grouped by id. The data: df<-structure(list(id=c(1,1,1,2),date=c("200

Can I run the "phylosig()" command on multiple columns and expect one outcome?

In today's episode of total despair we're going to try and define the phylogenetic signal for a bunch of traits. The tricky part is that the data frame consists

Selecting elements from list based on length in R

I have this list of ~270'000 elements with each element having a length of either 165 or 166. What I would like is to isolate into a separate dataset those elem

Counter for occasion within a day based on date/time by ID

Now updated with the code for the data frame - Thanks for the tip! structure(list(id = c(5L, 5L, 5L, 5L, 5L, 5L, 5L, 6L, 6L, 6L, 6L, 6L), date = c("2022-03-03"

How can I add a geographic distance matrix to an igraph network?

I have an igraph network of 169 nodes (neighbourhoods). I've added some vertex attributes using: V(g)$attribute. I'm going to study the impact of the geogra

Fastshap summary plot - Error: can't combine <double> and <factor<919a3>>

I'm trying to get a summary plot using fastshap explain function as in the code below. p_function_G<- function(object, newdata) caret::predict.train(object

How can I filter rows out if their start date is within 90 days from today and place them out until the 1st of the following month in R?

I am having difficulty finding the words to describe what I am searching for but will try. I would like to solve the following using R or Python (but preferably

Assign variables in groups based on fractions and several conditions

I've spent several days on something I think should be rather simple, with no luck. Hope someone can help me! I have a data frame called "test" with the fol

metafor: use of annosym argument in addpoly()

I draw a forest plot with non-default annosym values: res <- rma(ai=tpos, bi=tneg, ci=cpos, di=cneg, data=dat.bcg, measure = 'OR', method='DL', slab=p

How can I use a vector variable to avoid repeating references to non-adjacent columns in R?

Question: In R, can I use a vector that holds the names of data frame columns to avoid repeated code? vec_columns <- c("col1", "col2", "col8", "col54") Bac

ggplot2 wrongly rearranges the bars in a bar plot when a value is null

Given the following data, I compose a data frame with a factor and a numeric column. X2 <- c(4,4,3,5,4,4,2,3,4,3,5,5,4,3,3,4,2,3,3,4,3,5,3,3,4,4,3,3,5,4,5,4,

Loading some packages with RStudio Server does not work, while it works with R on the server

I am using RStudio Server on a remote server and I have some packages already installed. When I try to load libraries like raster or terra using RStudio Server,

Using findElements() in the RSelenium package to select multiple dropdown items and using clickElement()

I am trying to use clickElement() from the RSelenium package in order to drop down all of the downward-facing arrows in the "distrito" dropdown panel. I can d

dplyr case_when throws error: 'names' attribute [1] must be the same length as the vector [0]

I am running the following case_when inside a dplyr chain: open_flag = case_when ( open_flag == 0 & (click_flag > 0 | mirror_flag > 0) ~ 1, TRUE ~

Problem with `mutate()` input `index_date` when performing time series xgboost in R

Here is my reproducible example: mydat=structure(list(supplier = c("TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP", "TKP",

How to get all transactions from an address via the Etherscan API in R?

I've been looking for a way to download all transactions from a specific contract, from its start up until now. It's around 25k transactions as of today. I've

Using dplyr to calculate percent by group for every column without specifying the name?

This is similar to this. However, what I'm interested in is calculating the percentage for every column. So for example when I do the below I can calculate column

How to remove pairs of rows corresponding to the same value in an R data frame?

For each unique pair of IDs, if both corresponding rows are 0, I need to remove them. In this case, remove row #5 and #6 but not row #7 and #8. tmt.pair <- c("

Using `pool` in an R Shiny package

How can I implement the use of a pool object in an R Shiny application structured like a package? I have structured my package according to the layout described

Convert string into binary vector in R

I'm trying to cluster a set of journals by descriptors and I've been thinking of turning descriptors into a binary vector instead of using string distances (