When the data is binary, setting the ranager argument case.weights helps to minimize the issues produced by class imbalance. This function takes a binary response variable and returns a vector of weights populated with the values 1/#zeros and 1/#ones. It is used internally by the function rf().
case_weights(data = NULL, dependent.variable.name = NULL)Data frame with a response variable and a set of predictors. Default: NULL
Character string with the name of the response variable. Must be in the column names of data. Default: NULL
A vector with a length equal to nrow(data) with the respective weights of the cases.
if(interactive()){
data <- data.frame(
response = c(0, 0, 0, 1, 1)
)
case_weights(
data = data,
dependent.variable.name = "response"
)
}