When the data is binary, setting the ranager
argument case.weights
helps to minimize the issues produced by class imbalance. This function takes a binary response variable and returns a vector of weights populated with the values 1/#zeros
and 1/#ones
. It is used internally by the function rf()
.
case_weights(data = NULL, dependent.variable.name = NULL)
Data frame with a response variable and a set of predictors. Default: NULL
Character string with the name of the response variable. Must be in the column names of data
. Default: NULL
A vector with a length equal to nrow(data)
with the respective weights of the cases.
if(interactive()){
data <- data.frame(
response = c(0, 0, 0, 1, 1)
)
case_weights(
data = data,
dependent.variable.name = "response"
)
}