Skip to contents

Data frame with known relationship between responses and predictors useful to illustrate multicollinearity concepts. Created from vi using the code shown in the example.

Usage

data(toy)

Format

Data frame with 2000 rows and 5 columns.

Details

Columns:

  • y: response variable generated from a * 0.75 + b * 0.25 + noise.

  • a: most important predictor of y, uncorrelated with b.

  • b: second most important predictor of y, uncorrelated with a.

  • c: generated from a + noise.

  • d: generated from (a + b)/2 + noise.

These are variance inflation factors of the predictors in toy. variable vif b 4.062 d 6.804 c 13.263 a 16.161