A set of toy problems examining different encoding methods for categorical variables for the purpose of classification.
- Ordinal
- One-Hot
- Binary
- Helmert Contrast
- Sum Contrast
- Polynomial Contrast
- Backward Difference Contrast
The datasets used in these examples are car, mushroom, and splice datasets from the UCI dataset repository, found here:
BSD