a sandbox for studying different methods of coding categorical variables

jizhihang/categorical_encoding

 
 


Categorical Encoding Methods

A set of toy problems that compare different encoding methods for categorical variables in classification tasks.

Encoding Methods

  • Ordinal
  • One-Hot
  • Binary
  • Helmert Contrast
  • Sum Contrast
  • Polynomial Contrast
  • Backward Difference Contrast
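
To make the list above concrete, here is a minimal pure-Python sketch of several of these encodings. It is not the repository's implementation: every function name is hypothetical, levels are ordered by sorting their labels, binary codes start at 0, and the Helmert and backward difference matrices follow one common convention (the one used by R's `contr.helmert` and by patsy's `Diff`), which may differ from the code in this repository. Polynomial contrasts are omitted because they require orthogonal polynomial construction.

```python
def ordinal_encode(values):
    """Map each category to an integer index (levels sorted by label)."""
    levels = sorted(set(values))
    index = {level: i for i, level in enumerate(levels)}
    return [index[v] for v in values], levels


def one_hot_encode(values):
    """One 0/1 indicator column per category level."""
    codes, levels = ordinal_encode(values)
    return [[1 if c == j else 0 for j in range(len(levels))] for c in codes]


def binary_encode(values):
    """Write each ordinal code in base 2, one column per bit (assumes codes start at 0)."""
    codes, levels = ordinal_encode(values)
    width = max(1, (len(levels) - 1).bit_length())
    return [[(c >> bit) & 1 for bit in reversed(range(width))] for c in codes]


def sum_contrast(k):
    """k x (k-1) sum contrast matrix: the last level is -1 in every column."""
    rows = [[1 if i == j else 0 for j in range(k - 1)] for i in range(k - 1)]
    rows.append([-1] * (k - 1))
    return rows


def helmert_contrast(k):
    """k x (k-1) Helmert matrix: column j compares level j+1 with all earlier levels."""
    def entry(i, j):
        if i <= j:
            return -1
        return j + 1 if i == j + 1 else 0
    return [[entry(i, j) for j in range(k - 1)] for i in range(k)]


def backward_difference_contrast(k):
    """k x (k-1) matrix: column j compares level j+1 with level j."""
    return [[-(k - j - 1) / k if i <= j else (j + 1) / k for j in range(k - 1)]
            for i in range(k)]


def contrast_encode(values, contrast):
    """Replace each value with the row of a contrast matrix for its level."""
    codes, levels = ordinal_encode(values)
    matrix = contrast(len(levels))
    return [matrix[c] for c in codes]


colors = ["red", "green", "blue", "green"]
print(ordinal_encode(colors)[0])             # [2, 1, 0, 1]
print(one_hot_encode(colors))                # 3 columns, one per level
print(binary_encode(colors))                 # 2 columns for 3 levels
print(contrast_encode(colors, sum_contrast)) # 2 columns, "red" row is [-1, -1]
```

One-hot encoding needs one column per level, binary encoding needs roughly log2 of the number of levels, and each contrast scheme needs one column fewer than the number of levels, which is the main trade-off these toy problems explore.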

Datasets

The examples use the car, mushroom, and splice datasets from the UCI Machine Learning Repository, found here:

datasets

License

BSD
