Imputation before LDA classification

This repository aims at testing and comparing several imputation methods before performing LDA classification.

Description

We run our experiments on 2 datasets. The first one is a data banknote authentification dataset containing four numerical features and a binary class. The second is simply generated by sampling from 2 multivariate gaussian distributions. The nature of the covariance matrices as well as the dimension (number of features) can be defined via command line arguments, as well as the missingness probability and the type of missingness. The imputation methods that get tested are "Grand Mean", "Conditional Mean", "Nearest Neighbour", "Regression". The concept of conditional mean is described in the pdf file at the root of the repository, along with the description of my experiments and conclusion.

Getting Started

Dependencies

matplotlib=3.8
numpy=1.26
pillow=10.0
python=3.12
scikit-learn=1.3.0

Installing

cd ImputationForLDA
conda create --name {env} --file requirements.txt

Executing program

python main.py --dimensions 5 --cov_matrice normal --probs_missingness 0.1 --type_missingness MCAR

Help

Please reach out.

Authors

Mathieu Charbonnel

Version History

December 2023
- Initial Release

License

This project is not licensed.

Acknowledgments

To Robert J. Durrant who supervised this scientific work.
Lohweg,Volker. (2013). banknote authentication. UCI Machine Learning Repository.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data		data
src		src
.gitignore		.gitignore
ImputationForLDA.pdf		ImputationForLDA.pdf
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

src

src

.gitignore

.gitignore

ImputationForLDA.pdf

ImputationForLDA.pdf

README.md

README.md

main.py

main.py

requirements.txt

requirements.txt

Repository files navigation

Imputation before LDA classification

Description

Getting Started

Dependencies

Installing

Executing program

Help

Authors

Version History

License

Acknowledgments

About

Releases

Packages

Languages

mathieu-charbonnel/ImputationForLDA

Folders and files

Latest commit

History

Repository files navigation

Imputation before LDA classification

Description

Getting Started

Dependencies

Installing

Executing program

Help

Authors

Version History

License

Acknowledgments

About

Resources

Stars

Watchers

Forks

Languages