The research project is completed for the Bioinformatics Algorithms Course at Computer Science, Purdue University.
You need to have the dataset in a data folder.
The file all_preprocess.py has all the preprocessing steps. The file step3.py has the step of the cross validation and performance output, it prints out the accuracies. The file plot.py, plots the graphs. The file test4.py has the RF explanation function.