Naive_Bayes_Classifier

This is a school project I did in the course "Data Mining and Data Warehousing"

This is a desktop application of a generic Classifier based on "Naive Bayes" algorithm using m-estimator (m=2) https://en.wikipedia.org/wiki/Naive_Bayes_classifier

Workflow

Constructing the structure of the model using Structure.txt file.
Data Pre-processing: Data Cleaning: Fill in missing values, Identify outliers and smooth out noisy data (using the Equal-width Partitioning Discretization Method) , Correct inconsistent data.
Loading the train set
Building the classifier using the train set
Loading the test set
Classifying the records with Naive Bayes classifier using m-estimator (m=2)

Resources

This project includes data files to test the classifier with:

Dataset general info.txt - general information about the data base from which the data is taken
Structure.txt - Description of the data set attributes.
train.csv - the train set
test.csv - the test set

Prerequisites

Install Python 2.7 (Since the project uses pandas library, best to use Anaconda Distribution) can download here: https://www.anaconda.com/download/

Running The Program

python Prog.py

Usage

Browse the directory with the Structure.txt , train.csv and test.cxs files
Type the desired number of Discretization Bins
Click Build
Click Classify

The classification results will be outputed to output.txt

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Resources		Resources
Classifier.py		Classifier.py
PreProcessing.py		PreProcessing.py
Prog.py		Prog.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resources

Resources

Classifier.py

Classifier.py

PreProcessing.py

PreProcessing.py

Prog.py

Prog.py

README.md

README.md

Repository files navigation

Naive_Bayes_Classifier

Workflow

Resources

Prerequisites

Running The Program

Usage

About

Releases

Packages

Languages

barshaison/Naive_Bayes_Classifier

Folders and files

Latest commit

History

Repository files navigation

Naive_Bayes_Classifier

Workflow

Resources

Prerequisites

Running The Program

Usage

About

Resources

Stars

Watchers

Forks

Languages