k Nearest Neighbor

This repository implements the k Nearest Neighbor machine learning algorithm. Its purpose is to find the examples in the training set with the smallest Euclidean distance from each test example. k is the number of neighbors that vote to determine the test example's class. For example, if we are classifying iris flowers with k equal to 3, and the 3 nearest training examples are classified as Iris-versicolor, Iris-setosa, and Iris-versicolor, the test example will be classified as Iris-versicolor, since that class has the most voting neighbors.
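The voting example above can be sketched in a few lines. This is a minimal illustration, not the repository's code: the function names `euclidean_distance` and `classify` and the four training rows are assumptions made up for the example.

```python
import math
from collections import Counter


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify(train, test_features, k):
    """Return the majority label among the k training rows nearest to test_features.

    `train` is a list of (features, label) pairs (a hypothetical layout).
    """
    nearest = sorted(train, key=lambda row: euclidean_distance(row[0], test_features))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Made-up measurements, chosen so the 3 nearest neighbors are
# two Iris-versicolor rows and one Iris-setosa row.
train = [
    ((5.9, 3.0, 4.2, 1.5), "Iris-versicolor"),
    ((5.1, 3.5, 1.4, 0.2), "Iris-setosa"),
    ((6.0, 2.9, 4.5, 1.5), "Iris-versicolor"),
    ((7.7, 3.8, 6.7, 2.2), "Iris-virginica"),
]

print(classify(train, (5.8, 2.9, 4.1, 1.4), k=3))  # prints Iris-versicolor
```

Sorting the whole training set is the simplest way to show the vote; it does not reflect how the program itself selects neighbors.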

Input

Upon entry to the program, you will be prompted for the following inputs:

Training data set file path
Test data set file path
Header file path (optional if data file's first row contains header names)
K value

Notes: Be sure to include only one empty line at the end of each data file.
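As a rough illustration of how the optional header input might be handled, the sketch below parses a data file's text with or without a separate header. It assumes comma-separated files and a header file holding a single comma-separated line; the function name `read_table` is hypothetical, and the actual program may parse its inputs differently.

```python
import csv
import io


def read_table(data_text, header_text=None):
    """Parse comma-separated text into (column_names, rows).

    If header_text is None, the first data row is taken as the header.
    Blank lines (such as the single empty line at the end of a file)
    are skipped.
    """
    rows = [row for row in csv.reader(io.StringIO(data_text)) if row]
    if header_text is not None:
        names = next(csv.reader(io.StringIO(header_text)))
    else:
        names, rows = rows[0], rows[1:]
    return names, rows


# Header taken from the first row of the data file
names, rows = read_table("length,width,class\n5.1,3.5,Iris-setosa\n\n")

# Header supplied separately
names2, rows2 = read_table("5.1,3.5,Iris-setosa\n\n", "length,width,class\n")
```

Both calls yield the same column names and the same single data row; the values are still strings at this point and would need converting to numbers before any distance is computed.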

Functionality and Data Structures

Transform the file lines into workable dataframes and convert the appropriate strings to numerical values.
Preprocess the datasets by normalizing the values.
Use a min heap to store the Euclidean distances from each test example to the training examples.
Pop the first k values from the heap to determine the classification for the test example.
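The normalization and heap steps above can be sketched as follows. This is an illustration under stated assumptions: min-max scaling is assumed because the normalization scheme is not specified, Python's standard `heapq` module stands in for whatever heap implementation the program uses, and the names `min_max_normalize` and `classify_with_heap` are hypothetical. `math.dist` requires Python 3.8 or later.

```python
import heapq
import math
from collections import Counter


def min_max_normalize(rows):
    """Scale every column to the range [0, 1] (assumed scheme).

    A constant column is mapped to 0.0 to avoid dividing by zero.
    """
    columns = list(zip(*rows))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [
        tuple(
            (value - lo) / (hi - lo) if hi > lo else 0.0
            for value, lo, hi in zip(row, lows, highs)
        )
        for row in rows
    ]


def classify_with_heap(train_features, train_labels, test_features, k):
    """Push every (distance, label) pair onto a min heap, then pop the k smallest."""
    heap = []
    for features, label in zip(train_features, train_labels):
        heapq.heappush(heap, (math.dist(features, test_features), label))
    k = min(k, len(heap))  # guard against k larger than the training set
    nearest = [heapq.heappop(heap) for _ in range(k)]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


scaled = min_max_normalize([(1.0, 10.0), (3.0, 10.0), (2.0, 10.0)])
label = classify_with_heap(
    [(0.0, 0.0), (0.1, 0.1), (1.0, 1.0)], ["a", "a", "b"], (0.05, 0.05), k=3
)
```

Because the heap orders entries by distance, each pop returns the next-closest training example, so the first k pops are exactly the k voting neighbors.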

danerbrear/k-nearest-neighbor