My first attempt at prediction on a big data set. Uses the Kaggle Hearst Challenge data set Trying to learn from the Nearest Neighbors homepage Short link: http://bit.ly/HearstChallenge Summary kd_tree.py Naive kd-Tree implementation