brianhuey/Netflix

Netflix-project

We are working with the Netflix dataset used in the 2007 KDD Cup competition, which provides information on the characteristics of Netflix users who left ratings for movies from 1998 to 2004 (http://www.kdd.org/kdd-cup-2007-consumer-recommendations).

Structure:


/baseline - map/reduce jobs that create global-effects features and append them to the validation/test set. Includes graphs of the features.
/reports - report documentation, including our final report.
/transform_and_predict - transforms features, fits a logistic model, makes predictions, and computes RMSE against the 2006 answer set.
/data_manipulation - various operations that produce validation sets and format data.
/training_set_sample - map/reduce jobs that subset and sample the training set.
/nmf - creates and runs the non-negative matrix factorization algorithm. Includes a graph of the feature.
/cosine - creates and runs the cosine similarity algorithm. Includes graphs of the feature.
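
For readers unfamiliar with two of the quantities named above, here is a minimal Python sketch of RMSE and of cosine similarity between two movies' ratings. It is an illustration only, not the code in /transform_and_predict or /cosine (those run as map/reduce jobs and are not reproduced here). The function names, the dictionary layout (user id mapped to rating), and the choice to compare only users who rated both movies are assumptions made for this example.

```python
import math


def rmse(predicted, actual):
    """Root mean squared error between two equal-length rating sequences."""
    if len(predicted) != len(actual) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_error = sum((p - a) ** 2 for p, a in zip(predicted, actual))
    return math.sqrt(squared_error / len(actual))


def cosine_similarity(ratings_a, ratings_b):
    """Cosine similarity between two movies.

    Each argument is a dict mapping user id -> rating (hypothetical layout).
    Only users who rated both movies contribute (an assumption of this sketch).
    """
    common_users = ratings_a.keys() & ratings_b.keys()
    if not common_users:
        return 0.0
    dot = sum(ratings_a[u] * ratings_b[u] for u in common_users)
    norm_a = math.sqrt(sum(ratings_a[u] ** 2 for u in common_users))
    norm_b = math.sqrt(sum(ratings_b[u] ** 2 for u in common_users))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

A lower RMSE means predictions are closer to the answer set. A cosine similarity near 1 means the two movies were rated in similar proportions by the users they share.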

About

Team 15 Netflix!
