Source code for my M.Sc. thesis project in Computer Science at IT-University in Copenhagen, DK. 2013: "Clustering Player Behavior in Data Streams using MapReduce"
Here you can find:
- Normal Kmeans using scipy and numpy arrays
- mrjob Map-Reduce implementation (using a combiner function and a numpy array vectorisation)