Skip to content

An implementation of the Elongated K-Means Algorithm originally authored by Guido Sanguinetti, Jonathan Laidler and Neil D. Lawrence

Notifications You must be signed in to change notification settings

flybass/EKMeans

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

EKMeans

An implementation of the Elongated K-Means Algorithm originally authored by Guido Sanguinetti, Jonathan Laidler and Neil D. Lawrence. Please find their original paper at http://eprints.pascal-network.org/archive/00001544/01/clusterNumber.pdf.

There are a few notable additions. Following the example of researchers at Carnegie Mellon (please find their work at https://www.cs.cmu.edu/~jgc/publication/PublicationPDF/Automatic_Determination_Of_Number_Of_Clusters_For_Creating_Templates_In_Example-Based_Machine_Translation.pdf), a noise-threshold was added to the algorithm.

Also, an experimental one-cluster test is added to the algorithm. As the affinity matrix is the only input, we try to perturb the affinity matrix directly and look at the difference in fit-statistics between 1 cluster and the fit for k>2.

About

An implementation of the Elongated K-Means Algorithm originally authored by Guido Sanguinetti, Jonathan Laidler and Neil D. Lawrence

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages