- Description
A Python project to cluster a collection of high-dimensional text streams.
- Author
Krishna Y. Kamath
A Python project to cluster a collection of high-dimensional text streams. An example of hd-streams is Twitter, where every user can be considered as a separate evolving stream of tweets. This project implements techniques to cluster such streams efficiently.
The data for this project will be a stream of tweets in json format. For more details look at Twitter Streaming API.