Skip to content

In this project, we collect the tweets from twitter API, using sentiment algorithm to divide users into two groups-positive and negative. Then using Spark to do k-means clustering in these two groups based on the followings of users. In the end, calculating the TF-IDF of tweets of different users, then do k-means clustering based on TF-IDF.

Notifications You must be signed in to change notification settings

bigmianhua/Twitter-clustering

About

In this project, we collect the tweets from twitter API, using sentiment algorithm to divide users into two groups-positive and negative. Then using Spark to do k-means clustering in these two groups based on the followings of users. In the end, calculating the TF-IDF of tweets of different users, then do k-means clustering based on TF-IDF.

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages