


slick2/Statistics

 
 


Statistics

This repo contains statistics computed after analyzing data from twitter.

The statistics are intended for creating visualizations, for further analysis, and as input to machine learning models.

dataExtraction.py - run this on your JSON files (the responses you got from the API calls). It extracts the relevant info from the JSON tweets.

    python dataExtraction.py [JSON_FILE] > [my_file]

The output for each tweet is a JSON-encoded list of this form:

  [user_id, text, fav_count, retweets, index, date, hashtags]
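As a rough illustration of the record layout above, the sketch below builds one such list from a single tweet object. It is not the code in dataExtraction.py. The function name `extract_record` is hypothetical, the field names (`user.id`, `favorite_count`, `retweet_count`, `created_at`, `entities.hashtags`) assume the Twitter REST API v1.1 tweet format, and `index` is assumed here to be the tweet's position in the input file.

```python
import json


def extract_record(tweet, index):
    """Build [user_id, text, fav_count, retweets, index, date, hashtags].

    Assumption: `tweet` is a dict shaped like a Twitter API v1.1 tweet object.
    Assumption: `index` is the tweet's position in the input file.
    """
    hashtags = [h["text"] for h in tweet.get("entities", {}).get("hashtags", [])]
    return [
        tweet["user"]["id"],
        tweet["text"],
        tweet.get("favorite_count", 0),
        tweet.get("retweet_count", 0),
        index,
        tweet["created_at"],
        hashtags,
    ]


# A hand-written sample tweet for illustration (not real data).
sample_tweet = {
    "user": {"id": 42},
    "text": "Learning #python today",
    "favorite_count": 3,
    "retweet_count": 1,
    "created_at": "Wed Oct 10 20:19:24 +0000 2018",
    "entities": {"hashtags": [{"text": "python"}]},
}

record = extract_record(sample_tweet, 0)
print(json.dumps(record))
```

Each record is written with `json.dumps`, so downstream scripts can read it back with `json.loads`.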

segregationByUsers.py and segregationByDate.py run on [my_file] and group the tweets by user and by date, respectively.
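The grouping step could look roughly like the sketch below. It is an illustration, not the repository's implementation: the function names are hypothetical, the records are assumed to follow the list layout shown earlier (user_id at position 0, date at position 5), and the date is assumed to be in Twitter's `created_at` format.

```python
from collections import defaultdict
from datetime import datetime

# Positions inside a record [user_id, text, fav_count, retweets, index, date, hashtags].
USER_ID, DATE = 0, 5

# Assumption: dates look like "Wed Oct 10 20:19:24 +0000 2018".
CREATED_AT_FORMAT = "%a %b %d %H:%M:%S %z %Y"


def group_by_user(records):
    """Map each user_id to the list of that user's records."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[USER_ID]].append(rec)
    return dict(groups)


def group_by_date(records):
    """Map each calendar day (YYYY-MM-DD) to the records created on it."""
    groups = defaultdict(list)
    for rec in records:
        day = datetime.strptime(rec[DATE], CREATED_AT_FORMAT).date().isoformat()
        groups[day].append(rec)
    return dict(groups)


# Hand-written sample records for illustration (not real data).
records = [
    [1, "first", 0, 0, 0, "Wed Oct 10 20:19:24 +0000 2018", []],
    [2, "second", 0, 0, 1, "Wed Oct 10 21:00:00 +0000 2018", []],
    [1, "third", 0, 0, 2, "Thu Oct 11 08:00:00 +0000 2018", []],
]

by_user = group_by_user(records)
by_date = group_by_date(records)
print(sorted(by_user), sorted(by_date))
```

In practice the records would come from reading [my_file] and decoding each one with `json.loads`.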

top_hashtags.py runs on [JSON_FILE] and computes the top hashtags in the file.
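A minimal sketch of hashtag counting is shown below. It is not taken from top_hashtags.py. It assumes that [JSON_FILE] stores one tweet object per line, that hashtags live under `entities.hashtags` as in the Twitter API v1.1 format, and that tags should be compared case-insensitively. The function name and the parameter `n` are hypothetical.

```python
import json
from collections import Counter


def top_hashtags(lines, n=10):
    """Return the n most frequent hashtags as (tag, count) pairs.

    Assumption: each non-empty line is one JSON-encoded tweet object.
    """
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        tweet = json.loads(line)
        for tag in tweet.get("entities", {}).get("hashtags", []):
            counts[tag["text"].lower()] += 1
    return counts.most_common(n)


# Hand-written sample input for illustration (not real data).
sample_lines = [
    json.dumps({"entities": {"hashtags": [{"text": "Python"}, {"text": "data"}]}}),
    json.dumps({"entities": {"hashtags": [{"text": "python"}]}}),
    json.dumps({"text": "no hashtags here"}),
]

print(top_hashtags(sample_lines, n=1))
```

With a real file, the `lines` argument can be the open file object itself, since iterating over it yields one line at a time.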

pop_words.py runs on [my_file] and computes the top n most frequent words in the file.
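The word count can be sketched as follows. This is an illustration under stated assumptions, not the logic of pop_words.py: [my_file] is assumed to hold one JSON-encoded record per line with the tweet text at position 1, and the tokenization (lowercasing, then keeping runs of letters, digits, and apostrophes) is a simplification chosen for the example. The function name is hypothetical.

```python
import json
import re
from collections import Counter

# Position of the tweet text inside a record.
TEXT = 1

# Assumption: a "word" is a run of lowercase letters, digits, or apostrophes.
WORD_PATTERN = re.compile(r"[a-z0-9']+")


def popular_words(lines, n=10):
    """Return the n most frequent words as (word, count) pairs."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        counts.update(WORD_PATTERN.findall(record[TEXT].lower()))
    return counts.most_common(n)


# Hand-written sample records for illustration (not real data).
sample_lines = [
    json.dumps([1, "Data is fun", 0, 0, 0, "Wed Oct 10 20:19:24 +0000 2018", []]),
    json.dumps([2, "data, data everywhere", 0, 0, 1, "Wed Oct 10 21:00:00 +0000 2018", []]),
]

print(popular_words(sample_lines, n=1))
```

A real run would likely also filter out stop words, URLs, and user mentions, which this sketch leaves in.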

