NLTK-Python

Pull Twitter feeds and score sentiment of tweets; then use training set to classify tweets. Also visualize the frequency of unigrams, bigrams and other ngrams, as well as stemming & lemmatization effects.

Next steps include using the features (either unigrams, or ngrams) to train the data and then classify. Additionally, analyze the retweet count as a response variable while using features (presence of terms) in text corpus to predict the probability of retweet count.

Long term steps include incorporating other social media APIs through Python (Pinterest, Facebook, Yelp, and Google+) to indicate the overall web sentiment of particular food/restaurant/business.

Also long term, analyze Uber geographically, to determine if using Census data (on demographics) shows any patterns in positive sentiment. Do older/younger, more affluent/poor, or those without as many public transportations/restaurants have varying levels of Uber sentiment?

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.gitattributes		.gitattributes
README.md		README.md
Top_Tweeters.svg		Top_Tweeters.svg
machinelearn.py		machinelearn.py
pie_chart_twitter_usersource.svg		pie_chart_twitter_usersource.svg
sentiment_scoring.py		sentiment_scoring.py
twitpull.py		twitpull.py
unigrams-neg.txt		unigrams-neg.txt
unigrams-pos.txt		unigrams-pos.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitattributes

.gitattributes

README.md

README.md

Top_Tweeters.svg

Top_Tweeters.svg

machinelearn.py

machinelearn.py

pie_chart_twitter_usersource.svg

pie_chart_twitter_usersource.svg

sentiment_scoring.py

sentiment_scoring.py

twitpull.py

twitpull.py

unigrams-neg.txt

unigrams-neg.txt

unigrams-pos.txt

unigrams-pos.txt

Repository files navigation

NLTK-Python

About

Releases

Packages

Languages

r14152/NLTK-Python

Folders and files

Latest commit

History

Repository files navigation

NLTK-Python

About

Resources

Stars

Watchers

Forks

Languages