fernandopso/twitter-svm-tfidf.py

Twitter data mining with Python


Using a Support Vector Machine (SVM) and Term Frequency–Inverse Document Frequency (TF-IDF) in three steps:

  1. Collect many tweets from Twitter
  2. Classify some tweets as positive, negative, or neutral
  3. Predict the sentiment of the remaining tweets
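The three steps above can be sketched with scikit-learn (an assumption about this repo's stack; the tiny training set and its labels here are purely illustrative):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Step 2: a few hand-labelled tweets (hypothetical data)
tweets = ["i love this phone", "what a great day", "awful customer service",
          "worst movie ever", "it is an ordinary tuesday", "nothing special here"]
labels = ["positive", "positive", "negative", "negative", "neutral", "neutral"]

# TF-IDF turns each tweet into a weighted term vector;
# LinearSVC learns a decision boundary over those vectors
model = make_pipeline(TfidfVectorizer(), LinearSVC())
model.fit(tweets, labels)

# Step 3: predict the sentiment of unseen tweets
pred = model.predict(["this service is awful", "i love it"])
```

In the real project the labelled set would come from step 2's manual classification, not a hard-coded list.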

System dependencies

sudo apt-get install build-essential python-dev python-setuptools \
                     python-numpy python-scipy libblas-dev gfortran \
                     libatlas-dev libatlas3gf-base liblapack-dev \
                     libatlas-base-dev

If you use Python 3

sudo apt-get install python3-minimal

Install Packages

Use pip with a virtualenv:

pip install -r requirements.txt

Configuration

The Natural Language Toolkit (NLTK) provides human language data (over 50 corpora and lexical resources) in many languages and formats, such as Twitter samples, the RSLP Stemmer (Removedor de Sufixos da Língua Portuguesa, a stemmer for Portuguese), the complete works of Machado de Assis for Brazilian Portuguese, and much more.

To download all corpora:

python -m nltk.downloader all

Or download the corpora of your choice from the Python interpreter:

>>> import nltk
>>> nltk.download()

A new window should open, showing the NLTK Downloader.

Credentials

Set your Twitter credentials from the Twitter Application Manager in the variables CONSUMER_KEY, CONSUMER_SECRET, ACCESS_TOKEN, and ACCESS_TOKEN_SECRET.
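One common pattern for wiring these in (an assumption; the repo may set them differently) is to read the four values from environment variables so they stay out of version control. The helper below is hypothetical:

```python
import os

def twitter_credentials():
    """Collect the four Twitter credentials from the environment.

    The variable names match the ones the README mentions; failing early
    on missing values gives a clearer error than a rejected API call.
    """
    names = ("CONSUMER_KEY", "CONSUMER_SECRET",
             "ACCESS_TOKEN", "ACCESS_TOKEN_SECRET")
    creds = {name: os.environ.get(name) for name in names}
    missing = [name for name, value in creds.items() if not value]
    if missing:
        raise RuntimeError("missing Twitter credentials: " + ", ".join(missing))
    return creds
```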

Run tests

python -m unittest discover
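`discover` picks up files named `test_*.py`; a minimal test case of the shape it expects (the class and method names here are illustrative) looks like:

```python
import unittest

class TestSmoke(unittest.TestCase):
    # unittest discovery collects methods whose names start with "test"
    def test_truth(self):
        self.assertTrue(True)

# run the case directly, as `python -m unittest discover` would
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestSmoke)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```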

Start

Run the Human-Machine Interface

python hmi.py

Example

Collect

(screenshot: collecting tweets)

Listing collected tweets

(screenshot: listing tweets)

Classification

(screenshot: training)

Prediction

(screenshot: prediction)

Roadmap
