Sentiment Analysis of Twitter Data

Sentiment analysis (also known as opinion mining) refers to the use of natural language processing, text analysis and computational linguistics to identify and extract subjective information in source materials. The code given in the repository is implemented in python and is using Naive Bayes Classifier and also it's improved version.

This was built in Python 3.4 and please refer to the pre-requistes required for running it successfully.

Pre-requisites -

Python 3.4 (although it shoudn't have problem running on other Python versions apart from some syntax changes)
NLTK for Python
Stopwords corpus
Wordnet corpus

Usage -

Clone/Download zip of SentAnalysis.
Extract content of the zip.
Run mainFile.py in IDLE or cmd and you would see the following -

Enter file name: twitterData.csv #Enter the filename(enter path also if stored in another location)

Enter number of records to be read: 5000 #The top n records will be taken from the dataset for creating the model

Processing..

NBC or improved NBC? Enter NBC/iNBC:
(Improved NBC takes into account high information words and removes the low information words, cutoff set to 5) - iNBC

Improved NBC is running..

Accuracy = 75.15974440894568%

Enter sample text to determine sentiment: @VirginAmerica completely awesome experience last month BOS-LAS nonstop. Thanks for such an awesome flight and depart time. #VAbeatsJblue
positive

Steps to install nltk -

Extract the zip content
Open cmd
Run the following commands -

cd "folderpath where zip is extracted"
python setup.py install

For nltk_data -

Extract the contents in your python directory.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
Sentiment Analysis.pdf		Sentiment Analysis.pdf
classifierNBC.py		classifierNBC.py
classifierNBC1.py		classifierNBC1.py
correctingReplacingWords.py		correctingReplacingWords.py
mainFile.py		mainFile.py
multipleTagging.py		multipleTagging.py
nltk.zip		nltk.zip
nltk_data.zip		nltk_data.zip
tagAndLemmatize.py		tagAndLemmatize.py
twitterData.csv		twitterData.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Sentiment Analysis.pdf

Sentiment Analysis.pdf

classifierNBC.py

classifierNBC.py

classifierNBC1.py

classifierNBC1.py

correctingReplacingWords.py

correctingReplacingWords.py

mainFile.py

mainFile.py

multipleTagging.py

multipleTagging.py

nltk.zip

nltk.zip

nltk_data.zip

nltk_data.zip

tagAndLemmatize.py

tagAndLemmatize.py

twitterData.csv

twitterData.csv

Repository files navigation

Sentiment Analysis of Twitter Data

Pre-requisites -

Usage -

Steps to install nltk -

For nltk_data -

About

Releases

Packages

Languages

prav10194/sentiment-analysis-python

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis of Twitter Data

Pre-requisites -

Usage -

Steps to install nltk -

For nltk_data -

About

Resources

Stars

Watchers

Forks

Languages