Sentiment analysis project
-
Folder StockAnalysis - all data acquisition and data pre-processing code.
-
Folder Sentiment - Sentiment analysis and sentiment classification code.
-
Folder rating - code for Analyst credibility module.
-
scrape.py : Distributed scraping
-
preprocess.py : Data pre-processing
-
match.py : Named Entity Recognition
-
clean.py : Handle negation
-
featurelist.py : Generate list of features
-
label.py : Perform sentiment labelling
-
dictionary.txt : File containing financial dictionary words
-
stopwords.txt : File containing stopwords
-
bayes_clean.py : Code to clean data for Bayes module
-
bayes.py : Bayes code
-
my_metrics.py : Calculate metrics of system
-
NLTK.py : NLTK Bayes code
This is where the Bayes prediction model is stored. New incoming comments are classified based on this saved model.