Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Report		Report
Response		Response
dev		dev
generated_files		generated_files
src		src
test		test
train		train
README.md		README.md
dev.key		dev.key
report.pdf		report.pdf
scorer.py		scorer.py
sentiment-vocab.tff		sentiment-vocab.tff
train.key		train.key

Repository files navigation

NLP-Project1

The project has the following structure:

A directory called src which contains the code for the classiﬁers and for the plotting functionality.
A directory called Response which contains the response ﬁles of my “main” classiﬁer and “special” classiﬁer. My best “main” classiﬁer was the Naive Bayes, with a smoothing parameter of α = 10−3. I tried to improve on the Naive Bayes Classiﬁer according to the paper mentioned in the project speciﬁcation. My best classiﬁer from that was the Complement Naive Bayes classiﬁer, with the same smoothing parameter as above. (More details in the report)
A directory called generated ﬁles which contain ﬁles where I have dumped the data produced by the classiﬁers. Some of the ﬁles in that directory are used to plot the data, and some are used to just to verify the metrics.
Directories train, dev and test contain the training, development and test data respectively.
The ﬁles train.key, dev.key and scorer.py are used to obtain the accuracy metrics of the training and development data.
The ﬁle sentiment-vocab.tﬀ is the sentiment vocabulary

About

Text Classification

Report repository

Releases

No releases published

Packages

No packages published