Text Classification

Text classification is a fundamental task in natural language processing and machine learning.
There are many ways to classify text. More recently, deep learning methods have been applied to the task.
This repository covers classification with CNN and LSTM, two representative deep learning models.
It also summarizes SVM and naive Bayes, which are classic methods.

Run

python execute.py --method naive_bayes --dataset yelp_review_polarity

Experiments

Classifiers

Classifier	Link
CNN	Paper
LSTM	Keras
Character Level CNN	Paper
SVM	scikit-learn
Naive Bayes	scikit-learn
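
As a rough illustration of the two scikit-learn rows above, the sketch below trains a TF-IDF pipeline with MultinomialNB and with LinearSVC on a toy corpus. The TF-IDF features, the `build_classifier` helper, and the toy sentences are assumptions made for this example. They are not taken from the code in classifiers, which may be set up differently.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy corpus for illustration only; not drawn from any dataset in this repository.
train_texts = [
    "the team won the match in overtime",
    "the striker scored two goals in the final",
    "the coach praised the players after the game",
    "stocks fell as the market reacted to rates",
    "the bank raised interest rates again",
    "investors sold shares after the earnings report",
]
train_labels = ["sports", "sports", "sports", "business", "business", "business"]

test_texts = [
    "the players scored two goals in the game",
    "investors sold shares as the market fell",
]


def build_classifier(method):
    """Return an untrained TF-IDF pipeline for a method name (hypothetical helper)."""
    if method == "naive_bayes":
        model = MultinomialNB()
    elif method == "svm":
        model = LinearSVC()
    else:
        raise ValueError(f"unknown method: {method}")
    return make_pipeline(TfidfVectorizer(), model)


predictions = {}
for method in ("naive_bayes", "svm"):
    clf = build_classifier(method)
    clf.fit(train_texts, train_labels)
    predictions[method] = clf.predict(test_texts).tolist()
    print(method, predictions[method])
```

Wrapping the vectorizer and the model in one pipeline keeps the vocabulary fitted on the training texts only, so the same object can be reused unchanged on validation data.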

Result

AG's News

Classes: 4
Train Data Size: 120,000
Test Data Size: 7,600
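
A minimal sketch of reading data in this shape, assuming the commonly distributed CSV layout for AG's News (class index from 1 to 4, then title, then description). The files under dataset may be organized differently. The `load_rows` helper and the two sample rows are made up for illustration.

```python
import csv
import io

# Two made-up rows in the assumed layout: class index, title, description.
sample_csv = (
    '"3","Example headline A","First example description."\n'
    '"2","Example headline B","Second description, with a comma."\n'
)


def load_rows(handle):
    """Return (texts, labels), shifting labels from 1..4 to 0..3 (hypothetical helper)."""
    texts, labels = [], []
    for class_index, title, description in csv.reader(handle):
        # Join title and description into one input string per example.
        texts.append(f"{title} {description}")
        labels.append(int(class_index) - 1)
    return texts, labels


texts, labels = load_rows(io.StringIO(sample_csv))
print(labels)
print(texts[1])
```

For a real file, `io.StringIO(sample_csv)` would be replaced by `open(path, newline="", encoding="utf-8")`.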

Classifier	validation loss	validation accuracy
CNN	0.2994	0.9055
LSTM	0.2587	0.9106
Character Level CNN	0.3692	0.8709
SVM	-	0.9007
Naive Bayes	-	0.9182
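
The table does not state how the two metrics are computed, and no loss is reported for SVM or Naive Bayes. As a sketch under the assumption that the loss is categorical cross-entropy (a common choice for multi-class models in Keras) and that accuracy is the fraction of correct predictions, the two columns could be derived as follows. The probabilities below are hypothetical, not outputs of the models in this repository.

```python
import math


def accuracy(true_labels, predicted_labels):
    """Fraction of examples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


def cross_entropy(true_labels, probabilities, eps=1e-12):
    """Mean negative log-probability assigned to the true class."""
    total = 0.0
    for label, probs in zip(true_labels, probabilities):
        # Clip at eps so a zero probability does not raise a math domain error.
        total -= math.log(max(probs[label], eps))
    return total / len(true_labels)


# Hypothetical predicted distributions for three examples over four classes.
true_labels = [0, 2, 3]
probabilities = [
    [0.7, 0.1, 0.1, 0.1],
    [0.2, 0.2, 0.5, 0.1],
    [0.4, 0.3, 0.2, 0.1],
]
# The predicted class is the index with the highest probability.
predicted_labels = [max(range(len(p)), key=p.__getitem__) for p in probabilities]

val_accuracy = accuracy(true_labels, predicted_labels)
val_loss = cross_entropy(true_labels, probabilities)
print(f"validation accuracy: {val_accuracy:.4f}")
print(f"validation loss: {val_loss:.4f}")
```

The third example is misclassified with low confidence in the true class, which raises the loss far more than it lowers the accuracy. This is one reason the two columns can rank models differently.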

guokeda/text_classification
