The papers below were implemented using a Korean corpus.
- Using the Naver sentiment movie corpus v1.0
- Hyper-parameters were arbitrarily selected (epochs: 5, mini-batch size: 128).
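For context, a minimal sketch of how the corpus might be loaded and split to match the column sizes in the table below, assuming the standard tab-separated `ratings_train.txt` / `ratings_test.txt` files of the NSMC distribution (the file names and the 120,000/30,000 split strategy are assumptions, not details taken from this work):

```python
import pandas as pd

# Assumed NSMC layout: tab-separated files with columns (id, document, label).
train = pd.read_csv("ratings_train.txt", sep="\t")  # 150,000 labelled reviews
test = pd.read_csv("ratings_test.txt", sep="\t")    # 50,000 labelled reviews

# Hold out 30,000 reviews for validation (the split strategy is an assumption).
train = train.sample(frac=1.0, random_state=42).reset_index(drop=True)
train_df, val_df = train.iloc[:120_000], train.iloc[120_000:]

EPOCHS = 5        # hyper-parameter stated above
BATCH_SIZE = 128  # hyper-parameter stated above
```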
| Model | Train ACC (120,000) | Validation ACC (30,000) | Test ACC (50,000) |
| :--- | :---: | :---: | :---: |
| SenCNN | 92.87% | 86.87% | 86.38% |
| CharCNN | 85.63% | 81.58% | 81.58% |
| ConvRec | 86.80% | 82.66% | 82.29% |
| VDCNN | 86.31% | 83.87% | 83.90% |
| SAN | 93.90% | 86.52% | 86.35% |
- Convolutional Neural Networks for Sentence Classification (as SenCNN; see the sketch after this list)
- Character-level Convolutional Networks for Text Classification (as CharCNN)
- Efficient Character-level Document Classification by Combining Convolution and Recurrent Layers (as ConvRec)
- Very Deep Convolutional Networks for Text Classification (as VDCNN)
- A Structured Self-attentive Sentence Embedding (as SAN)
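As a reference for the models compared above, here is a minimal PyTorch sketch of a SenCNN-style classifier in the spirit of Kim (2014): word embeddings, parallel convolutions with several kernel widths, max-over-time pooling, and a linear classifier. The vocabulary size, embedding dimension, and filter settings are illustrative assumptions rather than the configuration actually used here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SenCNN(nn.Module):
    """CNN for sentence classification in the style of Kim (2014)."""
    def __init__(self, vocab_size=30000, embed_dim=128,
                 num_filters=100, kernel_sizes=(3, 4, 5), num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # One 1-D convolution per kernel width, applied along the token axis.
        self.convs = nn.ModuleList(
            [nn.Conv1d(embed_dim, num_filters, k) for k in kernel_sizes]
        )
        self.dropout = nn.Dropout(0.5)
        self.fc = nn.Linear(num_filters * len(kernel_sizes), num_classes)

    def forward(self, tokens):                      # tokens: (batch, seq_len)
        x = self.embedding(tokens).transpose(1, 2)  # (batch, embed_dim, seq_len)
        # Max-over-time pooling of each feature map.
        pooled = [F.relu(conv(x)).max(dim=-1).values for conv in self.convs]
        features = self.dropout(torch.cat(pooled, dim=-1))
        return self.fc(features)                    # (batch, num_classes)
```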
- Dataset created from https://github.com/songys/Question_pair
- Hyper-parameters were arbitrarily selected (epochs: 5, mini-batch size: 64).
| Model | Train ACC (6,060) | Validation ACC (1,516) |
| :--- | :---: | :---: |
| SAN | 91.93% | 81.46% |
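For reference, the structured self-attentive pooling at the core of SAN (Lin et al., 2017) can be sketched as below: attention weights A = softmax(W_s2 tanh(W_s1 Hᵀ)) are computed over the encoder states H, and the sentence matrix is M = A·H. Hidden sizes and the number of attention hops are illustrative assumptions; in the paraphrase setting above, each of the two questions would be pooled this way and the resulting embeddings compared by a classifier.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SelfAttentivePooling(nn.Module):
    """Structured self-attention (Lin et al., 2017): M = A @ H with
    A = softmax(W_s2 @ tanh(W_s1 @ H^T)), one row of A per attention hop."""
    def __init__(self, hidden_dim=256, attn_dim=128, num_hops=4):
        super().__init__()
        self.w_s1 = nn.Linear(hidden_dim, attn_dim, bias=False)
        self.w_s2 = nn.Linear(attn_dim, num_hops, bias=False)

    def forward(self, h):                             # h: (batch, seq_len, hidden_dim)
        scores = self.w_s2(torch.tanh(self.w_s1(h)))  # (batch, seq_len, num_hops)
        a = F.softmax(scores, dim=1).transpose(1, 2)  # (batch, num_hops, seq_len)
        m = a @ h                                     # (batch, num_hops, hidden_dim)
        # Frobenius-norm penalty ||A A^T - I||_F^2 from the paper, added to the
        # loss so that different hops attend to different parts of the sentence.
        identity = torch.eye(a.size(1), device=a.device)
        penalty = ((a @ a.transpose(1, 2) - identity) ** 2).sum(dim=(1, 2)).mean()
        return m, penalty
```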
- Character-Aware Neural Language Models
- Using the Naver nlp-challenge corpus for NER
- Hyper-parameters were arbitrarily selected.
- Bidirectional LSTM-CRF Models for Sequence Tagging
- End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF
- Neural Architectures for Named Entity Recognition
- Effective Approaches to Attention-based Neural Machine Translation
- Attention Is All You Need (see the attention sketch after this list)
- Bi-directional attention flow for machine comprehension
- Deep contextualized word representations
- Improving Language Understanding by Generative Pre-Training
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Language Models are Unsupervised Multitask Learners
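Several of the papers above, from "Attention Is All You Need" onward, are built around the same core operation, scaled dot-product attention. A minimal PyTorch sketch of that operation (a generic illustration, not any particular implementation from this list):

```python
import math
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    """softmax(Q K^T / sqrt(d_k)) V as defined in "Attention Is All You Need".

    q: (batch, num_queries, d_k), k: (batch, num_keys, d_k), v: (batch, num_keys, d_v)
    mask: optional boolean tensor, True at positions that may be attended to.
    """
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)  # (batch, num_queries, num_keys)
    if mask is not None:
        scores = scores.masked_fill(~mask, float("-inf"))
    weights = F.softmax(scores, dim=-1)                # attention distribution per query
    return weights @ v, weights
```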