Long Short Term Memory Units

This is a self-contained package for training a word-level language model on the Penn Tree Bank dataset. It reaches a perplexity of 115 with a small model in about an hour, and 81 with a big model in about a day. An ensemble of 38 big models reaches a perplexity of 69. This code is derived from https://github.com/wojciechz/learning_to_execute (the same author, but a different company).

More information: http://arxiv.org/pdf/1409.2329v4.pdf (Zaremba, Sutskever, and Vinyals, "Recurrent Neural Network Regularization")

POS Tagging

The original code was modified to do POS tagging for the UVa Text Mining course; it reaches 0.953 accuracy on 10% of the treebank data. Word embeddings are taken from Collobert et al.'s 2011 paper; you can find a copy here: http://ronan.collobert.com/senna/
