
liuxialong/word_embedding_theano

 
 


Word Embedding

A Theano implementation of the paper "Natural Language Processing (Almost) from Scratch".

Status

Quite a few important items are still unfinished (see TODO), but the model does seem to be learning the embedding.

Input

The ATIS dataset, which contains 46635 sentences over a vocabulary of 572 words.

Output

A word embedding (one vector) for each word in the vocabulary.
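To make the output format concrete, here is a minimal sketch of how an embedding table is typically stored and queried. It is not taken from train.py: the tiny vocabulary, the dimension of 50, and the names `word_to_index`, `embeddings`, and `embed` are all assumptions for illustration.

```python
import numpy as np

# Hypothetical tiny vocabulary; the real ATIS vocabulary has 572 words.
vocab = ["flights", "from", "boston", "to", "denver"]
word_to_index = {word: i for i, word in enumerate(vocab)}

# Assumed embedding dimension; the value used in train.py may differ.
embedding_dim = 50

# One row per word. Random values stand in for trained weights.
rng = np.random.RandomState(0)
embeddings = rng.uniform(-0.1, 0.1, size=(len(vocab), embedding_dim))

def embed(word):
    """Return the embedding row for a single word."""
    return embeddings[word_to_index[word]]

vec = embed("boston")
```

Each word maps to a row index, so looking up an embedding is a single array index into the matrix.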

How to run

python train.py

You need Theano 0.7 or later and NumPy installed to run the program.

TODO

  1. cPickle the embedding at the end of each epoch.
  2. Use a validation set to avoid overfitting.
  3. Hyperparameter tuning.
  4. Perhaps run on the GPU, because training is really slow right now.
  5. Normalize the embedding?
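Items 1 and 5 could be sketched roughly as follows. This is not part of the repository: `save_embeddings`, `normalize_rows`, the file naming scheme, and the 572 x 50 matrix shape are hypothetical, and the random matrix stands in for whatever train.py actually learns. The sketch uses `pickle`, which is the Python 3 name for what Python 2 exposes as `cPickle`.

```python
import os
import pickle  # on Python 2, cPickle offers the same interface
import tempfile

import numpy as np

def save_embeddings(embeddings, epoch, out_dir):
    """Pickle the embedding matrix to one file per epoch (hypothetical naming)."""
    path = os.path.join(out_dir, "embeddings_epoch_%d.pkl" % epoch)
    with open(path, "wb") as f:
        pickle.dump(embeddings, f, protocol=pickle.HIGHEST_PROTOCOL)
    return path

def normalize_rows(embeddings, eps=1e-8):
    """Scale each word vector to unit L2 norm, guarding against zero rows."""
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    return embeddings / np.maximum(norms, eps)

# Stand-in for the trained matrix: 572 words, assumed dimension 50.
rng = np.random.RandomState(0)
emb = rng.normal(size=(572, 50))

out_dir = tempfile.mkdtemp()
for epoch in range(2):
    # A real training pass over the data would update `emb` here.
    path = save_embeddings(emb, epoch, out_dir)

# Reload the last checkpoint and normalize it for similarity queries.
with open(path, "rb") as f:
    restored = pickle.load(f)
unit = normalize_rows(restored)
```

With unit-length rows, the dot product of two word vectors equals their cosine similarity, which is one common reason to normalize after training.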

Useful links

A nice blog about Python internals
