Skip to content

The source code for "An Actor Critic Algorithm for Structured Prediction"

License

Notifications You must be signed in to change notification settings

liubo-cs/actor-critic-public

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Actor-Critic for Sequence Prediction

The reference implementation for the paper

An Actor-Critic Algorithm for Sequence Prediction (openreview, submitted to ICLR 2017)
by Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, Yoshua Bengio

Note, that in fact it is a heavily modified speech recognizer, so please do not be surprised by the presense of speech related terms in the code. The code is provided only for replication purposes, further development is not planned. If you have questions, please contact Dzmitry Bahdanau or just create an issue here.

How to use

  • install all the dependencies (see the list below)
  • set your environment variables by calling source env.sh
  • for training use $LVSR/bin/run.py train <save-to> <config>
  • for testing use $LVSR/bin/run.py search <model-path> <config>

Please proceed to exp/ted for the instructions how to replicate our machine translation results on TED data, or to exp/billion_words in order to run our spelling correction experiments.

Dependencies

  • Python packages: pykwalify, toposort, pyyaml, numpy, pandas, picklable-itertools, theano;
  • blocks
  • blocks-extras
  • fuel

The code in this repository is known to work with master branches of the repositories listed above as of 21.11.2016

License

MIT

About

The source code for "An Actor Critic Algorithm for Structured Prediction"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 94.5%
  • Python 4.9%
  • Shell 0.6%