README

NAACL 2016 submission "Structured Prediction with Neural Context Features" by Pushpendre Rastogi, Ryan Cotterell, and Jason Eisner

Bibtex

@conference{rastogi2016weighting,
Author = {Pushpendre Rastogi and Ryan Cotterell and Jason Eisner},
Booktitle = {Proceedings of NAACL},
Date-Added = {2016-04-09 07:28:31 -0400},
Date-Modified = {2016-04-09 07:32:46 -0400},
Keywords = {Neural Network,Neural Symbolic,Finite State Transducer},
Title = {Weighting Finite-State Transductions With Neural Context},
Year = {2016},
Abstract = {How should one apply deep learning to tasks such as morphological reinflection, which stochastically edit one string to get another? A recent approach to such sequence-to-sequence tasks is to compress the input string into a vector that is then used to generate the output string, using recurrent neural networks. In contrast, we propose to keep the traditional architecture, which uses a finite-state transducer to score all possible output strings, but to augment the scoring function with the help of recurrent networks. A stack of bidirectional LSTMs reads the input string from left-to-right and right-to-left, in order to summarize the input context in which a transducer arc is applied. We combine these learned features with the transducer to define a probability distribution over aligned output strings, in the form of a weighted finite-state automaton. This reduces hand-engineering of features, allows learned features to examine unbounded context in the input string, and still permits exact inference through dynamic programming. We illustrate our method on the tasks of morphological reinflection and lemmatization.}}

Instructions

Run the following commands to compile the WFST portion of the model:

     $ cd src/python/transducer
     $ make # Make transducer.so and copy to src
     $ cd - # Go back to toplevel

The following command will train the neural transducer model on the 4th fold of the rP-pA morphological transduction task. The test file is not used at this point.

    PYTHONPATH=$PWD/src/python THEANO_FLAGS=floatX=float32 python -c "import transducer_score; print (
        transducer_score.main(train_fn='res/celex/rP-pA/0500/4/train',
                              dev_fn='res/celex/rP-pA/0500/4/dev',
                              test_fn='res/celex/rP-pA/0500/4/test',
                              folder='results/tmp'))"
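
The path arguments in the command above follow a regular layout, so they can be generated rather than typed by hand. The following is a minimal sketch, not part of the repository: `fold_files` is a hypothetical helper, and the parameter name `subset` is only a placeholder for the `0500` path component, whose meaning is not stated here.

```python
import posixpath


def fold_files(task="rP-pA", subset="0500", fold=4, root="res/celex"):
    """Build the train/dev/test paths for one fold.

    Hypothetical helper: it only mirrors the directory layout visible in
    the training command (root/task/subset/fold/{train,dev,test}).
    """
    base = posixpath.join(root, task, subset, str(fold))
    return {
        "train_fn": posixpath.join(base, "train"),
        "dev_fn": posixpath.join(base, "dev"),
        "test_fn": posixpath.join(base, "test"),
    }


# Same arguments as the training command above.
train_kwargs = fold_files()
train_kwargs["folder"] = "results/tmp"
print(train_kwargs)
```

With `PYTHONPATH` and `THEANO_FLAGS` set as in the command above, the dictionary could then be passed on with `import transducer_score; transducer_score.main(**train_kwargs)`.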

Once the model has been trained and saved under results/tmp, test it as follows:

    PYTHONPATH=$PWD/src/python THEANO_FLAGS=floatX=float32 python -c "import transducer_score; print (
        transducer_score.main(train_fn='res/celex/rP-pA/0500/4/train',
                              dev_fn='res/celex/rP-pA/0500/4/dev',
                              test_fn='res/celex/rP-pA/0500/4/test',
                              folder='results/tmp2',
                              pretrained_param_pklfile='results/tmp/transducer.pkl',
                              perform_training=0,
                              perform_testing=1,
                              nepochs=-1))"
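
Testing depends on the pickle file written by the training run, so it can help to confirm that the file exists before starting Theano. The following is a minimal sketch under that assumption: `evaluation_kwargs` and `missing_files` are hypothetical helpers, and the argument values are copied from the test command above.

```python
import os


def evaluation_kwargs(train_folder="results/tmp", out_folder="results/tmp2"):
    """Keyword arguments for a test-only run, copied from the command above."""
    return {
        "folder": out_folder,
        "pretrained_param_pklfile": os.path.join(train_folder, "transducer.pkl"),
        "perform_training": 0,
        "perform_testing": 1,
        "nepochs": -1,
    }


def missing_files(paths):
    """Return the paths that do not exist as regular files."""
    return [p for p in paths if not os.path.isfile(p)]


kwargs = evaluation_kwargs()
absent = missing_files([kwargs["pretrained_param_pklfile"]])
if absent:
    print("Train the model first; missing: " + ", ".join(absent))
```

The `train_fn`, `dev_fn`, and `test_fn` arguments still have to be added to `kwargs` before it is passed to `transducer_score.main`.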

For more complicated usage, see the following scripts, which contain the parameter strings used to obtain all the results in the paper, including the ablation results:

    src/python/transducer_celex.sh
    src/python/transducer_celex_test.sh

FAQ

How do I fix "ImportError: No module named transducer"?

If you get the following error:
```
  File "neural_wfst/src/python/transducer_score.py", line 25, in <module>
          from transducer.src.transducer import Transducer
  ImportError: No module named transducer
```
Then run the following commands from the top level of the repository:
```
  $ cd src/python/transducer
  $ make # Make transducer.so and copy to src
```
See the Makefile in src/python/transducer to understand what the build does. If compilation still fails, please raise an issue.
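
Before rebuilding, it may be worth checking whether the compiled extension is present at all. The following is a minimal sketch, not part of the repository: `extension_path` and `extension_is_built` are hypothetical helpers, and the location of `transducer.so` is an assumption inferred from the import path in the traceback and the Makefile comment ("copy to src"). Consult the Makefile for the actual destination.

```python
import os


def extension_path(repo_root="."):
    """Assumed location of the compiled extension (not confirmed by the Makefile)."""
    return os.path.join(
        repo_root, "src", "python", "transducer", "src", "transducer.so"
    )


def extension_is_built(repo_root="."):
    """True if a file exists at the assumed extension location."""
    return os.path.isfile(extension_path(repo_root))


# Run from the top level of the repository.
if not extension_is_built():
    print("Not found: " + extension_path())
    print("Run make in src/python/transducer.")
```

If the file is present and the import still fails, check that `PYTHONPATH` includes `src/python`, as in the training and testing commands.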
