forked from turian/parser-model
-
Notifications
You must be signed in to change notification settings - Fork 0
Sandy4321/parser-model
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Parser model written by Joseph Turian available from: http://github.com/turian/parser-model/tree/master A neural network with a sparse input, for predicting decisions of a natural language syntax parser. Designed to make decisions in Giuseppe Attardi's DeSR parser, but usable in other history-based parsers. Little of this code is parser specific, it is mainly generic neural network code. desr.txt is a notepad of my results while experimenting with the DeSR parser. features/ are used for transforming the features, to include Brown clusters or word embeddings. See features/README.txt This code relies upon Theano (http://pylearn.org/theano/) ================================== DATA PREPROCESSING You need a $DATA directory containing $DATA/english_ptb_train.feats After editing the hyperparameters.*.yaml: # Create files fmap.features.pkl.gz and fmap.labels.pkl.gz ./extract-features.py # Create files $DATA/english_ptb_train.train.feats and $DATA/english_ptb_train.validation.feats ./partition-examples.py # Shuffle examples grep . $DATA/english_ptb_train.train.feats | ~/dev/common-scripts/shuffle.sh > $DATA/english_ptb_train.train.shuffled.feats mv $DATA/english_ptb_train.train.feats $DATA/english_ptb_train.train.unshuffled.feats ln -s english_ptb_train.train.shuffled.feats $DATA/english_ptb_train.train.feats grep . $DATA/english_ptb_train.validation.feats | ~/dev/common-scripts/shuffle.sh > $DATA/english_ptb_train.validation.shuffled.feats mv $DATA/english_ptb_train.validation.feats $DATA/english_ptb_train.validation.unshuffled.feats ln -s english_ptb_train.validation.shuffled.feats $DATA/english_ptb_train.validation.feats
About
A neural network with a sparse input, for predicting decisions of a natural language syntax parser.
Resources
Stars
Watchers
Forks
Packages 0
No packages published
Languages
- Python 95.7%
- Shell 2.6%
- Perl 1.7%