Official code for the paper "Deep Contextualized Self-training for Low Resource Dependency Parsing".
If you use this code, please cite our paper.
Requirements:

- Python 3.7
- Pytorch 1.1.0
- Cuda 10.0

To install the remaining dependencies, simply run:

    pip install -r requirements.txt
The data is preprocessed in CoNLL format. The data folder can be obtained from here.
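As a quick sanity check on downloaded files, here is a minimal, hypothetical reader sketch for tab-separated, blank-line-delimited dependency files (the exact column layout of the preprocessed files may differ; this is not the repository's own loading code):

```python
# Minimal CoNLL-style reader: one token per line, tab-separated columns,
# sentences separated by blank lines. (Illustrative sketch only.)
def read_conll(path):
    sentences, current = [], []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\n")
            if not line:
                # Blank line closes the current sentence.
                if current:
                    sentences.append(current)
                    current = []
            else:
                current.append(line.split("\t"))
    if current:  # file may not end with a blank line
        sentences.append(current)
    return sentences
```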
Set `word_path="./data/morph.word.200.vec"` and `char_path="./data/morph.char.30.vec"`. The embeddings can be found here.
Possible word embedding options: `['random', 'fasttext']`. The multilingual word embeddings (`.vec` extension) should be placed under the `data/multilingual_word_embeddings` folder.
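A `.vec` file in the fastText text format begins with a header line (`<vocab_size> <dim>`) followed by one token and its vector per line. A minimal loader sketch, assuming that format (the repository has its own loading code; this is only illustrative):

```python
def load_vec(path):
    """Load word vectors from a fastText-style .vec text file
    into a dict mapping word -> list of floats. (Sketch only.)"""
    vectors = {}
    with open(path, encoding="utf-8") as f:
        header = f.readline().split()
        dim = int(header[1])  # vector dimensionality from the header
        for line in f:
            parts = line.rstrip().split(" ")
            word = parts[0]
            vectors[word] = [float(x) for x in parts[1:1 + dim]]
    return vectors
```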
In order to run the low-resource in-domain experiments, there are three steps to follow:
- Running the base Biaffine parser
- Running the sequence tagger(s)
- Running the combined DCST parser
Create an empty `saved_model` folder to store the new models.
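The folder can be created from the command line, or equivalently in Python:

```python
import os

# Create the (initially empty) folder that will hold the trained models.
# exist_ok=True makes the call safe to repeat.
os.makedirs("saved_model", exist_ok=True)
```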
If you want to run the complete model, simply run the bash script `run_dcst.sh`; otherwise, refer to the corresponding section in `run_dcst.sh` to run the corresponding segments.
- Without POS tags: don't use the `--use_pos` flag at any stage (base model, auxiliary tasks, final ensembled model).
- With coarse-level tags: use the input files from the `data` folder with the `--use_pos` flag.
- With POS-level tags: swap the 2nd and 3rd columns of all the files in the `data` folder.
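The column swap above can be sketched as follows, assuming tab-separated files with blank lines between sentences (a hypothetical helper, not part of the repository; verify the delimiter against the actual data files):

```python
def swap_columns(in_path, out_path, i=1, j=2):
    """Swap two 0-indexed tab-separated columns (defaults swap the
    file's 2nd and 3rd columns). Blank lines pass through unchanged."""
    with open(in_path, encoding="utf-8") as fin, \
         open(out_path, "w", encoding="utf-8") as fout:
        for line in fin:
            stripped = line.rstrip("\n")
            if stripped:
                cols = stripped.split("\t")
                cols[i], cols[j] = cols[j], cols[i]
                fout.write("\t".join(cols) + "\n")
            else:
                fout.write("\n")
```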
Add the pretrained model file from here to the path `./utils/morph_tagger/cwlm_lstm_crf_cas_2.model`.
- Pretrained morph tagger layer: run `run_dcst.sh`.
- Pretrained morph tagger layer with freezing: comment this and then run `run_dcst.sh`.
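In PyTorch, freezing a pretrained module amounts to disabling gradients for its parameters. A generic sketch (the tagger's actual attribute names in this repository may differ, so the usage line is hypothetical):

```python
import torch.nn as nn

def freeze(module: nn.Module) -> None:
    """Disable gradient updates for all parameters of a module,
    so the optimizer leaves its pretrained weights untouched."""
    for param in module.parameters():
        param.requires_grad = False

# Hypothetical usage, assuming `model.morph_tagger` holds the
# pretrained tagger layer:
# freeze(model.morph_tagger)
```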