Skip to content

snukky/vwgec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

65 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GEC VW

Development in progress...

Workflow

Given train.txt, dev.m2, and test.m2:

  1. Train model
    1. Extract features from train.txt producing train.feats and train.cwords
    2. Train VW classifier on train.feats creating model model.vw
  2. Tune threshold parameter
    1. Create file dev.txt with parallel sentences from dev.m2
    2. Extract features from dev.txt producing dev.feats and dev.cwords
    3. Run VW classifier on dev.feats and model model.vw producing dev.pred
    4. Perform grid search to find best threshold parameter
      1. Apply predictions dev.thr.pred into dev.thr.out using dev.cwords
      2. Run M2 scorer on dev.thr.out
  3. Evaluate
    1. Create file test.txt with parallel sentences from test.m2
    2. Extract features from test.txt producing test.feats and test.cwords
    3. Run VW classifier on test.feats and model model.vw producing test.pred
    4. Apply predictions test.pred into test.out using test.cwords
    5. Run M2 scorer on test.out

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published