Skip to content
/ utils Public

Utilities for development polish language support for Spacy.

Notifications You must be signed in to change notification settings

spacy-pl/utils

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

92 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spacy-pl-utils

Utilities for development polish language support for Spacy.

Workflow rules

Guide is in this document.

Folders

Use separated folders for every module pipeline.

Using DVC

We store all artifacts in folders data and models.

Adding new step

To add new step to pipeline use

dvc run [-d <dependencies>] [-o <result_artifacts>] (-f <step_name>.dvc) <command>

in main folder.

Reproduce artifact

When something in pipeline changed and you want to run all steps again, use

dvc repro <step_name>

with <step_name> to which you want to rerun pipeline.

Synchronization

First, set up remote.

Synchronization always works with in connection with current branch and commit. dvc push Send all tracked artifacts to remote. dvc pull Get all tracked arfitfacts from remote.

Print pipeline

dvc pipeline show -c --ascii

Releasing models

To package model please run python deployment/combine_and_package.py with proper arguments. For example

python deployment/combine_and_package.py \
 --pos-path models/pos_NKJP_word2vec/model-fial/ \
 --tree-path models/trees-pos_LFG_word2vec/model-final/ \
 --ner-path models/ner_nkjp_word2vec/model-final/ \
 --output-path models/release \
 --blank-vectors-path models/blank_NKJP_word2vec/

fill poll presented by script with usefull infomations

About

Utilities for development polish language support for Spacy.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •