GitHub - DARPA-LORELEI/uhhmm

UHHMM - Unsupervised Hierarchical Hidden Markov Model

Usage: python3 scripts/d1trainer.py OR make

There is a sample config file in config/d1train.ini. If you run using make, the first time it runs it will copy that file into myconfig.ini. You can then modify that config file and subsequent runs of make will use it.

The config has an 'io' (input/output) section and a 'params' (machine learning parameters) section. 'io' requires an input file, output directory, and an optional dictionary file. The input file should contain one sentence per line, with space-separated ints representing tokens. The output directory is where all output will be stored. This is so multiple runs will be preserved if desired. The first thing the d1trainer.py script will do is copy the config file into the output directory, to promote reproducibility. The dictionary file should contain a mapping between words and their index. Each line should be . While it is technically optional, it is borderline required for understanding output.

There are scripts in the scripts/ sub-directory to convert token files into int files, creating a dictionary as well.

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
config		config
data		data
doc/latex		doc/latex
scripts		scripts
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config

config

data

data

doc/latex

doc/latex

scripts

scripts

.gitignore

.gitignore

Makefile

Makefile

README.md

README.md

Repository files navigation

About

Releases

Packages

Contributors 2

Languages

DARPA-LORELEI/uhhmm

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages