Lórum ipse

Running the text generator in browser

scripts/setup.sh

Prepares corpus chunks from the limited subset of Hungarian Webcorpus in build/text_template

scripts/run.sh

The browser should be pointed to http://localhost:9999

Text generation with a template

gzcat resource/sg3_nom_acc_sentences_xaa.txt.gz | langmodel/gibberize.py | less

replaces content words in input sentences with gibberish word forms. The input should be formatted as follows:

# sentence_number

word <TAB> lemma <TAB> analysis

...

# sentence_number

...

Text generation with a fixed template

basic_sentence_demo.py

generates 1000 sentences with definite_article subject verb indefinite_article adjective object structure

Generate random words based on training words

phonmodel.py <list-of-existing-stems

This will create a trigram model based on the input character sequences and output 100 generated stems

Filter sentences from webcorpus

gzcat webcorpus.tagged.gz | iconv -f latin2 -t utf8 | resource/webcorp-parse.py | resource/sentence-filter.py

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.github/workflows		.github/workflows
langmodel		langmodel
resource		resource
scripts		scripts
webapp		webapp
.gitignore		.gitignore
Dockerfile		Dockerfile
Procfile		Procfile
README.md		README.md
requirements.txt		requirements.txt
runtime.txt		runtime.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

langmodel

langmodel

resource

resource

scripts

scripts

webapp

webapp

.gitignore

.gitignore

Dockerfile

Dockerfile

Procfile

Procfile

README.md

README.md

requirements.txt

requirements.txt

runtime.txt

runtime.txt

Repository files navigation

Lórum ipse

Running the text generator in browser

Text generation with a template

Text generation with a fixed template

Generate random words based on training words

Filter sentences from webcorpus

About

Releases

Packages 1

Contributors 3

Languages

lorumipse/lorumipse

Folders and files

Latest commit

History

Repository files navigation

Lórum ipse

Running the text generator in browser

Text generation with a template

Text generation with a fixed template

Generate random words based on training words

Filter sentences from webcorpus

About

Topics

Resources

Stars

Watchers

Forks

Languages