henchc/MHG-scansion

Middle High German Scansion

This repo contains the most recent work on a model for MHG epic meter scansion. It builds on the CLFL 2016 repo, in which a CRF model was employed. Improvements so far include:

  • Restriction to 4 stresses per line, 1 stress per foot
  • Secondary stress is no longer predicted, but algorithmically determined
  • A GUI has been created in tkinter
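
The stress restriction in the first bullet can be illustrated with a small check. This is only a sketch, not the repo's code: the label names `S` and `U`, the foot-as-list representation, and the function `is_valid_line` are all hypothetical and chosen for illustration.

```python
from typing import List

STRESSED = "S"    # hypothetical label for a stressed syllable
UNSTRESSED = "U"  # hypothetical label for an unstressed syllable


def is_valid_line(feet: List[List[str]], stresses_per_line: int = 4) -> bool:
    """Return True if every foot carries exactly one stress and the
    line as a whole carries the expected number of stresses."""
    stresses_per_foot = [foot.count(STRESSED) for foot in feet]
    one_stress_each = all(n == 1 for n in stresses_per_foot)
    return one_stress_each and sum(stresses_per_foot) == stresses_per_line


# Hypothetical scansions: each inner list is one foot.
good_line = [["S", "U"], ["S", "U"], ["S", "U"], ["S"]]
bad_line = [["S", "S"], ["U"], ["S", "U"], ["S"]]

print(is_valid_line(good_line))
print(is_valid_line(bad_line))
```

A check like this could be used to filter or re-rank predicted label sequences, although how the repo actually enforces the restriction is not stated here.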

NAACL-CLFL 2016

Title: Supervised Machine Learning for Hybrid Meter
Authors: Alex Estes and Christopher Hench

Abstract:
Following classical antiquity, European poetic meter was complicated by traditions negotiating between the prosodic stress of vernacular dialects and a classical system based on syllable length. Middle High German (MHG) epic poetry found a solution in a hybrid qualitative and quantitative meter. We develop a CRF model to predict the metrical values of syllables in MHG epic verse, achieving an F-score of .894 on 10-fold cross-validated development data (outperforming several baselines) and .904 on held-out testing data. The method used in this paper presents itself as a viable option for other literary traditions, and as a tool for subsequent genre or author analysis.

Data and source code for paper

Dependencies:

  • sklearn (pip install scikit-learn)
  • nltk (pip install nltk)
  • pycrfsuite (pip install python-crfsuite)
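
To give a sense of what a CRF toolkit such as python-crfsuite consumes, the sketch below turns a syllabified line into one feature dictionary per syllable. The feature names, the vowel set, the syllabification of the sample line, and the helpers `syllable_features` and `line_features` are assumptions made for illustration; they are not the features used in the paper or in this repo.

```python
from typing import Dict, List

# Assumed vowel inventory, for illustration only.
VOWELS = set("aeiouâêîôûæœäöü")


def syllable_features(syllables: List[str], i: int) -> Dict[str, object]:
    """Build a hypothetical feature dict for the syllable at index i."""
    syl = syllables[i].lower()
    feats: Dict[str, object] = {
        "length": len(syl),
        "ends_in_vowel": syl[-1:] in VOWELS,  # open syllable, roughly
        "position": i,
        "is_first": i == 0,
        "is_last": i == len(syllables) - 1,
    }
    if i > 0:
        # Context from the preceding syllable.
        feats["prev_length"] = len(syllables[i - 1])
    return feats


def line_features(syllables: List[str]) -> List[Dict[str, object]]:
    """One feature dict per syllable, in line order."""
    return [syllable_features(syllables, i) for i in range(len(syllables))]


# Assumed syllabification of a half-line, for illustration only.
sample = ["uns", "ist", "in", "al", "ten", "mæ", "ren"]
features = line_features(sample)
print(features[5])
```

Each line then becomes a sequence of feature dicts paired with a sequence of metrical labels, which is the general shape of input that sequence labellers expect.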
