Linguistic feature extractor for Russian text annotated with RFTagger. Extracted features were used for genre classification.

Askinkaty/MDRus_analyser

To get the frequency matrix of every linguistic parameter for every text in the corpus (corpus_3.txt), run md_analyser.py. The program does not tag the text itself: it reads the corpus that has already been morphologically annotated by RFTagger (processed_corpus_3.xml). For further experiments, the table with the FTD annotation of the corpus is also available (annotation_corpus3.csv).
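
As a rough illustration of what a text-by-feature frequency matrix looks like, here is a minimal sketch. It does not reproduce md_analyser.py: the function name `frequency_matrix`, the input format (a dict of tag lists), the toy tag names, and the use of relative rather than raw frequencies are all assumptions made for the example.

```python
from collections import Counter


def frequency_matrix(tagged_texts, features):
    """Return one row per text with the relative frequency of each feature.

    tagged_texts: dict mapping a text id to a list of tags (hypothetical format).
    features: ordered list of tag names to count (hypothetical feature set).
    """
    matrix = {}
    for text_id, tags in tagged_texts.items():
        counts = Counter(tags)
        total = len(tags) or 1  # avoid division by zero for empty texts
        matrix[text_id] = [counts[f] / total for f in features]
    return matrix


# Toy input, not taken from the corpus.
toy = {
    "text_1": ["N", "V", "N", "A"],
    "text_2": ["V", "V", "P", "N"],
}
features = ["N", "V", "A", "P"]

matrix = frequency_matrix(toy, features)
for text_id, row in matrix.items():
    print(text_id, row)
```

Each row corresponds to one text and each column to one feature, which is the shape a downstream genre classifier would typically consume.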
