DrBronzeAge/LatinIntertextFinder_Alpha
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This is the alpha version of a module that finds and visualizes intertextualites in Latin-- initially built for the speeches of Cicero. It tackles the problem by lemmatizing all the words in the speech, then splitting each sentence into a bag of shingles. If sentences have any shingles that cross a threshold for Jaccard similarity they are flagged and eventually dumped into html for easier expert inspection. There are also functions to visualize the density and distribution of intertexts, but those currently rely on R.
About
Find Latin Intertext with bags of words/jaccard similarity
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published