GitHub - ljo/collatex: CollateX

CollateX is a software to

read multiple (≥ 2) versions of a text, splitting each version into parts (tokens) to be compared,
identify similarities of and differences between the versions (including moved/transposed segments) by aligning tokens, and
output the alignment results in a variety of formats for further processing, for instance
to support the production of a critical apparatus or the stemmatical analysis of a text's genesis.

It resembles software used to compute differences between files (e.g. diff) or tools for sequence alignment which are commonly used in Bioinformatics. While CollateX shares some of the techniques and algorithms with those tools, it mainly aims for a flexible and configurable approach to the problem of finding similarities and differences in texts, sometimes trading computational soundness or complexity for the user's ability to influence results.

As such it is primarily designed for use cases in disciplines like Philology or – more specifically – the field of Textual Criticism where the assessment of findings is based on interpretation and therefore can be supported by computational means but is not necessarily computable.

Please go to http://collatex.net/ for further information.

Name		Name	Last commit message	Last commit date
Latest commit History 2,945 Commits
collatex-core		collatex-core
collatex-pythonport		collatex-pythonport
collatex-servlet		collatex-servlet
collatex-tools		collatex-tools
site		site
.editorconfig		.editorconfig
.gitignore		.gitignore
CREDITS		CREDITS
LICENSE.txt		LICENSE.txt
README.md		README.md
RELEASE-HOWTO.md		RELEASE-HOWTO.md
changelog.txt		changelog.txt
logging.properties		logging.properties
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

collatex-core

collatex-core

collatex-pythonport

collatex-pythonport

collatex-servlet

collatex-servlet

collatex-tools

collatex-tools

site

site

.editorconfig

.editorconfig

.gitignore

.gitignore

CREDITS

CREDITS

LICENSE.txt

LICENSE.txt

README.md

README.md

RELEASE-HOWTO.md

RELEASE-HOWTO.md

changelog.txt

changelog.txt

logging.properties

logging.properties

pom.xml

pom.xml

Repository files navigation

About

Releases

Packages

Languages

License

ljo/collatex

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Languages