Skip to content

DigitalExperiments/epi-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#About this Program This is a simple command-line utility to extract information from TEI XML documents. It is only compatible with python 2.7.x at the moment. It was created by the Digital Experiments Working Group at NYU for their Epigraphs Project.

It parses XML files using user-provided XPATH expressions, and writes these to the standard output, in tab-separated format.

#Usage Examples You can specify your xpath in the command line:

python parse.py -x '//epigraph/q/l' 4397.xml

Or you can put it all in a file called xpath.txt (example included in this repository).

You can also specify multiple xml files:

python parse.py 4397.xml 4399.xml 5129.xml

Or use the wildcard to parse all XML files in your directory:

python parse.py *.xml

You can also write the output to a file like this:

python parse.py 4397.xml > output.txt

Releases

No releases published

Packages

No packages published