Skip to content

jonovik/bibliograph.parsing

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

bibliograph.parsing

This package provides named utilities for parsing bibliographic references from a number of standard formats into python dictionaries. Supported formats include bibtex, endnote, medline, ris and xml (mods).

Details

Each parser accepts input from a given bibliographic reference format and outputs a list of python dictionaries, one for each entry listed in the input source. Each of these dictionaries will contain some number of the following fields:

Field Name: Required: Description of Field Contentsx:
reference_type Yes the type of content referenced by this entry
title Yes the title of the content referenced by this entry
abstract No short description or summary of the content referenced by this entry
publisher ? name of the publishing company
publication_year ? year in which the content was published
publication_month ? month in which the content was published
publication_url ? fully-qualified url pointing to an online version of the content
authors Yes list of dictionaries, one for each author of the content. The dictionaries will contain three items: 'firstname' (given name), 'lastname' (surname, family name), middlename (any name or names in-between the first and last names)
journal No Title of the journal in which the content appears
volume No Volume of the periodical in which the content appears
number No Number of the periodical in which the content appears
pages No Page numbers within the given volume:number of the periodical in which the content appears

Requirements

Configuration

bibliograph.parsing honors the environment variable FIX_BIBTEX. If set, the module will clean up BibTeX import data through the bib2xml | xml2bib pipeline in order cleanup up improper or misformatted BixTeX data. However you may lose some data (e.g. the anotate field will be filtered out through Bibutils).

References

Formats for input files have been gleaned from a number of sources:

Resources

Contributors

About

named utilities for parsing bibliographic references from a number of standard formats (bibtext, endnote, medline, ris, xml (mods))

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 67.2%
  • TeX 32.8%