An implementation of the InferDTD algorithm as described in "Bex et al. Inference of concise DTDs from XML data"

mvaled/inferdtd

Introduction

This package implements the iDTD algorithm described in Bex et al., "Inference of concise DTDs from XML data".

Given a set of XML samples, the algorithm infers a DTD that validates those samples. There are some restrictions, though:

  1. The inference is restricted to the structure of the document, i.e. no attempt is made to infer the data types of the values involved.
  2. Currently only the inference of SOREs (single occurrence regular expressions) is implemented, not CRX (chain regular expressions), so many samples must be provided in order to infer a good DTD. Note that if your XML is like a listing, you can provide many samples of the items in a single XML document.
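
To illustrate why a listing-like document already counts as many samples, here is a minimal sketch of the first step of SORE inference as described in the paper: collecting the child-element sequences of each element and turning them into a single occurrence automaton (a set of "may follow" edges). It is not this package's API; the names `child_sequences`, `build_soa`, `SRC` and `SINK` are hypothetical and chosen only for the illustration, and the later step that rewrites the automaton into a regular expression is left out.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical markers for the automaton's start and end states.
SRC, SINK = "<src>", "<sink>"


def child_sequences(xml_text):
    """Map each element name to every child-name sequence observed for it."""
    seqs = defaultdict(list)
    for elem in ET.fromstring(xml_text).iter():
        seqs[elem.tag].append([child.tag for child in elem])
    return seqs


def build_soa(sequences):
    """Return the edge set of the automaton: pairs (a, b) where b was seen
    directly after a in at least one sample sequence."""
    edges = set()
    for seq in sequences:
        path = [SRC, *seq, SINK]
        edges.update(zip(path, path[1:]))
    return edges


# One listing-like document yields two sample sequences for <item>.
sample = (
    "<catalog>"
    "<item><name/><price/></item>"
    "<item><name/><note/><price/></item>"
    "</catalog>"
)
seqs = child_sequences(sample)
item_soa = build_soa(seqs["item"])
```

Here both `item` elements contribute a sequence, so the automaton for `item` records that `note` may appear between `name` and `price`. With only the first `item` as a sample, that edge would be missing, which is why more samples tend to give a better result.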

Warning

I haven't had the time to finish this project. The basic algorithm is implemented, but it still lacks the main functionality of generating the DTD. Volunteers are welcome!

License

Alpha versions of this package are distributed under the terms of the GPL version 2.

We plan to relicense stable versions under the LGPL.
