Skip to content

An extendable docx file format parser and converter

License

Notifications You must be signed in to change notification settings

AaronWan/pydocx

 
 

Repository files navigation

PyDocX

image

PyDocX is a parser that breaks down the elements of an Office Open XML (.docx) and converts them into different markup languages. Currently, only HTML is supported. Markdown and LaTex are planned for the future. You can extend any of the available parsers to customize it to your needs. You can also create your own class that inherits DocxParser to create your own methods for a markup language not yet supported.

To get started using PyDocX, see the Usage guide and also Extending PyDocX.

About

An extendable docx file format parser and converter

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 99.9%
  • Shell 0.1%