Extracts unformatted body text from MediaWiki wikitext

edgarskos/mwtextextractor

 
 

mwtextextractor

https://travis-ci.org/danmichaelo/mwtextextractor.png?branch=master https://coveralls.io/repos/danmichaelo/mwtextextractor/badge.png

mwtextextractor extracts plain body text from MediaWiki wikitext by stripping templates, HTML tags, tables, headers, and similar markup. The extracted text can then be used for tasks such as word counting.

Example:

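from mwtextextractor import get_body_text

# Strip the {{ipsum}} template and print the remaining body text.
# print() with a single argument works under both Python 2 and Python 3.
print(get_body_text('Lorem {{ipsum}} dolor'))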
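
The word-counting use case mentioned above could look roughly like the sketch below. The helper name count_words and the sample string are assumptions made for illustration; they are not part of mwtextextractor. To keep the sketch runnable without the package installed, it works on a stand-in string rather than on real get_body_text output.

```python
def count_words(text):
    """Count whitespace-separated tokens in already-extracted body text.

    Hypothetical helper, not part of mwtextextractor. It uses a plain
    whitespace split, so punctuation attached to a word is counted as
    part of that word.
    """
    return len(text.split())


# Stand-in for text that has already been extracted. With the package
# installed, this could instead be: body = get_body_text(wikitext)
body = "Lorem dolor sit amet"
print(count_words(body))
```

A whitespace split is the simplest possible definition of a word; a stricter count (for example, ignoring tokens that contain no letters) would need a different tokenizer.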
