Skip to content

alvations/Wikicorpus

Repository files navigation

Wikicorpus

This repo records a list of Wikipedia-related corpora

Off-the-shelf

Build-It-Yourself

  • SeedLing: a seed corpus for the Human Language Project
  • Lucene Wiki: After downloading wiki.xml, you can use WikiIndexer.py to index the text with pylucene.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages