forked from commonsense/metanl
Some convenient natural language tools that build on NLTK.
License
imclab/metanl
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Multilingual natural language tools, wrapping NLTK and other systems. This package provides wrappers around NLTK and other systems to provide convenient natural language tools, such as: - Tokenizers - Stopword removers - Word frequency lookup - Lemmatizers (which reduce words to their root form, possibly taking part-of-speech tags into account) - Analyzers for East Asian languages (for example, we currently use a MeCab process to find word breaks in Japanese) For word frequencies in some language, metanl uses corpora from the University of Leeds Center for Translation Studies (http://corpus.leeds.ac.uk/list.html), whose data is released under the Creative Commons Attribution license. Author: Rob Speer
About
Some convenient natural language tools that build on NLTK.
Resources
License
Stars
Watchers
Forks
Packages 0
No packages published