Janome

Janome is a Japanese morphological analysis engine written in pure Python.

General documentation:

http://mocobeta.github.io/janome/en/ (English)

http://mocobeta.github.io/janome/ (Japanese)

Requirements

Python 2.7.x or 3.3+ is required.

Install

[Note] This consumes about 500 MB memory for building.

(venv) $ python setup.py install

Run

(env) $ python
>>> from janome.tokenizer import Tokenizer
>>> t = Tokenizer()
>>> for token in t.tokenize(u'すもももももももものうち'):
...     print(token)
...
すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も    助詞,係助詞,*,*,*,*,も,モ,モ
もも  名詞,一般,*,*,*,*,もも,モモ,モモ
も    助詞,係助詞,*,*,*,*,も,モ,モ
もも  名詞,一般,*,*,*,*,もも,モモ,モモ
の    助詞,連体化,*,*,*,*,の,ノ,ノ
うち  名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ

License

Licensed under Apache License 2.0 and uses the MeCab-IPADIC dictionary/statistical model.

See LICENSE.txt and NOTICE.txt for license details.

Acknowledgement

Special thanks to @ikawaha and @takuya_a.

Name		Name	Last commit message	Last commit date
Latest commit History 175 Commits
benchmark		benchmark
bin		bin
docs		docs
docs_en		docs_en
examples		examples
ipadic		ipadic
janome		janome
scripts		scripts
tests		tests
.gitignore		.gitignore
.travis.yml		.travis.yml
CHANGES.txt		CHANGES.txt
LICENSE.txt		LICENSE.txt
NOTICE.txt		NOTICE.txt
README.rst		README.rst
setup.py		setup.py

License

shnend/janome

Folders and files

Latest commit

History

Repository files navigation

Janome

Requirements

Install

Run

License

Acknowledgement

Copyright

About

Resources

License

Stars

Watchers

Forks

Languages