
turicas/pypln.web

 
 


PyPLN

PyPLN is a distributed pipeline for natural language processing, written in Python and built on NLTK and ZeroMQ. The goal of the project is to make it easy to process large corpora with NLTK through a REST API.

PyPLN is sponsored by Fundação Getulio Vargas.

License

PyPLN is free software, released under the GPLv3 (https://gnu.org/licenses/gpl-3.0.html).

Using

You can start by looking at our quickstart guide.

Installing

To install dependencies (on a Debian-like GNU/Linux distribution):

sudo apt-get install python-setuptools mongodb
pip install virtualenv virtualenvwrapper
mkvirtualenv pypln.web
pip install -r requirements/production.txt

You will also need to install the NLTK data. To do so, follow the instructions in the NLTK documentation.
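
As a rough aid for the NLTK data step, the sketch below reports which NLTK resources are still missing. The helper name `missing_nltk_resources` and the two resource paths in the usage comment are illustrative assumptions. This README does not say which NLTK data sets PyPLN needs, so adjust the list to your setup.

```python
def missing_nltk_resources(resources, find=None):
    """Return the resource paths that cannot be located.

    `find` defaults to nltk.data.find, which raises LookupError when a
    resource is not installed. It is a parameter only so the helper can
    be exercised without NLTK (hypothetical design choice).
    """
    if find is None:
        import nltk  # imported lazily; requires NLTK to be installed
        find = nltk.data.find
    missing = []
    for path in resources:
        try:
            find(path)
        except LookupError:
            missing.append(path)
    return missing


# Example usage (resource names are assumptions, not PyPLN requirements):
#   missing_nltk_resources(["tokenizers/punkt", "corpora/stopwords"])
# Anything reported as missing can be fetched with nltk.download(),
# for example nltk.download("punkt").
```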

Developing

To run tests:

workon pypln.web
pip install -r requirements/development.txt
make test

To run the development webserver:

workon pypln.web
pip install -r requirements/project.txt
./manage.py runserver --settings=pypln.web.settings.development
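
To confirm that the development server started, a small check like the one below may help. It is only a sketch: it assumes the server listens on Django's default `runserver` address, 127.0.0.1:8000, and the helper name `port_is_open` is hypothetical. It only tests that something accepts TCP connections on that port, not that PyPLN itself is answering.

```python
import socket


def port_is_open(host="127.0.0.1", port=8000, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        # create_connection raises OSError (refused, timed out, ...) on failure
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Assumes the default runserver address; change it if you passed another one.
    if port_is_open():
        print("development server is accepting connections")
    else:
        print("nothing is listening on 127.0.0.1:8000")
```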

See our code guidelines.

About

Web interface for PyPLN
