Skip to content

bertini36/SpanishCorpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spanish Corpus

The Python dependencies are described in requirements.txt. To install I recommend install virtualenv and virtualenvwrapper (http://rukbottoland.com/blog/tutorial-de-python-virtualenvwrapper/). Then you can install the spanishCorpus requirements with the command: pip install -r requirements.txt

This little library needs some packages of NLTK (Natural Language Toolkit). To install these packages is required init a Python console in the virtualenv in which the requirements are installed.

Now enter the command: nltk.download()

In the nltk manager you have to ensure that Corpora has installed:

  • cess_esp
  • omw
  • stopwords
  • wordnet
  • words

And in Models you have to ensure that Snowball_data is installed.

Install German and French dictionaries to more accurately:

  • sudo apt-get install myspell-de-de
  • sudo apt-get install myspell-fr-fr

In main.py you have an example of the library's use.

About

Library to ease spanish text mining in Python language

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages