Skip to content

hetieke/precis

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Précis

pronounced "preh-si"

Graph Based Multi-Document Text summarization engine with semantic feature weightage.

  • Currently the feature weighing is achieved using WordNet, but this will eventually be replaced with word2vec
  • Still a lot of performance enhancements required. More tests and more API features to follow

Setup

  • Install Python 2.7.x
  • Install virtualenv to manage packages
  • Install packages from eggs.txt
  • Install nltk data packages, from prompt execute these
pip install virtualenv
virtualenv .env # this creates environment for this application
source .env/bin/activate
pip install -r eggs.txt
python nltk_setup.py

or Use make shipped with this project

make env
source .env/bin/activate

Usage

  • After the setup you can run the following script against a document directory with a list of documents belonging to similar topic to generate Summary
./summarize.py <content directory path>

Running Tests (View Readme in tests)

make tests

About

Graph Based Multi-Document Text summarization engine with semantic feature weightage

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.9%
  • Makefile 1.1%