Skip to content

windweller/sentence2vec

 
 

Repository files navigation

sentence2vec

Tools for mapping a sentence with arbitrary length to vector space

We provide an implementation of the Paragraph Vector in Quoc Le and Tomas Mikolov's paper: Distributed representations of Sentences and Documents.

This project is based on gensim.

install requires:

  • 'scipy >= 0.7.0'
  • 'six >= 1.2.0'

2014-9-23 update: add test files for demo.

Fork Changes

added class DirectoryForSentence to iterator over a folder, treating each file as a "sentence".
added a MTurk python script to preprocess CSV formatted file and put into a directory for DirectoryForSentence

About

Tools for mapping a sentence with arbitrary length to vector space

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.8%
  • C 0.2%