Skip to content

rishab93/py_stringmatching

 
 

Repository files navigation

py_stringmatching

This project seeks to build a Python software package that consists of a comprehensive and scalable set of string tokenizers (such as alphabetical tokenizers, whitespace tokenizers) and string similarity measures (such as edit distance, Jaccard, TF/IDF). The package is free, open-source, and BSD-licensed.

Important links

Dependencies

py_stringmatching has been tested on Python 2.7, Python 3.3, Python 3.4 and Python 3.5.

The required dependencies to build the package are NumPy 1.7.0 or higher, Six, and a C or C++ compiler. For the development version, you will also need Cython.

Platforms

py_stringmatching has been tested on Linux, OS X and Windows.

About

A comprehensive and scalable set of string tokenizers and similarity measures in Python

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 97.4%
  • Batchfile 1.4%
  • PowerShell 1.1%
  • Shell 0.1%