Skip to content

soumith/gentle

 
 

Repository files navigation

Gentle

Robust yet lenient forced-aligner built on Kaldi. A tool for aligning speech with text.

Beta Build Status

Getting Started

There are three ways to install Gentle.

  1. Download the pre-built Mac application. This package includes a GUI that will start the server and a browser. It only works on Mac OS.

  2. Use the Docker image. Just run docker run lowerquality/gentle. This works anywhere Docker is installed.

  3. Download the source code and run ./install.sh. Then run python gentle.py to start the server. This works on Mac and Linux.

Using Gentle

By default, the aligner listens at http://localhost:8765. That page has a graphical interface for transcribing, viewing, and downloading audio.

There is also a REST API so you can use Gentle in your programs. Here's an example of how to use the API with CURL:

curl -F "audio=@audio.mp3" -F "transcript=@words.txt" "http://localhost:8765/transcriptions?async=false"

Packages

No packages published

Languages

  • Python 62.2%
  • C++ 21.4%
  • HTML 11.2%
  • Shell 4.3%
  • Makefile 0.9%