GitHub - bossjones/auto-caption: Produce captions for videos using PocketSphinx speech recognition

This project contains two scripts that use PocketSphinx to produce captions from a video file using speech recognition. The dependencies are a bit tricky, a Dockerfile is provided to produce a working environment. Specifically, the script currently relies on an unlanded patch to the PocketSphinx Gstreamer plugin.

caption.py takes a media file and generates a caption file. You can test this script with the pre-built docker image luser/auto-caption:0.2, for example:

docker run -t luser/auto-caption:0.1 ./run.sh https://people.mozilla.org/~tmielczarek/test-long.wav

Will produce captions on stdout.

adapt-from-captions.py takes a media file, a manually corrected captions file, and a PocketSphinx acoustic model, and adapts the model by feeding it the matched input audio and corrected text. It will output updated-model.tar.gz in the working directory if it succeeds.

Any copyright is dedicated to the Public Domain. http://creativecommons.org/publicdomain/zero/1.0/

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
adapt-from-captions.py		adapt-from-captions.py
caption.py		caption.py
download-pocketsphinx-lm.sh		download-pocketsphinx-lm.sh
install-pocketsphinx.sh		install-pocketsphinx.sh
run.sh		run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

adapt-from-captions.py

adapt-from-captions.py

caption.py

caption.py

download-pocketsphinx-lm.sh

download-pocketsphinx-lm.sh

install-pocketsphinx.sh

install-pocketsphinx.sh

run.sh

run.sh

Repository files navigation

About

Releases

Packages

Languages

License

bossjones/auto-caption

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Languages