Skip to content

mlcommons/peoples-speech

Repository files navigation

People's Speech Data Pipelines

Installation

# libprotobuf-dev is an onnx dependency, transitively brought in by nemo.
sudo apt-get install git-lfs sox ffmpeg
# Set up a virtual environment of some sort
pip install numpy Cython
python setup.py develop
cp galvasr2/*.jar $(python -c "import pyspark; print(pyspark.__path__[0])")/jars

Run forced alignment pipeline.

python galvasr2/align/spark/align_cuda_decoder.py --stage=0