Skip to content

pervrosen/FineMerge

 
 

Repository files navigation

Implementation of the FineMerge algorithm introduced in our paper Black-box Adaptation of ASR for Accented Speech

Setup

Clone the repository and run

pip install -r requirements.txt

Additionally for ctc beam decoding, install ctcdecode:

git clone --recursive https://github.com/parlance/ctcdecode.git
cd ctcdecode && pip install .

Apply FineMerge Algorithm

To use the FineMerge algorithm, first create a pickle file containing a list of dict elements where each element is of the form:

{ 'utt_id' : 'local_probs' : ..., 'service_transcript' : ..., 'word_confs' : ... }

Here, local_probs is the probability matrix output from local model, service_transcipt and word_confs refer to the transcript and correspoding word confidence values from the ASR service. The length of the service transcript and word confs must be equal.

Run python main.py --help on further instruction on how to use FineMerge algorithim.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%