def mock_model(tmpdir_factory):
    """Return a ``(KaldiInterface, model)`` pair with a trained model for tests.

    On the first run this builds the full pipeline — dataset import, pronunciation
    dictionary, model construction, and training — under a temp ``state`` dir.
    If the state directory already exists, the saved interface is reloaded and
    the existing model is reused instead of retraining.

    Args:
        tmpdir_factory: pytest ``tmpdir_factory`` fixture; provides the base
            temporary directory for the pipeline state.

    Returns:
        tuple: ``(kaldi, m)`` — the interface and the (possibly reloaded) model.
    """
    base_path = Path(tmpdir_factory.mktemp("pipeline"))
    # BUG FIX: the original used base_path.joinpath('/state'). An absolute
    # component makes pathlib discard base_path entirely, yielding '/state'
    # at the filesystem root — so the exists() check never looked at the
    # intended per-test state directory. Join with a relative component.
    state_path = base_path / 'state'
    if not state_path.exists():
        kaldi = KaldiInterface(str(state_path))
        # Dataset: import Elan-transcribed recordings.
        ds = kaldi.new_dataset('dataset_x')
        ds.add_directory('/recordings/transcribed')
        ds.select_importer('Elan')
        ds.process()
        # Pronunciation dictionary from letter-to-sound rules.
        pd = kaldi.new_pron_dict('pron_dict_y')
        pd.link(ds)
        pd.set_l2s_path('/recordings/letter_to_sound.txt')
        pd.generate_lexicon()
        # Model: link data + lexicon, build Kaldi file structure, train.
        m = kaldi.new_model('model_z')
        m.link(ds, pd)
        m.build_kaldi_structure()  # TODO: remove this line
        m.train()  # may take a while
    else:
        kaldi = KaldiInterface.load(str(state_path))
        m = kaldi.new_model('model_z', use_existing=True)
    return (kaldi, m)
# Build the dataset from transcribed recordings ("kaldi" is assumed to be
# an already-constructed KaldiInterface from earlier in the script).
dataset = kaldi.new_dataset('dsy')
dataset.add_directory('/recordings/transcribed', filter=['eaf', 'wav'])
dataset.process()

# Step 2
# ======
# Generate the pronunciation dictionary from the letter-to-sound mapping.
pron_dict = kaldi.new_pron_dict('pd')
pron_dict.link(dataset)
pron_dict.set_l2s_path('/recordings/letter_to_sound.txt')
pron_dict.generate_lexicon()

# Step 3
# ======
# Wire the dataset and pronunciation dictionary into a fresh model, lay out
# the Kaldi directory structure, and run training.
model = kaldi.new_model('mx')
model.link(dataset, pron_dict)
model.build_kaldi_structure()
model.train()  # may take a while

# Step 4
# ======
# Transcribe previously unseen audio with the trained model.
transcription = kaldi.new_transcription('tx')
transcription.link(model)
with open('/recordings/untranscribed/audio.wav', 'rb') as faudio:
    transcription.prepare_audio(faudio)
# transcription.transcribe_align()
transcription.transcribe()
# print(transcription.elan().decode('utf-8'))
print(transcription.text().decode('utf-8'))