Skip to content

tttslab/sup-mlt-demo

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning Demo

Overview

This repository contains demo codes for the paper Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning. It consists of the trained models, the python inference codes, and a simple frontend webpage as well as a backend nodejs server.

How To Use

Use the demo with the frontend webpage

  1. clone from https://github.com/Porridge144/sup-mlt-demo.git
  2. cd model_export and run python pyonnxrt.py (you might need to run it in background or in a tmux window as it is blocking)
  3. cd server and run node server.js (you can change the listening port to an arbitrary one in server.js)

Direct inference without using the frontend webpage

  1. clone from https://github.com/Porridge144/sup-mlt-demo.git
  2. put intended mp3/wav s into model_export/feat_extract/preprocdir/rawmp3
  3. cd model_export and run python pyonnxrt.py (you might need to run it in background or in a tmux window as it is blocking)
  4. cd model_export/feat_extract and run bash run.sh
  5. output will be saved in the server and also printed in the terminal which pyonnxrt.py is running

About Author

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.4%
  • Other 1.6%