
alexhanbing/recognizer

 
 


Introduction

Recognize Speakers and Publish Results

Message Structure

| Message | Topic | Payload |
|---|---|---|
| recognize | ais/recognize/voice/+device_id | bytes of voice |
| set name | ais/recognize/setname/+device_id | "audio_path=name" |
| recognition result | ais/recognize/result/+device_id/+audio_path | "speaker_name" or "unknown" |
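The topic and payload layout above could be handled with a few small helpers. This is only a sketch, not the repository's code: the function names are hypothetical, and it assumes that "+device_id" stands for a single topic level carrying the device ID (so a subscriber can match it with the MQTT "+" wildcard) and that audio_path is a plain file name without "/".

```python
# Hypothetical helpers; none of these names come from the repository.
# Assumption: "+device_id" is one topic level, matched by the "+" wildcard.
VOICE_FILTER = "ais/recognize/voice/+"
SETNAME_FILTER = "ais/recognize/setname/+"

_INBOUND_PREFIXES = ("ais/recognize/voice", "ais/recognize/setname")


def result_topic(device_id, audio_path):
    """Build the topic on which a recognition result is published."""
    # Assumption: audio_path contains no "/" and so fits in one topic level.
    return "ais/recognize/result/{}/{}".format(device_id, audio_path)


def device_id_from_topic(topic):
    """Extract the device ID (last level) from a voice or setname topic."""
    prefix, _, device_id = topic.rpartition("/")
    if prefix not in _INBOUND_PREFIXES or not device_id:
        raise ValueError("unexpected topic: {!r}".format(topic))
    return device_id


def parse_setname_payload(payload):
    """Split an "audio_path=name" payload into (audio_path, name)."""
    text = payload.decode("utf-8") if isinstance(payload, bytes) else payload
    # Split on the first "=" only, so the name itself may contain "=".
    audio_path, sep, name = text.partition("=")
    if not sep or not audio_path or not name.strip():
        raise ValueError("expected 'audio_path=name', got {!r}".format(text))
    return audio_path, name
```

A subscriber would subscribe to VOICE_FILTER and SETNAME_FILTER, then publish "speaker_name" or "unknown" to result_topic(device_id, audio_path).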

Tasks

  • SurgeMQ-based MQTT broker
  • Test publishing and subscribing to messages
  • Run recognition in a new thread, using the voice file specified in the message payload
  • Train the speaker model with a given voice sample
  • Test recognition with the trained model
  • Voice transfer in the MQTT payload (to try)
    • Receive the bytearray sent by mock-speaker and save it to a file
    • Pass the file to GStreamer (required by the voiceid implementation)
    • Receive the bytearray sent from the iOS client
  • Rewrite the method that adds or modifies a voice model
    • Merge the voice models of a single speaker
  • Post to the server
  • Handle whitespace in names
  • Message retain
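Three of the tasks above (saving a received bytearray to a file, running recognition in a new thread, and handling whitespace in names) can be illustrated without any MQTT or voiceid dependency. The sketch below is an assumption-laden illustration rather than the project's implementation: all function names are hypothetical, the recognizer is passed in as a plain callable, and replacing whitespace with underscores is just one possible normalization policy.

```python
import os
import threading


def save_voice(payload, directory, filename):
    """Write the received voice bytes to a file and return its path."""
    path = os.path.join(directory, filename)
    with open(path, "wb") as f:
        f.write(bytes(payload))  # accepts both bytes and bytearray
    return path


def normalize_name(name):
    """Trim the name and join its whitespace-separated parts with "_".

    Assumption: underscores are an acceptable stand-in for spaces; the
    project may handle whitespace differently.
    """
    return "_".join(name.split())


def recognize_async(path, recognize, on_result):
    """Run recognize(path) in a new thread and report the outcome.

    recognize is a hypothetical callable returning a speaker name or None;
    on_result(path, speaker) receives "unknown" when nothing is recognized
    or when the recognizer raises.
    """
    def worker():
        try:
            speaker = recognize(path)
        except Exception:
            speaker = None
        on_result(path, speaker or "unknown")

    thread = threading.Thread(target=worker)
    thread.start()
    return thread
```

Passing the recognizer as an argument keeps the threading logic testable with a stub; in the real service it would wrap whatever voiceid call performs the match, and on_result would publish to the result topic.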

