sh0sh1n/AutoViDevSpeech

 
 

AutoViDevSpeech

Welcome to the AutoViDevSpeech project. We look forward to having you!

Introduction

Many artificial intelligence (AI) researchers draw important insights from developmental science. Conversely, relatively few developmental researchers use AI and speech recognition tools to facilitate the study of behavioral development. Most developmental researchers lack expertise in AI and speech recognition, so the enormous potential of these tools to speed progress in video-based developmental research remains untapped. The current project addresses this gap by using state-of-the-art algorithms to automatically transcribe videos from developmental research. This tool is part of AutoViDev, an automatic video-analysis tool that uses machine learning to support video-based developmental research.

Prerequisites

  1. Python 3.6
  2. ffmpeg package
  3. SpeechRecognition package
  4. Any one of the following speech recognition engines or APIs:
    • CMU Sphinx
    • Google Speech Recognition
    • Google Cloud Speech API
    • Wit.ai
    • Microsoft Bing Voice Recognition
    • Houndify API
    • IBM Speech to Text
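
To show how the prerequisites above might fit together, here is a minimal sketch, not the project's actual code. It assumes ffmpeg is on the PATH and that the SpeechRecognition package is installed (plus pocketsphinx if the offline CMU Sphinx engine is used). The function names `build_ffmpeg_command` and `transcribe_video`, the file name `session.mp4`, and the choice of mono 16 kHz audio are illustrative assumptions.

```python
import subprocess
from pathlib import Path

# Engines this sketch knows how to call (an illustrative subset of the list above).
SUPPORTED_ENGINES = ("sphinx", "google")


def build_ffmpeg_command(video_path, wav_path):
    """Return the ffmpeg arguments that extract mono 16 kHz WAV audio from a video."""
    return [
        "ffmpeg", "-y",          # overwrite the output file if it already exists
        "-i", str(video_path),   # input video
        "-vn",                   # drop the video stream, keep audio only
        "-ac", "1",              # one audio channel (mono)
        "-ar", "16000",          # 16 kHz sample rate (an assumption, not a project setting)
        str(wav_path),
    ]


def transcribe_video(video_path, engine="sphinx"):
    """Extract the audio track of a video and return its transcript as text."""
    if engine not in SUPPORTED_ENGINES:
        raise ValueError("unsupported engine: {}".format(engine))

    # Imported here so the helper above can be used without the package installed.
    import speech_recognition as sr

    wav_path = Path(video_path).with_suffix(".wav")
    subprocess.run(build_ffmpeg_command(video_path, wav_path), check=True)

    recognizer = sr.Recognizer()
    with sr.AudioFile(str(wav_path)) as source:
        audio = recognizer.record(source)  # read the whole file into memory

    if engine == "sphinx":
        return recognizer.recognize_sphinx(audio)  # offline, needs pocketsphinx
    return recognizer.recognize_google(audio)      # online Google Web Speech API
```

Calling `transcribe_video("session.mp4")` would write `session.wav` next to the video and return the recognized text. The other services in the list have their own `recognize_*` methods on `Recognizer` and generally require credentials, so they are left out of this sketch.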
