Skip to content

MaciejMarkiewicz/captionizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Captionizer

Captionizer enables to create transcriptions of lectures, online meetings or videos in the docx format. It can capture sounds from any audio input, or speakers output (stereo mix). 2 speech recognition engines are supported (Google and Azure).

Tests and accuracy

The app is intended to transcribe lectures and other mostly single-speaker content, so such examples were chosen for tests. The Azure engine performs much better than the Google one, so I will use it for the most part. Mistake ratio is described as: (wrong words + 0.5 * correct words, but in a wrong form) / all words. Punctuation accuracy is not measured, although it's worth noting, that only Azure engine provides it (it has a solid accuracy in English, and acceptable in Polish)

AZURE / ENG:

AZURE / PL

GOOGLE / ENG

GOOGLE / PL

Usage & UI

  • Install all dependencies - pip install -r reguirements.txt
  • Enable stereo mix in Windows -> right click on the sound icon on the taskbar -> sounds -> recording -> stereo mix -> enable
  • Get an api key for azure cognitive services (preferred) or google speech api (you can use the default google key, but it is very limited). Be aware that changing your default recording device might result in changing your current online meeting software default settings.
  • Run the app:

image info

  • Go to preferences and set the engine, key and language. Set stereo mix as the input device, if you want to transcribe the sounds you hear:

image info

  • Press record to start recording
  • Press "Finish recording" to stop:

image info

  • Wait until transcription process finishes:

image info

  • Transcription is stored to the specified path:

image info

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages