BC-Hacks 2021 Submission

About

This idea stemmed from the recent popularity of deepfakes. The goal of the project was to apply this type of technology in a beneficial way rather than in the harmful ways it could potentially be used. The tool lets the user process a video clip together with an audio or text file in another language and lip-sync the video to that language.

This could be used in a real-life situation where a political figure is delivering a speech, converting it to any native language for a wider reach, or to fix poor audio dubbing in movies and TV. Example here.

If a translated audio source is not available, the user can instead provide a .txt file of the translated text and specify the language to generate an artificial voice. Although the artificial voice is not ideal, with enough data it could be trained to mimic the original speaker. Example of generated voice here.

Since this project was limited to 24 hours, I was only able to generate clips shorter than 5 seconds as examples.

Disclaimer

This tool is for research purposes only. Please see the following dependencies for information on licensing:

Usage

Dependencies

Python 3.6
ffmpeg
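
Before running the setup, it can help to confirm that both dependencies are visible from your environment. The following is a minimal sketch and not part of the repository: the function name `check_dependencies` is hypothetical, and it assumes that any Python version from 3.6 upward is acceptable, which the project itself does not state.

```python
import shutil
import sys


def check_dependencies(min_python=(3, 6), tools=("ffmpeg",)):
    """Return a list of problems found; an empty list means all checks passed."""
    problems = []
    # Assumption: a minimum version check is enough; the README only names 3.6.
    if sys.version_info < min_python:
        wanted = ".".join(str(part) for part in min_python)
        problems.append("Python {} or newer is required".format(wanted))
    # shutil.which returns None when the executable is not on PATH.
    for tool in tools:
        if shutil.which(tool) is None:
            problems.append("{} not found on PATH".format(tool))
    return problems


if __name__ == "__main__":
    for problem in check_dependencies():
        print(problem)
```

If the script prints nothing, both checks passed.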

Getting started

This will take up a fair amount of space on your machine, since it automatically downloads multiple pretrained models.

Run python init.py
Download these weights and place wav2lip.pth in ./dependencies/Wav2Lip/checkpoints/
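
A misplaced weights file is an easy mistake to make at this step, so a quick check may save a failed run later. This is a sketch under the assumption that the path above is relative to the repository root; the helper name `weights_in_place` is hypothetical and does not exist in the project.

```python
from pathlib import Path

# Location taken from the setup step above, relative to the repository root.
CHECKPOINT = Path("dependencies") / "Wav2Lip" / "checkpoints" / "wav2lip.pth"


def weights_in_place(repo_root="."):
    """Return True if wav2lip.pth exists under repo_root and is not empty."""
    path = Path(repo_root) / CHECKPOINT
    return path.is_file() and path.stat().st_size > 0


if __name__ == "__main__":
    if weights_in_place():
        print("Found", CHECKPOINT)
    else:
        print("Missing or empty:", CHECKPOINT)
```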

Process a Video

This can take quite a bit of time. With an integrated graphics chip, it took me almost an hour to process a 5-second clip.

To process a clip with a pre-existing audio clip, run:

    python process.py --video <video-path> --audio <audio-path>

To process a clip with a translated .txt file, run:

    python process.py --video <video-path> --text <txt-path>

Results will be saved to the results folder.
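
The two commands above share a `--video` flag and differ only in the second input. One way such an interface could be parsed is sketched below with `argparse`. This is an illustration, not the actual contents of process.py: the function name `build_parser` is hypothetical, and treating `--audio` and `--text` as mutually exclusive is an assumption drawn from the two usage lines.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the two documented invocations."""
    parser = argparse.ArgumentParser(prog="process.py")
    parser.add_argument("--video", required=True,
                        help="path to the source video clip")
    # Assumption: exactly one of --audio or --text must be supplied.
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--audio",
                        help="path to a pre-existing translated audio clip")
    source.add_argument("--text",
                        help="path to a .txt file with the translated text")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    mode = "audio" if args.audio else "text"
    print("video:", args.video, "mode:", mode)
```

With this layout, passing both `--audio` and `--text`, or neither, makes `argparse` exit with a usage error instead of starting a long processing run.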

omurovec/Video-Speech-Translator
