Skip to content

jinwooklim/audi-gan

Repository files navigation

Technique that Effectively Preserves Audio Identity in Bidirectional Style Conversion Using GAN

This is the code implementation for project submitted as the partial requirement for the 'Machine Learning Application' course in Korea University.

Instruction

1. Make Folders

python mkdir.py

(Option) 2. Combine short length audio into one file

python compile.py --istrain --istest

3. Preprocess for train and evaluate

python gen_npy.py --iscut
python gen_npy.py --isnpy

4. Train model

python model.py

5. Evaluate test data

python target_inference.py

6. Make all result files one

python compile.py --mkresult

7. Get spectrograms

python gen_spectrograms.py

8. Euclidean Distance Test

python distance.py

Notes

Spectrograms

Figure 4. in Paper figure4

Euclidean distance test

Table 1. in Paper

DATASET MEAN STD
Train A & Test A 533.48 12.11
Train A & G_BA 510.24 11.22
Train A & Test A 558.32 8.67
Train A & G_AB 500.7 8.39

About

Final Term Project of 'Machine Learning Application' course at Korea University

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages