Sound demos for "Neural Synthesis of Sound Effects Using Flow-Based Deep Generative Models"

We demonstrate sound effect variations synthesized by the models described in our paper.
Note that the example sounds used to condition the models are qualitatively different from the data on which the models were trained and evaluated in the paper.

All original sounds were obtained from Freesound (https://freesound.org/) and downsampled to 16 kHz when needed.
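
For readers who want to prepare their own conditioning sounds, the downsampling step can be sketched as follows. This is a minimal illustration, not the procedure used for these demos: the page does not state which resampler was used, and the function name `resample`, the Hann-windowed sinc kernel, and the `half_width` parameter are all assumptions chosen for clarity. In practice a library resampler would normally be used instead.

```python
import math

def resample(x, sr_in, sr_out, half_width=16):
    """Resample a mono signal with a Hann-windowed sinc kernel (assumed method).

    x          : list of float samples at rate sr_in
    half_width : kernel half-length, in zero crossings of the sinc
    """
    ratio = sr_in / sr_out                  # input samples per output sample
    cutoff = min(1.0, sr_out / sr_in)       # low-pass cutoff relative to input Nyquist
    support = half_width / cutoff           # kernel half-length in input samples
    n_out = int(len(x) * sr_out / sr_in)
    y = []
    for n in range(n_out):
        t = n * ratio                       # output instant, in input-sample units
        lo = max(0, math.ceil(t - support))
        hi = min(len(x) - 1, math.floor(t + support))
        acc = 0.0
        for k in range(lo, hi + 1):
            d = t - k
            arg = math.pi * cutoff * d
            sinc = 1.0 if d == 0 else math.sin(arg) / arg
            window = 0.5 * (1.0 + math.cos(math.pi * d / support))
            # The low-pass kernel suppresses content above the new Nyquist
            # frequency before the signal is sampled at the lower rate.
            acc += cutoff * sinc * window * x[k]
        y.append(acc)
    return y

# Usage: 0.1 s of a 440 Hz tone at 44.1 kHz, resampled to 16 kHz.
tone = [math.sin(2 * math.pi * 440 * n / 44100) for n in range(4410)]
tone_16k = resample(tone, 44100, 16000)
print(len(tone_16k))
```

The low-pass cutoff is tied to the target rate so that content above 8 kHz is attenuated rather than aliased into the 16 kHz signal.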


1. Examples of explosion sounds

Variations synthesized using mel spectrograms computed from explosion sounds.

1.1 Dimensionality of the mel spectrogram conditioner

Results from models conditioned on 10-band and 30-band mel spectrograms, both at 50k training iterations (10ch_50k and 30ch_50k).
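
To give a sense of what the number of mel bands means for the conditioner, the sketch below lists the center frequencies of mel-spaced bands at a 16 kHz sample rate. It assumes the common HTK-style mel formula and bands spanning 0 Hz to the Nyquist frequency; the paper may use a different mel convention or frequency range, and the function names here are illustrative only.

```python
import math

def hz_to_mel(f_hz):
    """HTK-style mel scale (assumed convention)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_band_centers(n_bands, sample_rate=16000, f_min=0.0):
    """Center frequencies (Hz) of n_bands filters spaced evenly on the mel scale."""
    f_max = sample_rate / 2.0
    m_min, m_max = hz_to_mel(f_min), hz_to_mel(f_max)
    # n_bands triangular filters need n_bands + 2 edge points;
    # the centers are the interior points.
    step = (m_max - m_min) / (n_bands + 1)
    return [mel_to_hz(m_min + step * (i + 1)) for i in range(n_bands)]

# Compare the frequency resolution of the 10-, 20- and 30-band conditioners.
for n in (10, 20, 30):
    centers = mel_band_centers(n)
    spacing = centers[1] - centers[0]
    print(f"{n} bands: lowest center {centers[0]:.0f} Hz, "
          f"first spacing {spacing:.0f} Hz, highest center {centers[-1]:.0f} Hz")
```

With fewer bands, each band covers a wider frequency range, so the conditioner describes the spectral envelope more coarsely.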

Blast [1]
Model 10ch_50k       Model 30ch_50k      

Explosion [2]
Model 10ch_50k       Model 30ch_50k      

Large explosion [3]
Model 10ch_50k       Model 30ch_50k      

Guns explosion [4]
Model 10ch_50k       Model 30ch_50k      


1.2 Training iterations

Results from a model conditioned on 20-band mel spectrograms at different numbers of training iterations (20ch_10k, 20ch_50k, 20ch_200k).

Blast [1]
20ch_10k       20ch_50k       20ch_200k      

Explosion [2]
20ch_10k       20ch_50k       20ch_200k      

Large explosion [3]
20ch_10k       20ch_50k       20ch_200k      

Guns explosion [4]
20ch_10k       20ch_50k       20ch_200k      


1.3 Examples of post-processing strategies

Demonstration of results from 20ch_50k using different post-processing strategies. We refer to the paper for more details.

Blast [1]
Unprocessed       20ch_50k       Ultraprocessed      

Explosion [2]
Unprocessed       20ch_50k       Ultraprocessed      



2. Examples of style transfer

Variations synthesized by the model 20ch_50k from mel spectrograms computed from non-explosion sounds.

Piano [5]
Original       Run 1       Run 2      

Water splash [6]
Original       Run 1       Run 2      

Guitar [7]
Original       Run 1       Run 2      

Timpani [8]
Original       Run 1       Run 2      


Attributions

Blast sound [1] by: Freesound user "Benboncan", licensed under CC BY 4.0 https://freesound.org/people/Benboncan/sounds/73005/
Explosion sound [2] by: Freesound user "Quaker540", licensed under CC0 1.0 https://freesound.org/people/Quaker540/sounds/245372/
Large explosion sound [3] by: Freesound user "TheSoundFXGuy", licensed under CC BY 3.0 https://freesound.org/people/TheSoundFXGuy_YT/sounds/534217/
Guns explosion sound [4] by: Freesound user "OGsoundFX", licensed under CC BY 3.0 https://freesound.org/people/OGsoundFX/sounds/423111/
Piano sound [5] by: Freesound user "jobro", licensed under CC BY 3.0 https://freesound.org/people/jobro/sounds/39164/
Water splash sound [6] by: Freesound user "qubodup", licensed under CC0 1.0 https://freesound.org/people/qubodup/sounds/442773/
Guitar sound [7] by: Freesound user "tosha73", licensed under CC0 4.0 https://freesound.org/people/tosha73/sounds/533847/
Timpani sound [8] by: Freesound user "Sorinious_Genious", licensed under CC0 1.0 https://freesound.org/people/Sorinious_Genious/sounds/573242/

Licenses

CC0 1.0 https://creativecommons.org/publicdomain/zero/1.0/
CC BY 3.0 https://creativecommons.org/licenses/by/3.0/
CC BY 4.0 https://creativecommons.org/licenses/by/4.0/