Serenade: A Singing Style Conversion Framework Based On Audio Infilling
Audio Samples
Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda
Nagoya University, Japan
All samples are available at
this Google Drive.
Source singing style: Breathy
System and Description |
to Falsetto |
to Mixed |
to Pharyngeal |
Target GT |
|
|
|
Reference Style |
|
|
|
NU-SVC |
|
|
|
Ablation 1 |
|
|
|
Ablation 2 |
|
|
|
Ablation 3 |
|
|
|
Serenade |
|
|
|
Serenade (SiFiGAN) |
|
|
|
Source singing style: Falsetto
System and Description |
to Breathy |
to Mixed |
to Pharyngeal |
Target GT |
|
|
|
Reference Style |
|
|
|
NU-SVC |
|
|
|
Ablation 1 |
|
|
|
Ablation 2 |
|
|
|
Ablation 3 |
|
|
|
Serenade |
|
|
|
Serenade (SiFiGAN) |
|
|
|
Source singing style: Mixed
System and Description |
to Breathy |
to Falsetto |
to Pharyngeal |
Target GT |
|
|
|
Reference Style |
|
|
|
NU-SVC |
|
|
|
Ablation 1 |
|
|
|
Ablation 2 |
|
|
|
Ablation 3 |
|
|
|
Serenade |
|
|
|
Serenade (SiFiGAN) |
|
|
|
Source singing style: Pharyngeal
System and Description |
to Breathy |
to Falsetto |
to Mixed |
Target GT |
|
|
|
Reference Style |
|
|
|
NU-SVC |
|
|
|
Ablation 1 |
|
|
|
Ablation 2 |
|
|
|
Ablation 3 |
|
|
|
Serenade |
|
|
|
Serenade (SiFiGAN) |
|
|
|