A Study on Controlling Emotional Intensity in Emotional Speech Synthesis without Linguistic Information Using WaveNet

Authors: Kento Matsumoto (松本剣斗), Sunao Hara (原直), Masanobu Abe (阿部匡伸) (Okayama University)

This page presents speech samples generated by our proposed method.

Angry


α=0.5
α=0.4
α=0.3
α=0.2
α=0.1
α=0.0

Happy


α=0.5
α=0.4
α=0.3
α=0.2
α=0.1
α=0.0


MOS = 4.42 (α=0.0, best)
MOS = 2.00 (α=0.5, worst)

Contact

k_matsu＠s.okayama-u.ac.jp (＠->@)