VOCODER Experiments

These are a series of projects that we are working on with an aim to building high quality speech synthesis systems.

Updates

In December 2017, we have tried experiments on upgrading our speech representation to WORLD.
As of March 2018, festvox voice building tools support WORLD as representation. Checkout our repo and notebook


In Summer 2018, we are working on upgrading our vocoder to WaveNet.



21 December 2017

FESTIVAL + WORLD + 6 layer DNN


AWB ARCTIC


ARCTIC A0029 ABS

ARCTIC A0029 TEST

29 December 2017

FESTIVAL + WORLD + 6 layer DNN


AWB ARCTIC


ARCTIC A0029 128 Tanh 0.2 Dropout SGD

ARCTIC A0029 128 Tanh 0.3 Dropout ADAM

ARCTIC A0029 200 Tanh 0.2 Dropout ADAM

ARCTIC A0029 512 Tanh 0.1 Dropout ADAM

ARCTIC A0029 512 Tanh 0.3 Dropout ADAM

ARCTIC A0029 1024 Tanh 0.1 Dropout ADAM

ARCTIC A0029 1024 Tanh 0.2 Dropout ADAM

ARCTIC A0029 1024 Tanh 0.3 Dropout ADAM

ARCTIC A0029 TEST (previous week)

05 January 2018

FESTIVAL + WORLD + 6 layer SELU DNN


AWB ARCTIC


ARCTIC A0029 512 SGD No Dropout No Context No Normalization

ARCTIC A0039 512 SGD No Dropout No Context No Normalization

12 January 2018

FESTIVAL + WORLD + 6 layer SELU DNN + Dynet


AWB ARCTIC


ARCTIC A0029 1024 SGD No Dropout No Context No Normalization

ARCTIC A0029 1024 SGD 0.2 Dropout No Context No Normalization

WAVENET EXPERIMENTS

06 June 2018

AWB ARCTIC


20K STEPS
40K STEPS
80K STEPS
160K STEPS

240K STEPS
260K STEPS