@inproceedings{tachibana2018efficiently,
author={Tachibana, H. and Uenoyama, K. and Aihara, S.},
title={Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention},
booktitle={2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
publisher={IEEE},
month={Apr},
year={2018},
doi={10.1109/ICASSP.2018.8461829}
}
Synthesized audio (15 hours training).
Note: the words “ICASSP” and “acoustics” are not included in our training data.
training time | synthesized audio |
---|---|
2 hours | |
7 hours | |
15 hours | |
40 hours | |
Note: This famous quote is not included in the training data.
training time | synthesized audio |
---|---|
2 hours | |
7 hours | |
15 hours | |
40 hours | |