DNN-based Speech Synthesis: Importance of input features and training data

Lazaridis, Alexandros; Potard, Blaise; Garner, Philip N.

doi:10.1007/978-3-319-23132-7_24

Lazaridis, Alexandros; Potard, Blaise; Garner, Philip N.

2015

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Deep neural networks (DNNs) have been recently introduced in speech synthesis. In this paper, an investigation on the importance of input features and training data on speaker dependent (SD) DNN-based speech synthesis is presented. Various aspects of the training procedure of DNNs are investigated in this work. Additionally, several training sets of different size (i.e., 13.5, 3.6 and 1.5 h of speech) are evaluated.

Details

Title DNN-based Speech Synthesis: Importance of input features and training data

Author(s) Lazaridis, Alexandros ; Potard, Blaise ; Garner, Philip N.

Published in Speech and Computer

Editor(s)

Ronzhin, A. ; Potapova, R. ; Fakotakis, N.

Series Lecture Notes in Computer Science, 9319

Pages 193-200

Date 2015

Publisher Springer Berlin Heidelberg

ISBN 978-3-319-23131-0

DOI https://doi.org/10.1007/978-3-319-23132-7_24

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Book chapters
Published

Record creation date 2015-06-19

Abstract

Details

Actions