Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis
 
conference paper

SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis

Lazaridis, Alexandros
•
Honnet, Pierre-Edouard
•
Garner, Philip N.
2014
7th International Conference on Speech Prosody
Speech Prosody

In this paper we investigate external phone duration models (PDMs) for improving the quality of synthetic speech in hidden Markov model (HMM)-based speech synthesis. Support Vector Regression (SVR) and Multilayer Perceptron (MLP) were used for this task. SVR and MLP PDMs were compared with the explicit duration modelling of hidden semi-Markov models (HSMMs). Experiments done on an American English database showed the SVR outperforming the MLP and HSMM duration modelling on objective and subjective evaluation. In the objective test, SVR managed to outperform MLP and HSMM models achieving 15.3% and 25.09% relative improvement in terms of root mean square error (RMSE) respectively. Moreover, in the subjective evaluation test, on synthesized speech, the SVR model was preferred over the MLP and HSMMmodels, achieving a preference score of 35.93% and 56.30%, respectively.

  • Details
  • Metrics
Type
conference paper
DOI
10.21437/SpeechProsody.2014-198
Author(s)
Lazaridis, Alexandros
Honnet, Pierre-Edouard
Garner, Philip N.
Date Issued

2014

Published in
7th International Conference on Speech Prosody
Start page

1047

End page

1051

Subjects

HMM-based speech synthesis

•

HSMM explicit duration modelling

•

multilayer perceptron

•

phone duration modelling

•

Support Vector Regression

URL

Related documents

http://publications.idiap.ch/index.php/publications/showcite/Lazaridis_Idiap-RR-03-2014
Written at

EPFL

EPFL units
LIDIAP  
Event name
Speech Prosody
Available on Infoscience
April 19, 2014
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/102907
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés