Infoscience

conference paper

Speech recognition with speech synthesis models by marginalising over decision tree leaves

Dines, John • Saheer, Lakshmi • Liang, Hui
2009
Interspeech 2009
Proceedings of Interspeech

There has been increasing interest in the use of unsupervised adaptation for the personalisation of text-to-speech (TTS) voices, particularly in the context of speech-to-speech translation. This requires that we are able to generate adaptation transforms from the output of an automatic speech recognition (ASR) system. An approach that utilises unified ASR and TTS models would seem to offer an ideal mechanism for the application of unsupervised adaptation to TTS since transforms could be shared between ASR and TTS. Such unified models should use a common set of parameters. A major barrier to such parameter sharing is the use of differing contexts in ASR and TTS. In this paper we propose a simple approach that generates ASR models from a trained set of TTS models by marginalising over the TTS contexts that are not used by ASR. We present preliminary results of our proposed method on a large vocabulary speech recognition task and provide insights into future directions of this work.
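The marginalisation idea in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes one-dimensional Gaussian leaves, a hypothetical leaf record with the fields `asr_context`, `tts_context`, `occupancy`, `mean` and `var`, and mixture weights proportional to leaf occupancy; all of these names and the weighting scheme are assumptions made for illustration. Each TTS leaf that shares an ASR context becomes one component of a Gaussian mixture for that context, so the TTS-only context is summed out.

```python
import math
from collections import defaultdict


def marginalise_leaves(leaves):
    """Group TTS leaves by ASR context and return one mixture per ASR context.

    Each mixture is a list of (weight, mean, var) tuples. Weights are the
    leaf occupancies normalised within the ASR context (an assumption).
    """
    grouped = defaultdict(list)
    for leaf in leaves:
        grouped[leaf["asr_context"]].append(leaf)

    mixtures = {}
    for ctx, members in grouped.items():
        total = sum(m["occupancy"] for m in members)
        mixtures[ctx] = [
            (m["occupancy"] / total, m["mean"], m["var"]) for m in members
        ]
    return mixtures


def gaussian_pdf(x, mean, var):
    """Density of a one-dimensional Gaussian at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2 * math.pi * var)


def mixture_likelihood(x, mixture):
    """Likelihood of x under the marginalised model: sum_k w_k * N(x; mean_k, var_k)."""
    return sum(w * gaussian_pdf(x, mean, var) for w, mean, var in mixture)


# Hypothetical toy leaves: the TTS context (stress) is not used by ASR.
leaves = [
    {"asr_context": "a-b+c", "tts_context": "stressed",
     "occupancy": 30.0, "mean": 0.0, "var": 1.0},
    {"asr_context": "a-b+c", "tts_context": "unstressed",
     "occupancy": 10.0, "mean": 2.0, "var": 1.0},
    {"asr_context": "x-y+z", "tts_context": "stressed",
     "occupancy": 5.0, "mean": -1.0, "var": 0.5},
]

asr_models = marginalise_leaves(leaves)
```

In this toy setup, `asr_models["a-b+c"]` is a two-component mixture built from the stressed and unstressed leaves, and `mixture_likelihood(0.0, asr_models["a-b+c"])` evaluates an observation against it without reference to the TTS-only context.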

Type: conference paper
DOI: 10.21437/Interspeech.2009-430
Author(s): Dines, John • Saheer, Lakshmi • Liang, Hui
Date Issued: 2009
Published in: Interspeech 2009
Start page: 1395
End page: 1398
Subjects: speech recognition • decision trees • unified models • speech synthesis
URL: http://publications.idiap.ch/downloads/papers/2009/Dines_INTERSPEECH-2_2009.pdf
Related documents: http://publications.idiap.ch/index.php/publications/showcite/Dines_Idiap-RR-17-2009
Written at: EPFL
EPFL units: LIDIAP
Event name: Proceedings of Interspeech
Event place: Brighton, U.K.
Available on Infoscience: February 11, 2010
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/46704

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved