Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Multilingual bottleneck features for subword modeling in zero-resource languages
 
conference paper

Multilingual bottleneck features for subword modeling in zero-resource languages

Hermann, Enno
•
Goldwater, Sharon
2018
Interspeech 2018
Interspeech 2018

How can we effectively develop speech technology for languages where no transcribed data is available? Many existing approaches use no annotated resources at all, yet it makes sense to leverage information from large annotated corpora in other languages, for example in the form of multilingual bottleneck features (BNFs) obtained from a supervised speech recognition system. In this work, we evaluate the benefits of BNFs for subword modeling (feature extraction) in six unseen languages on a word discrimination task. First we establish a strong unsupervised baseline by combining two existing methods: vocal tract length normalisation (VTLN) and the correspondence autoencoder (cAE). We then show that BNFs trained on a single language already beat this baseline; including up to 10 languages results in additional improvements which cannot be matched by just adding more data from a single language. Finally, we show that the cAE can improve further on the BNFs if high-quality same-word pairs are available.

  • Details
  • Metrics
Type
conference paper
DOI
10.21437/Interspeech.2018-2334
Author(s)
Hermann, Enno
Goldwater, Sharon
Date Issued

2018

Published in
Interspeech 2018
Start page

2668

End page

2672

Subjects

multilingual bottleneck features

•

subword modeling

•

unsupervised feature extraction

•

zero-resource speech technology

URL

Related documents

http://publications.idiap.ch/downloads/papers/2018/Hermann_INTERSPEECH_2018.pdf
Written at

EPFL

EPFL units
LIDIAP  
Event name
Interspeech 2018
Available on Infoscience
December 28, 2018
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/153239
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés