Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features
 
conference paper

Cross-lingual Automatic Speech Recognition Exploiting Articulatory Features

Zhan, Qingran
•
Motlicek, Petr
•
Du, Shixuan
Show more
2019
Proceedings of APSIPA ASC 2019
2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Articulatory features (AFs) provide language-independent attribute by exploiting the speech production knowl-edge. This paper proposes a cross-lingual automatic speechrecognition (ASR) based on AF methods. Various neural network(NN) architectures are explored to extract cross-lingual AFs andtheir performance is studied. The architectures include muti-layer perception(MLP), convolutional NN (CNN) and long short-term memory recurrent NN (LSTM). In our cross-lingual setup,only the source language (English, representing a well-resourcedlanguage) is used to train the AF extractors. AFs are thengenerated for the target language (Mandarin, representing anunder-resourced language) using the trained extractors. Theframe-classification accuracy indicates that the LSTM has anability to perform a knowledge transfer through the robust cross-lingual AFs from well-resourced to under-resourced language.The final ASR system is built using traditional approaches(e.g. hybrid models), combining AFs with conventional MFCCs.The results demonstrate that the cross-lingual AFs improvethe performance in under-resourced ASR task even though thesource and target languages come from different language family.Overall, the proposed cross-lingual ASR approach provides slightimprovement over the monolingual LF-MMI and cross-lingual(acoustic model adaptation-based) ASR systems.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/APSIPAASC47483.2019.9023195
Author(s)
Zhan, Qingran
Motlicek, Petr
Du, Shixuan
Shan, Yahui
Xie, Xiang
Ma, Sifan
Date Issued

2019

Published in
Proceedings of APSIPA ASC 2019
Start page

1912

End page

1916

URL
http://www.apsipa.org/proceedings/2019/pdfs/429.pdf
Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent placeEvent date
2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)

Lanzhou, China

18-21 November 2019

Available on Infoscience
February 18, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/166365
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés