Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. A Comparison of Methods for OOV-Word Recognition on a New Public Dataset
 
conference paper

A Comparison of Methods for OOV-Word Recognition on a New Public Dataset

Braun, Rudolf
•
Madikeri, Srikanth
•
Motlicek, Petr  
2021
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
IEEE International Conference on Acoustics, Speech and Signal Processing

A common problem for automatic speech recognition systems is how to recognize words that they did not see during training. Currently there is no established method of evaluating different techniques for tackling this problem. We propose using the CommonVoice dataset to create test sets for multiple languages which have a high out-of-vocabulary (OOV) ratio relative to a training set and release a new tool for calculating relevant performance metrics. We then evaluate, within the context of a hybrid ASR system, how much better subword models are at recognizing OOVs, and how much benefit one can get from incorporating OOV-word information into an existing system by modify ing WFSTs. Additionally, we propose a new method for modifying a subword-based language model so as to better recognize OOV-words. We showcase very large improvements in OOV-word recognition and make both the data and code available.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP39728.2021.9415124
Author(s)
Braun, Rudolf
Madikeri, Srikanth
Motlicek, Petr  
Date Issued

2021

Publisher

IEEE Signal Processing Society

Published in
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
Start page

5979

End page

5983

URL

Link to IDIAP database

http://publications.idiap.ch/downloads/papers/2021/Braun_ICASSP2021_2021.pdf
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event name
IEEE International Conference on Acoustics, Speech and Signal Processing
Available on Infoscience
April 13, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/177311
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés