Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Improving acoustic based keyword spotting using LVCSR lattices
 
conference paper

Improving acoustic based keyword spotting using LVCSR lattices

Motlicek, Petr
•
Valente, Fabio
•
Szoke, Igor
2012
2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
IEEE International Conference on Acoustics, Speech and Signal Processing

This paper investigates detection of English keywords in a conversational scenario using a combination of acoustic and LVCSR based keyword spotting systems. Acoustic KWS systems search predefined words in parameterized spoken data. Corresponding confidences are represented by likelihood ratios given the keyword models and a background model. First, due to the especially high number of false-alarms, the acoustic KWS system is augmented with confidence measures estimated from corresponding LVCSR lattices. Then, various strategies to combine scores estimated by the acoustic and several LVCSR based KWS systems are explored. We show that a linear regression based combination significantly outperforms other (model-based) techniques. Due to that, the relative number of false-alarms of the combined KWS system decreased by more than 50% compared to the acoustic KWS system. Finally, an attention is also paid to the complexities of the KWS systems enabling them to potentially be exploited in real-detection tasks.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2012.6288898
Author(s)
Motlicek, Petr
Valente, Fabio
Szoke, Igor
Date Issued

2012

Published in
2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Start page

4413

End page

4416

URL

Related documents

http://publications.idiap.ch/index.php/publications/showcite/Motlicek_Idiap-RR-36-2012
Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent place
IEEE International Conference on Acoustics, Speech and Signal Processing

Kyoto, Japan

Available on Infoscience
December 19, 2013
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/98341
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés