Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Open-Vocabulary Keyword Spotting with Audio and Text Embeddings
 
conference paper

Open-Vocabulary Keyword Spotting with Audio and Text Embeddings

Sacchi, Niccolò
•
Nanchen, Alexandre
•
Jaggi, Martin  
Show more
2019
Interspeech 2019 - IEEE International Conference on Acoustics, Speech, and Signal Processing
INTERSPEECH 2019 - IEEE International Conference on Acoustics, Speech, and Signal Processing

Keyword Spotting (KWS) systems allow detecting a set of spoken (pre-defined) keywords. Open-vocabulary KWS systems search for the keywords in the set of word hypotheses generated by an automatic speech recognition (ASR) system which is computationally expensive and, therefore, often implemented as a cloud-based service. Besides, KWS systems could use also word classification algorithms that do not allow easily changing the set of words to be recognized, as the classes have to be defined a priori, even before training the system. In this paper, we propose the implementation of an open-vocabulary ASR-free KWS system based on speech and text encoders that allow matching the computed embeddings in order to spot whether a keyword has been uttered. This approach would allow choosing the set of keywords a posteriori while requiring low computational power. The experiments, performed on two different datasets, show that our method is competitive with other state of the art KWS systems while allowing for a flexibility of configuration and being computationally efficient.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

Sacchi_INTERSPEECH_2019.pdf

Access type

openaccess

License Condition

CC BY

Size

1.53 MB

Format

Adobe PDF

Checksum (MD5)

37d6e0278710e49adba4a3c08346b9a1

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés