Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Fast Approximate Spoken Term Detection from Sequence of Phonemes
 
conference paper not in proceedings

Fast Approximate Spoken Term Detection from Sequence of Phonemes

Pinto, Joel Praveen  
•
Szoke, Igor
•
Prasanna, S. R. Mahadeva
Show more
2008
Workshop on Searching Spontaneous Conversational Speech at SIGIR

We investigate the detection of spoken terms in conversational speech using phoneme recognition with the objective of achieving smaller index size as well as faster search speed. Speech is processed and indexed as a sequence of one best phoneme sequence. We propose the use of a probabilistic pronunciation model for the search term to compensate for the errors in the recognition of phonemes. This model is derived using the pronunciation of the word and the phoneme confusion matrix. Experiments are performed on the conversational telephone speech database distributed by NIST for the 2006 spoken term detection. We achieve about 1500 times smaller index size and 14 times faster search speed compared to the state-of-the-art system using phoneme lattice at the cost of relatively lower detection performance.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

pinto-SSCS-2008.pdf

Access type

openaccess

Size

119.79 KB

Format

Adobe PDF

Checksum (MD5)

4553c18a13af7625e34b7c9fe3902418

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés