The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input)

Hermansky, Hynek; Fousek, Petr; Lehtonen, Mikko

doi:10.1007/11551874_2

2005

Abstract

A natural audio-visual interface between a human user and a machine requires that the machine understand the user's audio-visual commands. This does not necessarily require full speech and image recognition. It does require, just as interaction with any working animal does, that the machine be capable of reacting to certain particular sounds and/or gestures while ignoring the rest. To this end, we are working on sound identification and classification approaches that ignore most of the acoustic input and react only to a particular sound (the keyword).

Details

Title The Role of Speech in Multimodal Human-Computer Interaction (Towards Reliable Rejection of Non-Keyword Input)

Author(s) Hermansky, Hynek ; Fousek, Petr ; Lehtonen, Mikko

Published in TSD 2005: Text, Speech and Dialogue

Pages 2–8

Conference 8th International Conference on Text, Speech and Dialogue - TSD 2005, Karlovy Vary, Czech Republic, September 12-15, 2005

Date 2005

Keywords speech

Note IDIAP-RR 2005-63

DOI https://doi.org/10.1007/11551874_2

Laboratories LIDIAP

Record creation date 2006-03-10