Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens

Oertel, Catharine; Lopes, José David; Yu, Yu; Funes Mora, Kenneth Alberto; Gustafson, Joakim; Black, Alan; Odobez, Jean-Marc

doi:10.1145/2993148.2993188

2016

Abstract

Current dialogue systems typically lack variety in audio-visual feedback tokens. They either do not include feedback tokens at all or support only a limited set of stereotypical functions. However, this does not mirror the subtleties of spontaneous conversations. If we want to be able to build an artificial listener, as a first step towards building an empathetic artificial agent, we also need to be able to synthesize more subtle audio-visual feedback tokens. In this study, we devised an array of monomodal and multimodal binary comparison perception tests and experiments to understand how different realisations of verbal and visual feedback tokens influence third-party perception of the degree of attentiveness. This allowed us to investigate i) which features (amplitude, frequency, duration...) of the visual feedback influence attentiveness perception; ii) whether visual or verbal backchannels are perceived to be more attentive; iii) whether the fusion of unimodal tokens with low perceived attentiveness increases the degree of perceived attentiveness compared to unimodal tokens with high perceived attentiveness taken alone; iv) the automatic ranking of audio-visual feedback tokens in terms of conveyed degree of attentiveness.

Details

Title Towards building an attentive artificial listener: on the perception of attentiveness in audio-visual feedback tokens

Author(s) Oertel, Catharine ; Lopes, José David ; Yu, Yu ; Funes Mora, Kenneth Alberto ; Gustafson, Joakim ; Black, Alan ; Odobez, Jean-Marc

Published in ICMI'16: Proceedings of the 18th ACM International Conference on Multimodal Interaction

Pagination 8

Pages 21-28

Conference Proceedings of the 18th ACM International Conference on Multimodal Interaction, Tokyo, Japan

Date 2016

Publisher New York, ACM

ISBN 978-1-4503-4556-9

Keywords

backchannels; head nods; virtual agents

DOI https://doi.org/10.1145/2993148.2993188

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2016-12-19
