Learning to recognise talking faces

Luettin, Juergen; Thacker, Neil A.; Beet, Steve W.

doi:10.1109/ICPR.1996.547233

Luettin, Juergen; Thacker, Neil A.; Beet, Steve W.

1996

Download

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

An approach for person identification is described based on spatio-temporal analysis of the talking face. A person is represented by a parametric model of the visible speech articulators and their temporal characteristics during speech production. The model consists of shape parameters, representing the lip contour and intensity parameters representing the grey level distribution in the mouth region. The model is used to track lips in image sequences where the model parameters are recovered from the tracking results. While some of these parameters relate to speech information, others are intuitively related to different persons and we show that models based on these features enable successful person identification. We model the shape and intensity parameters as mixtures of Gaussians and their temporal dependencies by Hidden Markov Models. Identifying a talking person is performed by estimating the likelihood of each model for having generated the observed sequence of features and the model with the highest likelihood is chosen as the identified person.

Details

Title Learning to recognise talking faces

Author(s) Luettin, Juergen ; Thacker, Neil A. ; Beet, Steve W.

Published in Proceedings of the International Conference on Pattern Recognition (ICPR'96)

Volume IV

Pages 55-59

Conference IAPR - Proceedings of the International Conference on Pattern Recognition (ICPR'96)

Date 1996

Keywords

vision

DOI https://doi.org/10.1109/ICPR.1996.547233

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL
Published

Record creation date 2006-03-10

Files

Abstract

Details

PDF