Speaker identification by lipreading

Luettin, Juergen; Thacker, Neil A.; Beet, Steve W.

doi:10.21437/ICSLP.1996-16

1996

Abstract

This paper describes a new approach for speaker identification based on lipreading. Visual features are extracted from image sequences of the talking face and consist of shape parameters, which describe the lip boundary, and intensity parameters, which describe the grey-level distribution of the mouth area. Intensity information is based on principal component analysis using eigenspaces which deform with the shape model. The extracted parameters account for both speech-dependent and speaker-dependent information. We built spatio-temporal speaker models based on these features, using HMMs with mixtures of Gaussians. Promising results were obtained for text-dependent and text-independent speaker identification tests performed on a small video database.
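
As a rough illustration of how per-speaker HMMs can be used for identification, the sketch below scores a sequence of feature vectors against one model per speaker with the forward algorithm and picks the best-scoring speaker. It is not the authors' implementation. Several things here are assumptions made for brevity: each state uses a single diagonal Gaussian rather than a mixture of Gaussians, the maximum-likelihood decision rule is the conventional choice rather than something the abstract states, and the names `toy_model`, `identify_speaker`, `speaker_a`, and `speaker_b`, as well as all numeric values, are hypothetical stand-ins for trained models and real shape and intensity parameters.

```python
import math


def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) that tolerates -inf entries."""
    top = max(values)
    if top == -math.inf:
        return -math.inf
    return top + math.log(sum(math.exp(v - top) for v in values))


def sequence_log_likelihood(frames, model):
    """Forward algorithm in the log domain: log P(frames | model)."""
    n = len(model["log_start"])

    def emit(j, x):
        # Assumption: one Gaussian per state (the paper uses mixtures).
        return log_gauss(x, model["means"][j], model["vars"][j])

    alpha = [model["log_start"][j] + emit(j, frames[0]) for j in range(n)]
    for x in frames[1:]:
        alpha = [
            logsumexp([alpha[i] + model["log_trans"][i][j] for i in range(n)])
            + emit(j, x)
            for j in range(n)
        ]
    return logsumexp(alpha)


def identify_speaker(frames, models):
    """Return the speaker whose model gives the highest log-likelihood."""
    scores = {name: sequence_log_likelihood(frames, m) for name, m in models.items()}
    return max(scores, key=scores.get), scores


def toy_model(offset):
    """Hypothetical 2-state left-to-right HMM over 2-D features."""
    return {
        "log_start": [0.0, -math.inf],
        "log_trans": [[math.log(0.5), math.log(0.5)], [-math.inf, 0.0]],
        "means": [[offset, 0.0], [offset, 1.0]],
        "vars": [[1.0, 1.0], [1.0, 1.0]],
    }


# Hypothetical models and feature frames, for illustration only.
models = {"speaker_a": toy_model(0.0), "speaker_b": toy_model(3.0)}
frames = [[0.1, 0.0], [-0.2, 0.4], [0.0, 1.1]]
best, scores = identify_speaker(frames, models)
```

In a text-dependent test, every speaker would utter the same phrase and each model would be trained on that phrase; in a text-independent test, the test utterance differs from the training material, but the scoring step stays the same.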

Details

Title Speaker identification by lipreading

Author(s) Luettin, Juergen ; Thacker, Neil A. ; Beet, Steve W.

Published in Proceedings of the 4th International Conference on Spoken Language Processing (ICSLP'96)

Volume 1

Pages 62-65

Conference 4th International Conference on Spoken Language Processing (ICSLP'96)

Date 1996

Keywords vision

DOI https://doi.org/10.21437/ICSLP.1996-16

Laboratories LIDIAP

Record creation date 2006-03-10
