
Abstract

We describe a speechreading (lipreading) system based purely on visual features extracted from grey-level image sequences of the speakers' lips. Active shape models are used to track the lip contours, and visual speech information is extracted from the shape of the contours. The distribution and temporal dependencies of the shape features are modelled by continuous density Hidden Markov Models. Experiments are reported for speaker-independent recognition tests of isolated digits. The analysis of individual feature components suggests that speech-relevant information is embedded in a low-dimensional space and is fairly robust to inter- and intra-speaker variability.
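To make the recognition pipeline concrete, here is a minimal sketch of isolated-digit classification with one continuous density Gaussian HMM per digit, trained on per-frame shape-feature vectors and scored by log-likelihood at test time. This is an illustrative assumption, not the paper's implementation: the hmmlearn library, the synthetic stand-in for ASM shape coefficients, and all parameter choices (five states, diagonal covariances) are hypothetical.

```python
# Sketch: one Gaussian HMM per digit class; classify a sequence by the
# model that assigns it the highest log-likelihood. Assumes hmmlearn.
import numpy as np
from hmmlearn.hmm import GaussianHMM

rng = np.random.default_rng(0)
N_DIGITS, N_FEATURES = 10, 6  # hypothetical: a few shape-model coefficients per frame

def fake_sequences(digit, n_seqs=20):
    """Hypothetical stand-in for per-frame lip-shape feature sequences."""
    return [rng.normal(loc=digit, scale=1.0,
                       size=(rng.integers(20, 40), N_FEATURES))
            for _ in range(n_seqs)]

# Train one continuous density HMM per digit on that digit's sequences.
models = {}
for digit in range(N_DIGITS):
    seqs = fake_sequences(digit)
    X = np.concatenate(seqs)          # hmmlearn expects stacked frames
    lengths = [len(s) for s in seqs]  # plus per-sequence lengths
    m = GaussianHMM(n_components=5, covariance_type="diag", n_iter=20)
    m.fit(X, lengths)
    models[digit] = m

def classify(seq):
    """Return the digit whose HMM best explains the feature sequence."""
    return max(models, key=lambda d: models[d].score(seq))

print(classify(fake_sequences(3, n_seqs=1)[0]))  # expect 3 on this toy data
```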
