Visual Speech Recognition using Active Shape Models and Hidden Markov Models

Luettin, Juergen; Thacker, Neil A.; Beet, Steve W.

doi:10.1109/ICASSP.1996.543246

conference paper

Visual Speech Recognition using Active Shape Models and Hidden Markov Models

Luettin, Juergen

•

Thacker, Neil A.

•

Beet, Steve W.

1996

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'96)

This paper describes a novel approach for visual speech recognition. The shape of the mouth is modelled by an Active Shape Model which is derived from the statistics of a training set and used to locate, track and parameterise the speakerï¿½s lip movements. The extracted parameters representing the lip shape are modelled as continuous probability distributions and their temporal dependencies are modelled by Hidden Markov Models. We present recognition tests performed on a database of a broad variety of speakers and illumination conditions. The system achieved an accuracy of 85.42 % for a speaker independent recognition task of the first four digits using lip shape information only.

Name

mai96.pdf

Access type

openaccess

Size

59 KB

Format

Adobe PDF

Checksum (MD5)

466f5d2c8be55d531bfc562b1cf79654