Automatic extraction of geometric lip features with application to multi-modal speaker identification

Arsic, I.; Vilagut Abad, R.; Thiran, J.

doi:10.1109/ICME.2006.262594

conference paper

Automatic extraction of geometric lip features with application to multi-modal speaker identification

Arsic, I.

•

Vilagut Abad, R.

•

Thiran, J.

2006

Proceedings of the IEEE International Conference on Multimedia and Expo (ICME) 2006

In this paper we consider the problem of automatic extraction of the geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving the speaker identification system performance in noisy conditions. We propose a novel method for automated lip features extraction that utilizes color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues closed-set audio-visual speaker identification experiments are performed on the CUAVE database, [1] showing promising results.