Automatic extraction of geometric lip features with application to multi-modal speaker identification

In this paper we consider the problem of automatic extraction of the geometric lip features for the purposes of multi-modal speaker identification. The use of visual information from the mouth region can be of great importance for improving the speaker identification system performance in noisy conditions. We propose a novel method for automated lip features extraction that utilizes color space transformation and a fuzzy-based c-means clustering technique. Using the obtained visual cues closed-set audio-visual speaker identification experiments are performed on the CUAVE database, [1] showing promising results.


Published in:
Proc. of the IEEE International Conference on Multimedia and Expo (ICME) 2006
Year:
2006
Publisher:
IEEE
Keywords:
Laboratories:




 Record created 2006-06-14, last modified 2018-03-17

n/a:
Download fulltext
PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)