Fusion of audio and video information for multi modal person authentication

Duc, Benoît; Bigün, Elizabeth Saers; Bigün, Josef; Maître, Gilbert; Fischer, Stefan

doi:10.1016/S0167-8655(97)00071-8

research article

Fusion of audio and video information for multi modal person authentication

Duc, Benoît

•

Bigün, Elizabeth Saers

•

Bigün, Josef

more

1997

Pattern Recognition Letters

We present an algorithm functioning as a supervisor module in a multi-expert decision making machine. It uses the Bayes theory in order to estimate the biases of individual expert opinions. The biases are used to calibrate and conciliate expert opinions to a single decision. This supervision technique is applied to the real case of a person authentication technique using two modalities, face and speech. The visual part involves the matching of a coarse grid containing Gabor phase information from face images. The acoustic part is performed by a text-dependent speaker verification system based on Hidden Markov Models. Experimental results show that the proposed fusion method improves the quality of individual expert decisions by reaching success rates of 99.5 %

Type

research article

DOI

10.1016/S0167-8655(97)00071-8

Web of Science ID

WOS:000071402300003

Authors

Duc, Benoît

•

Bigün, Elizabeth Saers

•

Bigün, Josef

•

Maître, Gilbert

•

Fischer, Stefan

Publication date

1997

Published in

Pattern Recognition Letters

Volume

18

Issue

9

Start page

835

End page

843

Subjects

vision

Peer reviewed

REVIEWED

EPFL units

LIDIAP

Available on Infoscience

March 10, 2006

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/227752