Introducing Crossmodal Biometrics: Person Identification from Distinct Audio & Visual Streams
Person identification using audio or visual biometrics is a well-studied problem in pattern recognition. In this scenario, both training and testing are done on the same modalities. However, there can be situations where this condition is not valid, i.e. training and testing has to be done on different modalities. This could arise, for example, in covert surveillance. Is there any person specific information common to both the audio and visual (video-only) modalities which could be exploited to identify a person in such a constrained situation? In this work, we investigate this question in a principled way and propose a framework which can perform this task consistently better than chance, suggesting that such crossmodal biometric information exists.
Roy_Idiap-RR-29-2010.pdf
openaccess
468.8 KB
Adobe PDF
d5684ad7f58f14ac009277ffb2cbe1c0