Infoscience

conference paper

Visual Speaker Localization Aided by Acoustic Models

Friedland, Gerald; Yeo, Chuohao; Hung, Hayley
2009
MM '09: Proceedings of the 17th ACM international conference on Multimedia
ACM Multimedia

This paper presents a novel audio-visual approach for unsupervised speaker localization. Using recordings from a single, low-resolution room overview camera and a single far-field microphone, a state-of-the-art audio-only speaker localization system (traditionally called speaker diarization) is extended so that both acoustic and visual models are estimated as part of a joint unsupervised optimization problem. The speaker diarization system first automatically determines the number of speakers and estimates "who spoke when"; then, in a second step, the visual models are used to infer the location of the speakers in the video. The experiments were performed on real-world meetings using 4.5 hours of the publicly available AMI meeting corpus. The proposed system exploits audio-visual integration not only to improve the accuracy of state-of-the-art (audio-only) speaker diarization, but also to add visual speaker localization at little incremental engineering and computational cost.
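
The two-step idea in the abstract (first "who spoke when", then where each speaker is in the video) can be illustrated with a toy sketch. This is not the authors' system, which estimates acoustic and visual models jointly. It assumes, hypothetically, that diarization output is available as labelled frame ranges and that the video has been reduced to a per-frame activity score for each image region; the function name, data layout, and scores are all invented for illustration.

```python
from collections import defaultdict


def locate_speakers(segments, activity):
    """Assign each speaker the video region most active while they speak.

    segments: list of (speaker_label, start_frame, end_frame) tuples with an
              exclusive end (hypothetical output of a diarization step).
    activity: list of per-frame lists; activity[t][r] is an activity score
              for region r at frame t (hypothetical visual feature).
    """
    # totals[speaker][region] accumulates activity over the speaker's frames.
    totals = defaultdict(lambda: defaultdict(float))
    for speaker, start, end in segments:
        # Clamp the segment to the frames that actually exist.
        for t in range(max(start, 0), min(end, len(activity))):
            for region, score in enumerate(activity[t]):
                totals[speaker][region] += score
    # Pick, for each speaker, the region with the largest accumulated score.
    return {spk: max(regions, key=regions.get) for spk, regions in totals.items()}


# Made-up example: two speakers, two image regions, six frames.
segments = [("spk0", 0, 3), ("spk1", 3, 6)]
activity = [
    [0.9, 0.1], [0.8, 0.2], [0.7, 0.1],  # region 0 active while spk0 talks
    [0.1, 0.6], [0.2, 0.9], [0.0, 0.7],  # region 1 active while spk1 talks
]
print(locate_speakers(segments, activity))
```

Accumulating activity per speaker and taking the maximum is the simplest possible association rule; it stands in for the visual models described in the abstract and makes no claim about how those models are actually built.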

Type: conference paper
DOI: 10.1145/1631272.1631301
Author(s): Friedland, Gerald; Yeo, Chuohao; Hung, Hayley
Date Issued: 2009
Published in: MM '09: Proceedings of the 17th ACM international conference on Multimedia
Start page: 195
End page: 202
Written at: EPFL
EPFL units: LIDIAP
Event name: ACM Multimedia
Available on Infoscience: February 11, 2010
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/46741
Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved