Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Multimodal Speaker Localization from Omnidirectional Videos
 
conference paper

Multimodal Speaker Localization from Omnidirectional Videos

Reuse, Pascal
•
Gurban, Mihai
•
Austvoll, Ivar
Show more
2009
Proceedings of the 17th European Signal Processing Conference
17th European Signal Processing Conference

The use of omnidirectional cameras for videoconferencing promises to simplify the hardware setup necessary for large groups of participants. We investigate the use of a multimodal speaker detection algorithm on audio-visual sequences captured with such a camera, in particular, an algorithm that uses the audio energy together with the optical flow. We analyze several types of optical flow methods to determine the one which is appropriate to the omnidirectional context.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

omni.pdf

Access type

openaccess

Size

514.8 KB

Format

Adobe PDF

Checksum (MD5)

c621839fdebe6bf79f375e1a92133957

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés