Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations
 
conference paper

Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations

Aran, Oya
•
Gatica-Perez, Daniel  
2010
2010 20th International Conference on Pattern Recognition
20th International Conference on Pattern Recognition

This paper addresses the multimodal nature of social dominance and presents multimodal fusion techniques to combine audio and visual nonverbal cues for dominance estimation in small group conversations. We combine the two modalities both at the feature extraction level and at the classifier level via score and rank level fusion. The classification is done by a simple rule-based estimator. We perform experiments on a new 10-hour dataset derived from the popular AMI meeting corpus. We objectively evaluate the performance of each modality and each cue alone and in combination. Our results show that the combination of audio and visual cues is necessary to achieve the best performance.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

Aran_ICPR2010_2010.pdf

Access type

openaccess

Size

190.27 KB

Format

Adobe PDF

Checksum (MD5)

e47c174e5a7a87f5876cb8effc0d9d71

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés