Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. In the Mood for Vlog: Multimodal Inference in Conversational Social Video
 
research article

In the Mood for Vlog: Multimodal Inference in Conversational Social Video

Sanchez-Cortes, Dairazalia
•
Kumano, Shiro
•
Otsuka, Kazuhiro
Show more
2015
ACM Transactions on Interactive Intelligent Systems

The prevalent “share what’s on your mind” paradigm of social media can be examined from the perspective of mood: short-term affective states revealed by the shared data. This view takes on new relevance given the emergence of conversational social video as a popular genre among viewers looking for entertainment and among video contributors as a channel for debate, expertise sharing, and artistic expression. From the perspective of human behavior understanding, in conversational social video both verbal and nonverbal in- formation is conveyed by speakers and decoded by viewers. We present a systematic study of classification and ranking of mood impressions in social video, using vlogs from YouTube. Our approach considers eleven natural mood categories labeled through crowdsourcing by external observers on a diverse set of conversa- tional vlogs. We extract a comprehensive number of nonverbal and verbal behavioral cues from the audio and video channels to characterize the mood of vloggers. Then we implement and validate vlog classification and vlog ranking tasks using supervised learning methods. Following a reliability and correlation analysis of the mood impression data, our study demonstrates that, while the problem is challenging, several mood categories can be inferred with promising performance. Furthermore, multimodal features perform consis- tently better than single channel features. Finally, we show that addressing mood as a ranking problem is a promising practical direction for several of the mood categories studied.

  • Details
  • Metrics
Type
research article
DOI
10.1145/2641577
Web of Science ID

WOS:000361248300004

Author(s)
Sanchez-Cortes, Dairazalia
Kumano, Shiro
Otsuka, Kazuhiro
Gatica-Perez, Daniel  
Date Issued

2015

Publisher

Assoc Computing Machinery

Published in
ACM Transactions on Interactive Intelligent Systems
Volume

5

Issue

2

Start page

9

Subjects

Social video

•

mood

•

video blogs

•

vlogs

•

nonverbal behavior

•

verbal content

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Available on Infoscience
May 19, 2015
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/113988
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés