Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media
 
conference paper

Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media

Le, Nam
•
Odobez, Jean-Marc  
2016
Mm'16: Proceedings Of The 2016 Acm Multimedia Conference
ACM Multimedia

Person discovery in the absence of prior identity knowledge requires accurate association of visual and auditory cues. In broadcast data, multimodal analysis faces additional challenges due to narrated voices over muted scenes or dubbing in different languages. To address these challenges, we define and analyze the problem of dubbing detection in broadcast data, which has not been explored before. We propose a method to represent the temporal relationship between the auditory and visual streams. This method consists of canonical correlation analysis to learn a joint multimodal space, and long short term memory (LSTM) networks to model cross-modality temporal dependencies. Our contributions also include the introduction of a newly acquired dataset of face-speech segments from TV data, which we have made publicly available. The proposed method achieves promising performance on this real world dataset as compared to several baselines.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1145/2964284.2967211
Web of Science ID

WOS:000387733800021

Author(s)
Le, Nam
Odobez, Jean-Marc  
Date Issued

2016

Publisher

ACM

Publisher place

New York

Published in
Mm'16: Proceedings Of The 2016 Acm Multimedia Conference
ISBN of the book

978-1-4503-3603-1

Total of pages

5

Start page

202

End page

206

Subjects

Multimodal

•

Person Diarization

•

Recurrent Neural Networks

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent place
ACM Multimedia

Amsterdam

Available on Infoscience
August 19, 2016
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/128773
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés