Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Unsupervised Extraction of Audio-Visual Objects
 
conference paper

Unsupervised Extraction of Audio-Visual Objects

Llagostera Casanovas, Anna  
•
Vandergheynst, Pierre  
2011
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP)

We propose a novel method to automatically detect and extract the video modality of the sound sources that are present in a scene. For this purpose, we first assess the synchrony between the moving objects captured with a video camera and the sounds recorded by a microphone. Next, video regions presenting a high coherence with the soundtrack are automatically labelled as being part of the source. This represents the starting point for an innovative video segmentation approach, whose objective is to extract the complete audio-visual object. The proposed graph-cut segmentation procedure includes an audio-visual term that links together pixels in regions with high audio-video coherence. Our approach is demonstrated on challenging sequences presenting non-stationary sound sources and distracting moving objects.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2011.5946938
Web of Science ID

WOS:000296062402151

Author(s)
Llagostera Casanovas, Anna  
Vandergheynst, Pierre  
Date Issued

2011

Publisher

Ieee Service Center, 445 Hoes Lane, Po Box 1331, Piscataway, Nj 08855-1331 Usa

Published in
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
Start page

2284

End page

2287

Subjects

audio-visual processing

•

graph cuts

•

LTS2

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LTS2  
Event nameEvent placeEvent date
IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP)

Prague, Czech Republic

May 22-27, 2011

Available on Infoscience
October 21, 2010
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/55962
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés