Blind Audiovisual Source Separation Using Sparse Representations

Llagostera Casanovas, Anna; Monaci, Gianluca; Vandergheynst, Pierre

doi:10.1109/ICIP.2007.4379306

2007

Abstract

In this work we present a method to jointly separate active audio and visual structures in a given mixture. Blind Audiovisual Source Separation is achieved by exploiting the coherence between a video signal and a one-microphone audio track. An efficient representation of the audio and video sequences makes it possible to build relationships between correlated structures in both modalities. Video structures that exhibit strong correlations with the audio signal and that are spatially close are grouped using a robust clustering algorithm that can count and localize audiovisual sources. Using this information and exploiting audio-video correlation, audio sources are also localized and separated. To the best of our knowledge, this is the first blind audiovisual source separation algorithm conceived to deal with a video sequence and the corresponding mono audio signal.
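
The grouping step described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the robust clustering method, so a plain distance-threshold grouping (union-find) stands in for it here. The function name `group_correlated_structures`, the `(x, y, corr)` tuple layout, and the thresholds `min_corr` and `max_dist` are all assumptions made for illustration.

```python
from math import hypot


def group_correlated_structures(structures, min_corr=0.5, max_dist=20.0):
    """Group video structures that correlate with the audio and lie close together.

    structures: list of (x, y, corr) tuples, where (x, y) is a hypothetical
    image position and corr a hypothetical audio-video correlation score.
    Returns one dict per estimated source, sorted by centroid.
    """
    if min_corr <= 0:
        raise ValueError("min_corr must be positive")

    # Keep only the structures strongly correlated with the audio track.
    active = [(x, y, c) for (x, y, c) in structures if c >= min_corr]

    # Union-find: merge any two retained structures closer than max_dist.
    parent = list(range(len(active)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(active)):
        for j in range(i + 1, len(active)):
            dist = hypot(active[i][0] - active[j][0], active[i][1] - active[j][1])
            if dist <= max_dist:
                parent[find(i)] = find(j)

    # Collect the members of each group.
    groups = {}
    for i, item in enumerate(active):
        groups.setdefault(find(i), []).append(item)

    # Localize each source at the correlation-weighted centroid of its group.
    sources = []
    for members in groups.values():
        total = sum(c for _, _, c in members)
        cx = sum(x * c for x, _, c in members) / total
        cy = sum(y * c for _, y, c in members) / total
        sources.append({"centroid": (cx, cy), "size": len(members)})
    return sorted(sources, key=lambda s: s["centroid"])


# Made-up example: two groups of correlated structures plus one weak outlier.
example = [(10, 10, 0.9), (14, 12, 0.8), (100, 50, 0.7), (104, 48, 0.9), (60, 30, 0.1)]
sources = group_correlated_structures(example)
print(len(sources), "sources found")
```

The number of groups returned plays the role of the source count, and each centroid plays the role of a source location. Separating the audio itself would additionally require the audio-video correlations, which this sketch does not model.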

Details

Title Blind Audiovisual Source Separation Using Sparse Representations

Author(s) Llagostera Casanovas, Anna ; Monaci, Gianluca ; Vandergheynst, Pierre

Published in 2007 IEEE International Conference on Image Processing

Volume 3

Pages 301-304

Conference IEEE International Conference on Image Processing, San Antonio, Texas, USA, September 16-19, 2007

Date 2007

Keywords

LTS2; Audiovisual processing; Blind source separation; Sparse signal representation

Note ITS

DOI https://doi.org/10.1109/ICIP.2007.4379306

Laboratories LTS2

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LTS2 - Signal Processing Laboratory 2
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Record creation date 2007-01-24
