Blind Audiovisual Source Separation Using Sparse Redundant Representations

Llagostera Casanovas, Anna; Monaci, Gianluca; Vandergheynst, Pierre

2007

Abstract

In this work, we present a method that jointly separates the active audio and visual structures in a given mixture. This new concept, Blind Audiovisual Source Separation (BAVSS), is achieved by exploiting the coherence between the signals recorded by a video camera and a single microphone. An efficient representation of the audio and video sequences makes it possible to build robust audiovisual relationships between temporally correlated structures of both modalities or, equivalently, between two parts of the same audiovisual event. First, the video sources are localized and separated in the image sequence by exploiting the temporal co-occurrence of audiovisual events and by using a spatial clustering algorithm, without requiring any prior assumption about the number of sources in the mixture. Second, the same audiovisual relationships, together with a probabilistic time-frequency analysis, allow the audio sources in the soundtrack to be separated, which completes the audiovisual separation.
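The first step described in the abstract (keeping the video structures that co-occur in time with audio events, then grouping them spatially without fixing the number of sources) can be illustrated with a minimal sketch. This is not the method of the report: the binary activity vectors, the co-occurrence score, the distance-threshold grouping, and every name and parameter below (`cooccurrence`, `cluster_positions`, `localize_sources`, `min_score`, `radius`) are assumptions made purely for illustration.

```python
from math import hypot


def cooccurrence(audio, video):
    """Fraction of the frames where the video structure is active
    in which the audio is active too (illustrative score only)."""
    active = sum(video)
    if active == 0:
        return 0.0
    return sum(a * v for a, v in zip(audio, video)) / active


def cluster_positions(points, radius):
    """Group 2-D points by distance threshold (single linkage).
    The number of clusters is not given in advance."""
    labels = [-1] * len(points)
    n_clusters = 0
    for i in range(len(points)):
        if labels[i] != -1:
            continue
        # Start a new cluster and grow it through nearby points.
        labels[i] = n_clusters
        stack = [i]
        while stack:
            j = stack.pop()
            for k in range(len(points)):
                dist = hypot(points[j][0] - points[k][0],
                             points[j][1] - points[k][1])
                if labels[k] == -1 and dist <= radius:
                    labels[k] = n_clusters
                    stack.append(k)
        n_clusters += 1
    return labels


def localize_sources(audio, atoms, min_score=0.5, radius=20.0):
    """Keep video structures correlated with the audio, then cluster them.
    `atoms` is a list of (x, y, activity) tuples (hypothetical format)."""
    kept = [(x, y) for x, y, act in atoms
            if cooccurrence(audio, act) >= min_score]
    return kept, cluster_positions(kept, radius)


# Toy data: one binary activity value per video frame.
audio = [1, 1, 0, 0, 1, 0]
atoms = [
    (10, 10, [1, 1, 0, 0, 1, 0]),    # moves together with the audio
    (15, 12, [1, 0, 0, 0, 1, 0]),    # same image region, also correlated
    (200, 50, [1, 1, 0, 0, 0, 0]),   # second, distant correlated region
    (100, 100, [0, 0, 1, 1, 0, 1]),  # active only when the audio is silent
]
kept, labels = localize_sources(audio, atoms)
```

On this toy input the structure at (100, 100) is discarded because it never coincides with audio activity, and the remaining three fall into two spatial groups. The number of groups is determined by the `radius` threshold rather than specified beforehand.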

Details

Title Blind Audiovisual Source Separation Using Sparse Redundant Representations

Author(s) Llagostera Casanovas, Anna ; Monaci, Gianluca ; Vandergheynst, Pierre

Date 2007

Keywords (free)

Audiovisual processing; Blind source separation; Sparse signal representation; LTS2

Note ITS

Laboratories LTS2

The document appears in Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LTS2 - Laboratoire de traitement des signaux 2
Work produced at EPFL
Technical reports
Published

Record creation date 2007-01-25
