Abstract

In this paper we present a novel nonlinear video diffusion approach based on the fusion of information from the audio and video channels. Both modalities are efficiently combined into a diffusion coefficient that encodes the basic assumption in this domain, namely that related events in the audio and video channels occur at approximately the same time. The proposed diffusion coefficient thus depends on an estimate of the synchrony between sounds and video motion. As a result, information in video regions whose motion is not coherent with the soundtrack is attenuated, and the sound sources are automatically highlighted. Several tests on challenging real-world sequences containing significant auditory and/or visual distractors demonstrate that our approach is able to emphasize the regions that are related to the soundtrack. In addition, to illustrate the capabilities of our method, we propose an application to the extraction of audio-related video regions by unsupervised segmentation. To the best of our knowledge, this is the first nonlinear video diffusion approach that integrates information from the audio modality.
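The abstract gives no implementation details, but the core idea, nonlinear diffusion whose coefficient is driven by audio-video synchrony, can be illustrated with a minimal sketch. The code below applies a Perona-Malik-style explicit diffusion scheme in which a hypothetical per-pixel synchrony map (values in [0, 1]) scales the diffusivity, so regions uncorrelated with the soundtrack are smoothed away while synchronous regions are preserved. All names, parameters, and the specific diffusivity function are assumptions for illustration, not the authors' method.

```python
import numpy as np

def audio_visual_diffusion(frame, sync_map, n_iter=20, dt=0.2, kappa=0.1):
    """Sketch of audio-modulated nonlinear diffusion on one video frame.

    frame    : 2D float array, intensities assumed in [0, 1]
    sync_map : 2D float array in [0, 1]; 1 = motion coherent with the audio
    kappa    : edge-stopping scale; should match the intensity range of `frame`
    """
    u = frame.astype(np.float64).copy()
    # Perona-Malik-style diffusivity: small across strong intensity edges.
    g = lambda d: np.exp(-(d / kappa) ** 2)
    for _ in range(n_iter):
        # Differences toward the four neighbors (np.roll gives periodic
        # boundaries, which is adequate for a sketch).
        dn = np.roll(u, -1, axis=0) - u
        ds = np.roll(u, 1, axis=0) - u
        de = np.roll(u, -1, axis=1) - u
        dw = np.roll(u, 1, axis=1) - u
        # Assumption: diffusion is strongest where synchrony is low, so
        # audio-unrelated detail is progressively washed out.
        c = 1.0 - sync_map
        u += dt * c * (g(dn) * dn + g(ds) * ds + g(de) * de + g(dw) * dw)
    return u
```

In a complete system the synchrony map would itself be estimated per frame, for instance from the correlation between the audio energy envelope and local video motion; here it is simply taken as an input to keep the diffusion step isolated.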
