Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization

Asaei, Afsaneh; Taghizadeh, Mohammadjavad; Haghighatshoar, Saeid; Raj, Bhiksha; Bourlard, Hervé; Cevher, Volkan

doi:10.1109/Tsp.2015.2488598

2016

Abstract

We propose a sparse coding approach to address the problem of source-sensor localization and speech reconstruction. This approach relies on designing a dictionary of spatialized signals by projecting the microphone array recordings onto the array manifolds characterized for different locations in a reverberant enclosure using the image model. Sparse representation over this dictionary enables identifying the subspace of the actual recordings and its correspondence to the source and sensor locations. The speech signal is reconstructed by inverse filtering the acoustic channels associated with the array manifolds. We provide a rigorous analysis of the optimality of speech reconstruction by elucidating the links between inverse filtering and source separation followed by deconvolution. This procedure is evaluated for localization, reconstruction and recognition of simultaneous speech sources using real data recordings. The results demonstrate the effectiveness of the proposed approach, which compares favorably with beamforming and independent component analysis techniques.
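To illustrate the general idea of locating sources through a sparse, binary representation over a location-indexed dictionary, here is a minimal toy sketch. It is not the authors' algorithm: the dictionary below is a random matrix standing in for the spatialized signals (the paper builds it from array manifolds and the image model), the recovery step is plain orthogonal matching pursuit, and the names `greedy_sparse_support`, `n_features`, `n_locations` and `true_locations` are hypothetical, chosen only for this example.

```python
import numpy as np


def greedy_sparse_support(D, y, k):
    """Orthogonal matching pursuit: pick k columns of D that best explain y."""
    residual = y.copy()
    support = []
    for _ in range(k):
        # Score each candidate location by its correlation with the residual.
        scores = np.abs(D.T @ residual)
        if support:
            scores[support] = -np.inf  # never pick the same location twice
        support.append(int(np.argmax(scores)))
        # Least-squares fit on the selected columns, then update the residual.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
    return sorted(support)


# ASSUMPTION: a random unit-norm dictionary, one column per candidate location.
rng = np.random.default_rng(0)
n_features, n_locations = 200, 30
D = rng.standard_normal((n_features, n_locations))
D /= np.linalg.norm(D, axis=0)

# Two active locations with binary (0/1) activations, no noise.
true_locations = [4, 17]
y = D[:, true_locations].sum(axis=1)

estimated = greedy_sparse_support(D, y, k=2)
print("estimated active locations:", estimated)
```

In this noiseless toy setting the nonzero entries of the sparse code directly index the active locations; the reverberant, convolutive case treated in the paper requires the channel modeling and inverse filtering described in the abstract.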

Details

Title Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization

Author(s) Asaei, Afsaneh ; Taghizadeh, Mohammadjavad ; Haghighatshoar, Saeid ; Raj, Bhiksha ; Bourlard, Hervé ; Cevher, Volkan

Published in IEEE Transactions on Signal Processing

Volume 64

Issue 3

Pages 567-579

Date 2016

Publisher Piscataway, Institute of Electrical and Electronics Engineers

ISSN 1053-587X

Keywords

Microphone array; multiparty (overlapping) speech recognition; reverberation; sound spatialization; source localization and separation; sparse coding

DOI https://doi.org/10.1109/Tsp.2015.2488598

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2015-09-19
