Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition

Asaei, Afsaneh; Bourlard, Hervé; Cevher, Volkan

doi:10.1109/ICASSP.2011.5947379

conference paper

2011

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The 36th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

We leverage recent algorithmic advances in compressive sensing and propose a novel source separation algorithm for efficient recovery of convolutive speech mixtures in the spectro-temporal domain. Compared to common sparse component analysis techniques, our approach fully exploits structured sparsity models to obtain substantial improvement over the existing state of the art. We evaluate our method on separation and recognition of a target speaker in a multi-party scenario. Our results provide compelling evidence of the effectiveness of sparse recovery formulations in speech recognition.

Type

conference paper

DOI

10.1109/ICASSP.2011.5947379

Author(s)

Asaei, Afsaneh

Bourlard, Hervé

Cevher, Volkan

Date Issued

2011

Publisher

IEEE Service Center, 445 Hoes Lane, PO Box 1331, Piscataway, NJ 08855-1331 USA

Published in

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Start page

4600

End page

4603

Subjects

Model-Based Compressive Sensing

Multi-party Speech Recognition

Convolutive Overlapping Speech

Sparse Component Analysis

Sparse Signal Recovery

Note

awarded by IEEE Spoken Language Processing

Editorial or Peer reviewed

NON-REVIEWED

Written at

EPFL

EPFL units

LIDIAP

LIONS

Event name

The 36th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Event place

Prague, Czech Republic

Event date

May 22-27, 2011

Available on Infoscience

May 1, 2012

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/79789