Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition

Asaei, Afsaneh; Bourlard, Hervé; Cevher, Volkan

doi:10.1109/ICASSP.2011.5947379

conference paper

Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition

Asaei, Afsaneh

•

Bourlard, Hervé

•

Cevher, Volkan

2011

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The 36th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

We leverage the recent algorithmic advances in compressive sensing, and propose a novel source separation algorithm for efficient recovery of convolutive speech mixtures in spectro-temporal domain. Compared to the common sparse component analysis techniques, our approach fully exploits structured sparsity models to obtain substantial improvement over the existing state-of-the-art. We evaluate our method for separation and recognition of a target speaker in a multi-party scenario. Our results provide compelling evidence of the effectiveness of sparse recovery formulations in speech recognition.

Name

BSS-MSR.pdf

Access type

openaccess

Size

143.69 KB

Format

Adobe PDF

Checksum (MD5)

d66dbc0580fd109420222ad52eee3ace