Model-based Compressive Sensing for Multi-party Distant Speech Recognition

Asaei, Afsaneh; Bourlard, Hervé; Cevher, Volkan

doi:10.1109/ICASSP.2011.5947379

conference paper not in proceedings

Model-based Compressive Sensing for Multi-party Distant Speech Recognition

Asaei, Afsaneh

•

Bourlard, Hervé

•

Cevher, Volkan

2011

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We leverage the recent algorithmic advances in compressive sensing, and propose a novel source separation algorithm for efficient recovery of convolutive speech mixtures in spectro-temporal domain. Compared to the common sparse component analysis techniques, our approach fully exploits structured sparsity models to obtain substantial improvement over the existing state-of-the-art. We evaluate our method for separation and recognition of a target speaker in a multi-party scenario. Our results provide compelling evidence of the effectiveness of sparse recovery formulations in speech recognition.

Name

Asaei_ICASSP_2011.pdf

Access type

openaccess

Size

143.57 KB

Format

Adobe PDF

Checksum (MD5)

c8c4148c980c5bdbf32f0e9f4e000bb9