report
Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition
2011
We leverage the recent algorithmic advances in compressive sensing, and propose a novel source separation algorithm for efficient recovery of convolutive speech mixtures in spectro-temporal domain. Compared to the common sparse component analysis techniques, our approach fully exploits structured sparsity models to obtain substantial improvement over the existing state-of-the-art. We evaluate our method for separation and recognition of a target speaker in a multi-party scenario. Our results provide compelling evidence of the effectiveness of sparse recovery formulations in speech recognition.
Type
report
Author(s)
Date Issued
2011
Publisher
Idiap
Written at
EPFL
EPFL units
Available on Infoscience
May 19, 2011
Use this identifier to reference this record