Multi-party Speech Recovery Exploiting Structured Sparsity Models

Asaei, Afsaneh; Taghizadeh, Mohammad J.; Bourlard, Hervé; Cevher, Volkan

doi:10.21437/Interspeech.2011-78

conference paper

Multi-party Speech Recovery Exploiting Structured Sparsity Models

Asaei, Afsaneh

•

Taghizadeh, Mohammad J.

•

Bourlard, Hervé

more

2011

Proceedings of Interspeech

We study the sparsity of spectro-temporal representation of speech in reverberant acoustic conditions. This study motivates the use of structured sparsity models for efficient speech recovery. We formulate the underdetermined convolutive speech separation in spectro-temporal domain as the sparse signal recovery where we leverage model-based recovery algorithms. To tackle the ambiguity of the real acoustics, we exploit the Image Model of the enclosures to estimate the room impulse response function through a structured sparsity constraint optimization. The experiments conducted on real data recordings demonstrate the effectiveness of the proposed approach for multi-party speech applications.

Name

Asaei_INTERSPEECH_2011.pdf

Access type

openaccess

Size

74.2 KB

Format

Adobe PDF

Checksum (MD5)

f42ddb9c26759174c419f3789d1cea29