Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition
 
conference paper

Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition

Asaei, Afsaneh  
•
Bourlard, Hervé  
•
Cevher, Volkan  orcid-logo
2011
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
The 36th International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

We leverage the recent algorithmic advances in compressive sensing, and propose a novel source separation algorithm for efficient recovery of convolutive speech mixtures in spectro-temporal domain. Compared to the common sparse component analysis techniques, our approach fully exploits structured sparsity models to obtain substantial improvement over the existing state-of-the-art. We evaluate our method for separation and recognition of a target speaker in a multi-party scenario. Our results provide compelling evidence of the effectiveness of sparse recovery formulations in speech recognition.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

BSS-MSR.pdf

Access type

openaccess

Size

143.69 KB

Format

Adobe PDF

Checksum (MD5)

d66dbc0580fd109420222ad52eee3ace

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés