Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Multi-party Speech Recovery Exploiting Structured Sparsity Models
 
conference paper

Multi-party Speech Recovery Exploiting Structured Sparsity Models

Asaei, Afsaneh  
•
Taghizadeh, Mohammad
•
Bourlard, Hervé  
Show more
2011
Proceeding of Interspeech
12th Annual Conference of the International Speech Communication Association

We study the sparsity of spectro-temporal representation of speech in reverberant acoustic conditions. This study motivates the use of structured sparsity models for efficient speech recovery. We formulate the underdetermined convolutive speech separation in spectro-temporal domain as the sparse signal recovery where we leverage model-based recovery algorithms. To tackle the ambiguity of the real acoustics, we exploit the Image model of the enclosures to estimate the room impulse response function through a structured sparsity constraint optimization. The experiments conducted on real data recordings demonstrate the effectiveness of the proposed approach for multi-party speech applications.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

StructuredSparsity_Speech.pdf

Access type

openaccess

Size

83.28 KB

Format

Adobe PDF

Checksum (MD5)

f11fc217791855f8876ac466bde9bb90

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés