Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares
 
conference paper

Joint estimation of RETF vector and power spectral densities for speech enhancement based on alternating least squares

Tammen, Marvin
•
Kodrasi, Ina
•
Doclo, Simon
2019
IEEE International Conference on Acoustics, Speech and Signal Processing
IEEE International Conference on Acoustics, Speech and Signal

The multi-channel Wiener filter (MWF) is a well-known multi-microphone speech enhancement technique, aiming at improving the quality of the recorded speech signals in noisy and reverberant environments. Assuming that reverberation and ambient noise can be modeled as a diffuse sound field and the spatial coherence of the residual noise is known, the MWF requires estimates of the relative early transfer function (RETF) vector of the target speaker as well as the power spectral densities (PSDs) of the target, diffuse and residual noise component. RETF vector and PSD estimation is often decoupled, where one quantity is estimated independently of the other quantity. In this paper, we propose to jointly estimate the RETF vector and all PSDs by minimizing the Frobenius norm of a model-based error matrix using an alternating least squares method. Experimental results using different dynamic acoustic scenarios with a moving speaker show that the proposed method leads to a larger MWF performance than a state-of-the-art method based on covariance whitening.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2019.8683321
Author(s)
Tammen, Marvin
Kodrasi, Ina
Doclo, Simon
Date Issued

2019

Published in
IEEE International Conference on Acoustics, Speech and Signal Processing
Start page

795

End page

799

Written at

EPFL

EPFL units
LIDIAP  
Event name
IEEE International Conference on Acoustics, Speech and Signal
Available on Infoscience
February 18, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/166356
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés