
Infoscience

research article

A Tractable Framework for Estimating and Combining Spectral Source Models for Audio Source Separation

Arberet, Simon • Ozerov, Alexey • Bimbot, Frédéric • Gribonval, Rémi
2012
Signal Processing

The underdetermined blind audio source separation (BSS) problem is often addressed in the time-frequency (TF) domain by assuming that each TF point is modeled as an independent random variable with a sparse distribution. Methods based on structured spectral models, such as the Spectral Gaussian Scaled Mixture Models (Spectral-GSMMs) or Spectral Non-negative Matrix Factorization models, perform better because they exploit the statistical diversity of audio source spectrograms and thus go beyond the simple sparsity assumption. However, for discrete state-based models such as Spectral-GSMMs, learning the models from the mixture can be computationally very expensive. One of the main problems is that a classical Expectation-Maximization procedure often leads to a complexity that is exponential in the number of sources. In this paper, we propose a framework with linear complexity for learning spectral source models (including discrete state-based models) from noisy source estimates. Moreover, this framework allows different probabilistic models to be combined, which can be seen as a form of probabilistic fusion. We illustrate that methods based on this framework can significantly improve BSS performance compared to state-of-the-art approaches. © 2012 Elsevier B.V. All rights reserved.
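The gap between exponential and linear complexity mentioned in the abstract can be made concrete with a small counting sketch. It is only an illustration, not the authors' algorithm. It assumes that jointly learning discrete state-based models from the mixture requires considering every combination of per-source states at each TF point, whereas learning each model separately from its own source estimate only requires visiting that source's states. The function names and the choice of 8 states per source are hypothetical.

```python
def joint_state_count(num_sources: int, states_per_source: int) -> int:
    """State combinations to consider when all sources are modeled jointly.

    Assumption: each source has the same number of discrete states, and a
    joint E-step must weigh every combination of them.
    """
    return states_per_source ** num_sources


def per_source_state_count(num_sources: int, states_per_source: int) -> int:
    """States to consider when each source model is learned on its own."""
    return num_sources * states_per_source


if __name__ == "__main__":
    states = 8  # hypothetical number of states per source model
    for sources in range(1, 6):
        joint = joint_state_count(sources, states)
        separate = per_source_state_count(sources, states)
        print(f"sources={sources}  joint={joint}  per-source={separate}")
```

Under these assumptions, five sources with eight states each give 32768 joint combinations but only 40 per-source states, which is the kind of difference the proposed framework is meant to exploit.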

Type: research article
DOI: 10.1016/j.sigpro.2011.12.022
Web of Science ID: WOS:000303080700013
Author(s): Arberet, Simon • Ozerov, Alexey • Bimbot, Frédéric • Gribonval, Rémi
Date Issued: 2012
Published in: Signal Processing
Volume: 92
Issue: 8
Start page: 1886
End page: 1901
Subjects: Blind source separation • multichannel audio • Gaussian mixture model • expectation-maximization algorithm • convolutive mixture • LTS2
Note: Special issue on "Latent Variable Analysis and Signal Separation"
Peer reviewed: REVIEWED
Written at: EPFL
EPFL units: LTS2
Available on Infoscience: May 18, 2011
Use this identifier to reference this record: https://infoscience.epfl.ch/handle/20.500.14299/67496

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, all rights reserved