On-the-Fly Audio Source Separation-A Novel User-Friendly Framework

El Badawy, Dalia; Duong, Ngoc Q. K.; Ozerov, Alexey

doi:10.1109/Taslp.2016.2632528

El Badawy, Dalia; Duong, Ngoc Q. K.; Ozerov, Alexey

2017

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

This paper addresses the challenging problem of single-channel audio source separation. We introduce a novel userguided framework where source models that govern the separation process are learned on-the-fly from audio examples retrieved online. The user only provides the search keywords that describe the sources in the mixture. In this framework, the generic spectral characteristics of each source are modeled by a universal sound class model learned from the retrieved examples via nonnegative matrix factorization. We propose several group sparsity-inducing constraints in order to efficiently exploit a relevant subset of the universal model adapted to the mixture to be separated. We then derive the corresponding multiplicative update rules for parameter estimation. Separation results obtained from automated and user tests on mixtures containing various types of sounds confirm the effectiveness of the proposed framework.

Details

Title On-the-Fly Audio Source Separation-A Novel User-Friendly Framework

Author(s) El Badawy, Dalia ; Duong, Ngoc Q. K. ; Ozerov, Alexey

Published in IEEE/ACM Transactions on Audio, Speech and Language Processing

Pagination 12

Volume 25

Issue 2

Pages 261-272

Date 2017

ISSN 2329-9290

Keywords

Group sparsity; non-negative matrix factorization; on-the-fly audio source separation; universal sound class model; user-guided

DOI https://doi.org/10.1109/Taslp.2016.2632528

Other identifier(s) View record in Web of Science

Laboratories IINFCOM

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > UNATTRIBUTED-IINFCOM - IINFCOM - Unattributed publications
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2017-03-27

Abstract

Details

Actions