A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation

Durrieu, Jean-Louis; David, Bertrand; Richard, Gaël

doi:10.1109/JSTSP.2011.2158801

Durrieu, Jean-Louis; David, Bertrand; Richard, Gaël

2011

Download

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

When designing an audio processing system, the target tasks often influence the choice of a data representation or transformation. Low-level time-frequency representations such as the short-time Fourier transform (STFT) are popular, because they offer a meaningful insight on sound properties for a low computational cost. Conversely, when higher level semantics, such as pitch, timbre or phoneme, are sought after, representations usually tend to enhance their discriminative characteristics, at the expense of their invertibility. They become so-called mid-level representations. In this paper, a source/filter signal model which provides a mid-level representation is proposed. This representation makes the pitch content of the signal as well as some timbre information available, hence keeping as much information from the raw data as possible. This model is successfully used within a main melody extraction system and a lead instrument/accompaniment separation system. Both frameworks obtained top results at several international evaluation campaigns.

Details

Title A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation

Author(s) Durrieu, Jean-Louis ; David, Bertrand ; Richard, Gaël

Published in IEEE Journal of Selected Topics in Signal Processing

Volume 5

Issue 6

Pages 1180-1191

Date 2011

ISSN 1941-0484

Keywords

Audio melody extraction; audio signal representation; musical audio source separation; non-negative matrix factorization (NMF); pitch estimation; Nonnegative Matrix Factorization; Polyphonic Music; Signals; Melody; Identification; Transcription; Similarity; Sounds

DOI https://doi.org/10.1109/JSTSP.2011.2158801

Other identifier(s) View record in Web of Science

Additional link URL

Laboratories LTS5

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LTS5 - Signal Processing Laboratory 5
Peer-reviewed publications
Work produced at EPFL
Journal Articles
Published

Record creation date 2011-09-27

Files

Abstract

Details

Actions