Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification

Roy, Anindya; Marcel, Sébastien

doi:10.1145/1774088.1774407

2010

Abstract

In this paper, we consider the problem of speaker verification as a two-class object detection problem in computer vision, where the object instances are 1-D short-time spectral vectors obtained from the speech signal. More precisely, we investigate the general problem of speaker verification in the presence of additive white Gaussian noise, which we consider as analogous to visual object detection under varying illumination conditions. Inspired by their recent success in illumination-robust object detection, we apply a certain class of binary-valued pixel-pair based features called Ferns for noise-robust speaker verification. Intensive experiments on a benchmark database according to a standard evaluation protocol have shown the advantage of the proposed features in the presence of moderate to extremely high amounts of additive noise.
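The pair-comparison idea behind such features can be illustrated with a minimal sketch. This is not the authors' implementation: the helper names (`make_fern`, `fern_code`), the random choice of bin pairs, the toy spectrum values, and the modelling of additive white noise as the same offset on every spectral bin are all assumptions made purely for illustration.

```python
import random


def make_fern(num_bins, num_pairs, seed=0):
    """Pick random pairs of distinct spectral-bin indices (hypothetical helper)."""
    rng = random.Random(seed)
    return [tuple(rng.sample(range(num_bins), 2)) for _ in range(num_pairs)]


def fern_code(spectrum, pairs):
    """Pack one bit per pair (1 if spectrum[i] > spectrum[j]) into an integer."""
    code = 0
    for i, j in pairs:
        code = (code << 1) | (1 if spectrum[i] > spectrum[j] else 0)
    return code


# Toy 8-bin short-time spectrum in arbitrary integer units (made-up values).
clean = [9, 24, 51, 33, 12, 7, 4, 2]

# Idealised flat noise floor (an assumption): the same offset on every bin.
noisy = [x + 15 for x in clean]

pairs = make_fern(num_bins=len(clean), num_pairs=4)

# Each bit depends only on the ordering of two bins, so a constant offset
# across all bins leaves the code unchanged.
same_code = fern_code(clean, pairs) == fern_code(noisy, pairs)
print(same_code)
```

Because each bit records only which of two bins is larger, any change that preserves the ordering of the compared bins leaves the code untouched. Real noise is random rather than a constant offset, so this shows the intuition only and says nothing about the results reported in the paper.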

Details

Title Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification

Author(s) Roy, Anindya ; Marcel, Sébastien

Published in SAC '10: Proceedings of the 2010 ACM Symposium on Applied Computing

Pages 1491–1495

Conference Association for Computing Machinery - ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland

Date 2010

DOI https://doi.org/10.1145/1774088.1774407

Laboratories LIDIAP

Record creation date 2010-02-11
