Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification
 
conference paper

Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification

Roy, Anindya  
•
Marcel, Sébastien  
2010
SAC '10: Proceedings of the 2010 ACM Symposium on Applied Computing
Association for Computing Machinery - ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland

In this paper, we consider the problem of speaker verification as a two-class object detection problem in computer vision, where the object instances are 1-D short-time spectral vectors obtained from the speech signal. More precisely, we investigate the general problem of speaker verification in the presence of additive white Gaussian noise, which we consider as analogous to visual object detection under varying illumination conditions. Inspired by their recent success in illumination-robust object detection, we apply a certain class of binary-valued pixel-pair based features called Ferns for noise-robust speaker verification. Intensive experiments on a benchmark database according to a standard evaluation protocol have shown the advantage of the proposed features in the presence of moderate to extremely high amounts of additive noise.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

Roy_ACMSAC2010_2010.pdf

Access type

openaccess

Size

130.54 KB

Format

Adobe PDF

Checksum (MD5)

228ed4ecbb56444e2421dc1d92a7764c

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés