Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Reports, Documentation, and Standards
  4. Privacy-Sensitive Audio Features for Speech/Nonspeech Detection
 
report

Privacy-Sensitive Audio Features for Speech/Nonspeech Detection

Parthasarathi, Sree Hari Krishnan  
•
Gatica-Perez, Daniel  
•
Bourlard, Hervé  
Show more
2011

The goal of this paper is to investigate features for speech/nonspeech detection (SND) having ``minimal'' linguistic information from the speech signal. Towards this, we present a comprehensive study of privacy-sensitive features for SND in multiparty conversations. Our study investigates three different approaches to privacy-sensitive features. These approaches are based on: (a) simple, instantaneous feature extraction methods; (b) excitation source information based methods; and (c) feature obfuscation methods such as local (within 130 ms) temporal averaging and randomization applied on excitation source information. To evaluate these approaches for SND, we use multiparty conversational meeting data of nearly 450 hours. On this dataset, we evaluate these features and benchmark them against state-of-the-art spectral shape based features such as Mel-Frequency Perceptual Linear Prediction (MF-PLP). Fusion strategies combining excitation source with simple features show that state-of-the-art performance can be obtained in both close-talking and far-field microphone scenarios. As one way to quantify and evaluate the notion of privacy, we conduct Automatic Speech Recognition (ASR) studies on TIMIT. While excitation source features yield phoneme recognition accuracies in between the simple features and the MF-PLP features, obfuscation methods applied on the excitation features yield low phoneme accuracies in conjunction with state-of-the-art SND performance.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

Parthasarathi_Idiap-RR-12-2011.pdf

Access type

openaccess

Size

735.68 KB

Format

Adobe PDF

Checksum (MD5)

a3fab1861c9d23f8cafd6767689bcc67

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés