Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Reports, Documentation, and Standards
  4. Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios
 
report

Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios

Parthasarathi, Sree Hari Krishnan  
•
Magimai.-Doss, Mathew  
•
Bourlard, Hervé  
Show more
2010

Personal audio logs are often recorded in multiple environments. This poses challenges for robust front-end processing, including speech/nonspeech detection (SND). Motivated by this, we investigate the robustness of four different privacy-sensitive features for SND, namely energy, zero crossing rate, spectral flatness, and kurtosis. We study early and late fusion of these features in conjunction with modeling temporal context. These combinations are evaluated in mismatched conditions on a dataset of nearly 450 hours. While both combinations yielded improvements over individual features, generally feature combinations performed better. Comparisons with a state-of-the-art spectral based and a privacy-sensitive feature set are also provided.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

Parthasarathi_Idiap-RR-01-2010.pdf

Access type

openaccess

Size

445.8 KB

Format

Adobe PDF

Checksum (MD5)

87d0f108f2b53410a294c0d1ab94aca2

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés