Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios

Parthasarathi, Sree Hari Krishnan; Magimai.-Doss, Mathew; Bourlard, Hervé; Gatica-Perez, Daniel

doi:10.1109/ICASSP.2010.5495596

Parthasarathi, Sree Hari Krishnan; Magimai.-Doss, Mathew; Bourlard, Hervé; Gatica-Perez, Daniel

2010

Télécharger

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Résumé

Personal audio logs are often recorded in multiple environments. This poses challenges for robust front-end processing, including speech/nonspeech detection (SND). Motivated by this, we investigate the robustness of four different privacy-sensitive features for SND, namely energy, zero crossing rate, spectral flatness, and kurtosis. We study early and late fusion of these features in conjunction with modeling temporal context. These combinations are evaluated in mismatched conditions on a dataset of nearly 450 hours. While both combinations yield improvements over individual features, generally feature combinations perform better. Comparisons with a state-of-the-art spectral based and a privacy-sensitive feature set are also provided.

Détails

Titre Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios

Auteur(s) Parthasarathi, Sree Hari Krishnan ; Magimai.-Doss, Mathew ; Bourlard, Hervé ; Gatica-Perez, Daniel

Publié dans 2010 IEEE International Conference on Acoustics, Speech and Signal Processing

Pages 4474-4477

Présenté à ICASSP 2010

Date 2010

DOI https://doi.org/10.1109/ICASSP.2010.5495596

Lien supplémentaire URL

Laboratoires LIDIAP

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Production scientifique et compétences > Euler Center for Signal Processing
Papiers de conférence
Travail produit à l'EPFL
Publié

Date de création de la notice 2010-02-11

Files

Résumé

Détails

PDF