Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations

Parthasarathi, Sree Hari Krishnan; Magimai.-Doss, Mathew; Bourlard, Hervé; Gatica-Perez, Daniel

Parthasarathi, Sree Hari Krishnan; Magimai.-Doss, Mathew; Bourlard, Hervé; Gatica-Perez, Daniel

2009

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

We investigate four different privacy-sensitive features, namely energy, zero crossing rate, spectral flatness, and kurtosis, for speech detection in multiparty conversations. We liken this scenario to a meeting room and define our datasets and annotations accordingly. The temporal context of these features is modeled. With no temporal context, energy is the best performing single feature. But by modeling temporal context, kurtosis emerges as the most effective feature. Also, we combine the features. Besides yielding a gain in performance, certain combinations of features also reveal that a shorter temporal context is sufficient. We then benchmark other privacy-sensitive features utilized in previous studies. Our experiments show that the performance of all the privacy-sensitive features modeled with context is close to that of state-of-the-art spectral-based features, without extracting and using any features that can be used to reconstruct the speech signal.

Details

Title Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations

Author(s) Parthasarathi, Sree Hari Krishnan ; Magimai.-Doss, Mathew ; Bourlard, Hervé ; Gatica-Perez, Daniel

Date 2009

Publisher Idiap Research Institute, Martigny, Switzerland., Idiap

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2010-02-11

Actions

Preview

Select file: