The segmentation of multi-channel meeting recordings for automatic speech recognition

Dines, John; Vepa, Jithendra; Hain, Thomas

doi:10.21437/Interspeech.2006-366

2006

Abstract

One major research challenge in the domain of the analysis of meeting room data is the automatic transcription of what is spoken during meetings, a task which has gained considerable attention within the ASR research community through the NIST rich transcription evaluations conducted over the last three years. One of the major difficulties in carrying out automatic speech recognition (ASR) on this data is dealing with the challenging recording environment, which has instigated the development of novel audio pre-processing approaches. In this paper we present a system for the automatic segmentation of multiple-channel individual headset microphone (IHM) meeting recordings for automatic speech recognition. The system relies on an MLP classifier trained from several meeting room corpora to identify speech/non-speech segments of the recordings. We give a detailed analysis of the segmentation performance for a number of system configurations, with our best system achieving ASR performance on automatically generated segments within 1.3% (3.7% relative) of a manual segmentation of the data.
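
The two figures in the last sentence are linked by the reference score: a relative difference is the absolute difference divided by the reference value. The sketch below works backwards from them, assuming both are differences in error rate measured against the manual segmentation. The abstract does not state the underlying error rates, so the function name `implied_reference_rate` is hypothetical and the value it prints is only an implied estimate, not a reported result.

```python
def implied_reference_rate(absolute_diff: float, relative_diff: float) -> float:
    """Return the reference rate implied by an absolute and a relative difference.

    relative_diff = 100 * absolute_diff / reference_rate, so
    reference_rate = absolute_diff / (relative_diff / 100).
    All arguments and the result are expressed in percent.
    """
    if relative_diff <= 0:
        raise ValueError("relative_diff must be positive")
    return absolute_diff / (relative_diff / 100.0)


# Figures quoted in the abstract: 1.3% absolute, 3.7% relative.
# Assumption: both describe the same error-rate gap to the manual segmentation.
reference = implied_reference_rate(1.3, 3.7)
print(f"implied reference error rate: about {reference:.1f}%")
```

Under that assumption the manual-segmentation error rate would be roughly 35%, which is consistent with the abstract's description of meeting recordings as a challenging recording environment.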

Details

Title The segmentation of multi-channel meeting recordings for automatic speech recognition

Author(s) Dines, John ; Vepa, Jithendra ; Hain, Thomas

Published in Interspeech 2006

Pages 1548-Tue3A1O.4

Presented at Int. Conf. on Spoken Language Processing (Interspeech ICSLP)

Date 2006

Publisher Pittsburgh, USA

Note IDIAP-RR 06-22

DOI https://doi.org/10.21437/Interspeech.2006-366

Laboratories LIDIAP

The document appears in Scientific production and competences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIDIAP - Laboratoire de l'IDIAP
Scientific production and competences > Euler Center for Signal Processing
Conference papers
Work produced at EPFL
Published

Record creation date 2010-02-11
