Towards using slide information to enhance speech transcription of meetings

Peregoudov, Artem; Vinciarelli, Alessandro; Bourlard, Hervé

Peregoudov, Artem; Vinciarelli, Alessandro; Bourlard, Hervé

2006

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In this paper we investigate the possibility of improving the speech recognition performance of meeting recordings by using slides captured during the recording process. The key hypothesis exploited in this work is that both slides and speech carry correlated contextual and semantic information. Thus, we propose an approach using the information extracted from slides aimed at reducing the speech recognition word error rate. The N-Best lists output by the recogniser are rescored through Information Retrieval techniques to maximise the similarity between speech and slides transcripts. Results obtained on three meeting recordings (for a total duration of about 90 minutes) show no statistically significant variation of the word error rate. Additional studies provide further insight based on both language properties and statistics of the word distributions in the two sources.

Details

Title Towards using slide information to enhance speech transcription of meetings

Author(s) Peregoudov, Artem ; Vinciarelli, Alessandro ; Bourlard, Hervé

Date 2006

Publisher IDIAP

Keywords

speech

Note Submitted for publication

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2006-03-10

Actions

Preview

Select file: