Using pitch frequency information in speech recognition

Magimai.-Doss, Mathew; Stephenson, Todd Andrew; Bourlard, Hervé

Magimai.-Doss, Mathew; Stephenson, Todd Andrew; Bourlard, Hervé

2003

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Automatic Speech Recognition systems typically use smoothed spectral features as acoustic observations. In recent studies, it has been shown that complementing these standard features with pitch frequency could improve the system performance of the system. While previously proposed systems have been studied in the framework of HMM/GMMs, in this paper we study and compare different ways to include pitch frequency in state-of-the-art hybrid HMM/ANN system. We have evaluated the proposed system on two different ASR tasks, namely, isolated word recognition and connected word recognition. Our results show that pitch frequency can indeed be used in ASR systems to improve the recognition performance.

Details

Title Using pitch frequency information in speech recognition

Author(s) Magimai.-Doss, Mathew ; Stephenson, Todd Andrew ; Bourlard, Hervé

Date 2003

Publisher IDIAP

Keywords

speech; mathew

Additional link URL

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports
Published

Record creation date 2006-03-10

Abstract

Details

Actions