Using pitch frequency information in speech recognition

Magimai.-Doss, Mathew; Stephenson, Todd Andrew; Bourlard, Hervé

doi:10.21437/Eurospeech.2003-692

conference paper

Using pitch frequency information in speech recognition

Magimai.-Doss, Mathew

•

Stephenson, Todd Andrew

•

Bourlard, Hervé

2003

Proceedings of Eurospeech

8th European Conference on Speech Communication and Technology (Eurospeech 2003)

Automatic Speech Recognition systems typically use smoothed spectral features as acoustic observations. In recent studies, it has been shown that complementing these standard features with pitch frequency could improve the system performance of the system. While previously proposed systems have been studied in the framework of HMM/GMMs, in this paper we study and compare different ways to include pitch frequency in state-of-the-art hybrid HMM/ANN system. We have evaluated the proposed system on two different ASR tasks, namely, isolated word recognition and connected word recognition. Our results show that pitch frequency can indeed be used in ASR systems to improve the recognition performance.

Type

conference paper

DOI

10.21437/Eurospeech.2003-692

Authors

Magimai.-Doss, Mathew

•

Stephenson, Todd Andrew

•

Bourlard, Hervé

Publication date

2003

Published in

Proceedings of Eurospeech

Publisher place

Geneva, Switzerland

Volume

4

Start page

2525

End page

2528

Subjects

speech

mathew

Note

IDIAP-RR 03-23

URL

http://publications.idiap.ch/downloads/reports/2002/rr03-23.pdf