Modeling Auxiliary Information in Bayesian Network Based ASR

Automatic speech recognition bases its models on the acoustic features derived from the speech signal. Some have investigated replacing or supplementing these features with information that can not be precisely measured (articulator positions, pitch, gender, etc.) automatically. Consequently, automatic estimations of the desired information would be generated. This data can degrade performance due to its imprecisions. In this paper, we describe a system that treats pitch as an auxiliary information within the framework of Bayesian networks, resulting in improved performance.


Published in:
7th European Conference on Speech Communication and Technology (Eurospeech 2001), 4, 2765-2768
Presented at:
7th European Conference on Speech Communication and Technology (Eurospeech~2001)
Year:
2001
Publisher:
Aalborg, Denmark
Keywords:
Note:
IDIAP-RR 01-11
Laboratories:




 Record created 2006-03-10, last modified 2018-03-17

n/a:
Download fulltextPDF
External links:
Download fulltextURL
Download fulltextRelated documents
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)