Improving Children Speech Recognition through Feature Learning from Raw Speech Signal

Dubagunta, S. Pavankumar; Kabil, Selen Hande; Magimai.-Doss, Mathew

doi:10.1109/ICASSP.2019.8682826

conference paper

2019

2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Children's speech recognition based on short-term spectral features is a challenging task. One reason is that children's speech has a high fundamental frequency, comparable to formant frequency values. Furthermore, as children grow, their vocal apparatus changes. These factors make it difficult to reliably extract standard short-term spectral features for speech recognition. In recent years, novel acoustic modeling methods have emerged that learn both the features and the phone classifier in an end-to-end manner from the raw speech signal. Through an investigation on the PF-STAR corpus, we show that children's speech recognition can be improved using end-to-end acoustic modeling methods.

Type

conference paper

DOI

10.1109/ICASSP.2019.8682826

Web of Science ID

WOS:000482554005193

Authors

Dubagunta, S. Pavankumar

•

Kabil, Selen Hande

•

Magimai.-Doss, Mathew

Publication date

2019

Publisher

IEEE

Published in

2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Publisher place

New York

Start page

5736

End page

5740

Subjects

acoustic modeling

Children speech recog...

Convolutional Neural ...

end-to-end training

URL

Related documents

http://publications.idiap.ch/downloads/papers/2019/Dubagunta_ICASSP-3_2019.pdf

EPFL units

LIDIAP

Event name

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Event place

Brighton, England

Event date

May 12-17, 2019

Available on Infoscience

February 25, 2019

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/154739