Context-Aware Attention Mechanism for Speech Emotion Recognition

Ramet, Gaetan; Garner, Philip N.; Baeriswyl, Michael; Lazaridis, Alexandros

doi:10.1109/SLT.2018.8639633

conference paper

Context-Aware Attention Mechanism for Speech Emotion Recognition

Ramet, Gaetan

•

Garner, Philip N.

•

Baeriswyl, Michael

more

2018

2018 IEEE Spoken Language Technology Workshop (SLT)

IEEE Workshop on Spoken Language Technology

In this work, we study the use of attention mechanisms to enhance the performance of the state-of-the-art deep learning model in Speech Emotion Recognition (SER). We introduce a new Long Short-Term Memory (LSTM)-based neural network attention model which is able to take into account the temporal information in speech during the computation of the attention vector. The proposed LSTM-based model is evaluated on the IEMOCAP dataset using a 5-fold cross-validation scheme and achieved 68.8% weighted accuracy on 4 classes, which outperforms the state-of-the-art models.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/154378

Type

conference paper

DOI

10.1109/SLT.2018.8639633

Web of Science ID

WOS:000463141800019

Authors

Ramet, Gaetan

•

Garner, Philip N.

•

Baeriswyl, Michael

•

Lazaridis, Alexandros

Publication date

2018

Publisher

IEEE

Published in

2018 IEEE Spoken Language Technology Workshop (SLT)

ISBN of the book

978-1-5386-4334-1

Publisher place

New York

Series title/Series vol.

IEEE Workshop on Spoken Language Technology

Start page

126

End page

131

Subjects

speech emotion recogn...

attention

deep learning

neural network

URL

Event name	Event place	Event date
IEEE Workshop on Spoken Language Technology	Athens, Greece	Dec 18-21, 2018