Context-Aware Attention Mechanism for Speech Emotion Recognition

Lazaridis, Alexandros

doi:10.1109/SLT.2018.8639633

conference paper

Context-Aware Attention Mechanism for Speech Emotion Recognition

Ramet, Gaetan

•

Garner, Philip N.

•

Baeriswyl, Michael

2018

2018 IEEE Spoken Language Technology Workshop (SLT)

IEEE Workshop on Spoken Language Technology

In this work, we study the use of attention mechanisms to enhance the performance of the state-of-the-art deep learning model in Speech Emotion Recognition (SER). We introduce a new Long Short-Term Memory (LSTM)-based neural network attention model which is able to take into account the temporal information in speech during the computation of the attention vector. The proposed LSTM-based model is evaluated on the IEMOCAP dataset using a 5-fold cross-validation scheme and achieved 68.8% weighted accuracy on 4 classes, which outperforms the state-of-the-art models.

Type

conference paper

DOI

10.1109/SLT.2018.8639633

Web of Science ID

WOS:000463141800019

Author(s)

Ramet, Gaetan

Garner, Philip N.

Baeriswyl, Michael

Lazaridis, Alexandros

Date Issued

2018

Publisher

IEEE

Publisher place

New York

Published in

2018 IEEE Spoken Language Technology Workshop (SLT)

ISBN of the book

978-1-5386-4334-1

Series title/Series vol.

IEEE Workshop on Spoken Language Technology

Start page

126

End page

131

Subjects

speech emotion recognition

•

attention

•

deep learning

•

neural network

URL

Related documents

http://publications.idiap.ch/downloads/papers/2018/Ramet_SLT_2018.pdf

URL

http://www.slt2018.org/

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

LIDIAP

Event name	Event place	Event date
IEEE Workshop on Spoken Language Technology	Athens, Greece	Dec 18-21, 2018

Available on Infoscience

February 6, 2019

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/154378