Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Adversarial-Free Speaker Identity-Invariant Representation Learning for Automatic Dysarthric Speech Classification
 
conference paper

Adversarial-Free Speaker Identity-Invariant Representation Learning for Automatic Dysarthric Speech Classification

Janbakhshi, Parvaneh
•
Kodrasi, Ina  
January 1, 2022
Interspeech 2022
Interspeech Conference

Speech representations which are robust to pathology-unrelated cues such as speaker identity information have been shown to be advantageous for automatic dysarthric speech classification. A recently proposed technique to learn speaker identity-invariant representations for dysarthric speech classification is based on adversarial training. However, adversarial training can be challenging, unstable, and sensitive to training parameters. To avoid adversarial training, in this paper we propose to learn speaker-identity invariant representations exploiting a feature separation framework relying on mutual information minimization. Experimental results on a database of neurotypical and dysarthric speech show that the proposed adversarial-free framework successfully learns speaker identity-invariant representations. Further, it is shown that such representations result in a similar dysarthric speech classification performance as the representations obtained using adversarial training, while the training procedure is more stable and less sensitive to training parameters.

  • Details
  • Metrics
Type
conference paper
DOI
10.21437/Interspeech.2022-402
Web of Science ID

WOS:000900724502063

Author(s)
Janbakhshi, Parvaneh
Kodrasi, Ina  
Date Issued

2022-01-01

Publisher

ISCA-INT SPEECH COMMUNICATION ASSOC

Publisher place

Baixas

Published in
Interspeech 2022
Series title/Series vol.

Interspeech

Start page

2138

End page

2142

Subjects

Acoustics

•

Audiology & Speech-Language Pathology

•

Computer Science, Artificial Intelligence

•

Engineering, Electrical & Electronic

•

Acoustics

•

Audiology & Speech-Language Pathology

•

Computer Science

•

Engineering

•

parkinson's disease

•

speaker identity

•

feature separation

•

supervised autoencoder

•

mutual information

•

parkinsons-disease

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

Event nameEvent placeEvent date
Interspeech Conference

Incheon, SOUTH KOREA

Sep 18-22, 2022

Available on Infoscience
March 27, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/196431
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés