Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Learning voice source related information for depression detection
 
Loading...
Thumbnail Image
conference paper

Learning voice source related information for depression detection

Dubagunta, S. Pavankumar
•
Vlasenko, Bogdan
•
Magimai.-Doss, Mathew
2019
2019 IEEE International Conference on Acoustics, Speech and Signal Processing (Icassp)
44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

During depression neurophysiological changes can occur, which may affect laryngeal control i.e. behaviour of the vocal folds. Characterising these changes in a precise manner from speech signals is a non trivial task, as this typically involves reliable separation of the voice source information from them. In this paper, by exploiting the abilities of CNNs to learn task-relevant information from the input raw signals, we investigate several methods to model voice source related information for depression detection. Specifically, we investigate modelling of low pass filtered speech signals, linear prediction residual signals, homomorphically filtered voice source signals and zero frequency filtered signals to learn voice source related information for depression detection. Our investigations show that subsegmental level modelling of linear prediction residual signals or zero frequency filtered signals leads to systems better than the state-of-the-art low level descriptor based systems and deep learning based systems modelling the vocal tract system information.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2019.8683498
Web of Science ID

WOS:000482554006151

Author(s)
Dubagunta, S. Pavankumar
•
Vlasenko, Bogdan
•
Magimai.-Doss, Mathew
Date Issued

2019

Publisher

IEEE

Publisher place

New York

Journal
2019 IEEE International Conference on Acoustics, Speech and Signal Processing (Icassp)
Start page

6525

End page

6529

Subjects

Convolutional Neural Networks

•

depression detection

•

glottal source signals.

•

zero-frequency filtering

URL

Related documents

http://publications.idiap.ch/downloads/papers/2019/Dubagunta_ICASSP-2_2019.pdf
Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent placeEvent date
44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Brighton, ENGLAND

May 12-17, 2019

Available on Infoscience
February 25, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/154738
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés