Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation
 
conference paper

On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures And Breathing Parameters Estimation

Mostaani, Zohreh
•
Nallanthighal, Venkata Srikanth
•
Harma, Aki
Show more
January 1, 2021
2021 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp 2021)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The respiratory system is one of the major components of the speech production system. Any alteration in breathing can result in changes in speech. Specific breathing characteristics, such as breathing rate and tidal volume, can indicate a person's pathological condition. More recently, neural network-based methods have started emerging for predicting the breathing signal from the speech signal. The neural networks are trained and evaluated with different objective measures, such as mean squared error (MSE) and Pearson's correlation. This paper investigates whether there is a systematic relationship between the different objective measures used for training and evaluating the neural network models and the end-goal, i.e. estimation of breathing parameters such as, breathing rate and tidal volume. Our investigations on two different data sets with two different neural network-based approaches show that there is no clear systematic relationship. In other words, obtaining a high Pearson's correlation on the evaluation set does not necessarily mean better breathing parameter estimation. Thus, indicating the need for developing other objective evaluation measures.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP39728.2021.9414756
Web of Science ID

WOS:000704288401118

Author(s)
Mostaani, Zohreh
Nallanthighal, Venkata Srikanth
Harma, Aki
Strik, Helmer
Magimai-Doss, Mathew  
Date Issued

2021-01-01

Publisher

IEEE

Publisher place

New York

Published in
2021 Ieee International Conference On Acoustics, Speech And Signal Processing (Icassp 2021)
ISBN of the book

978-1-7281-7605-5

Start page

1345

End page

1349

Subjects

Acoustics

•

Computer Science, Artificial Intelligence

•

Computer Science, Software Engineering

•

Engineering, Electrical & Electronic

•

Imaging Science & Photographic Technology

•

Computer Science

•

Engineering

•

respiratory parameters

•

neural networks

•

speech breathing

•

volume

•

end

URL

Link to IDIAP database

http://publications.idiap.ch/downloads/papers/2021/Mostaani_ICASSP_2021.pdf
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Event nameEvent placeEvent date
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

ELECTR NETWORK

Jun 06-11, 2021

Available on Infoscience
December 4, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/183604
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés