Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings
 
research article

Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings

Nallanthighal, Venkata Srikanth
•
Mostaani, Zohreh
•
Harma, Aki
Show more
September 1, 2021
Neural Networks

Respiration is an essential and primary mechanism for speech production. We first inhale and then produce speech while exhaling. When we run out of breath, we stop speaking and inhale. Though this process is involuntary, speech production involves a systematic outflow of air during exhalation characterized by linguistic content and prosodic factors of the utterance. Thus speech and respiration are closely related, and modeling this relationship makes sensing respiratory dynamics directly from the speech plausible, however is not well explored. In this article, we conduct a comprehensive study to explore techniques for sensing breathing signal and breathing parameters from speech using deep learning architectures and address the challenges involved in establishing the practical purpose of this technology. Estimating the breathing pattern from the speech would give us information about the respiratory parameters, thus enabling us to understand the respiratory health using one's speech. (C) 2021 The Authors. Published by Elsevier Ltd.

  • Details
  • Metrics
Type
research article
DOI
10.1016/j.neunet.2021.03.029
Web of Science ID

WOS:000681162400001

Author(s)
Nallanthighal, Venkata Srikanth
Mostaani, Zohreh
Harma, Aki
Strik, Helmer
Magimai-Doss, Mathew  
Date Issued

2021-09-01

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Published in
Neural Networks
Volume

141

Start page

211

End page

224

Subjects

Computer Science, Artificial Intelligence

•

Neurosciences

•

Computer Science

•

Neurosciences & Neurology

•

speech breathing

•

signal processing

•

deep neural networks

•

respiratory parameters

•

speech technology

•

automatic detection

•

neural-networks

•

rib cage

•

enhancement

•

regression

•

frequency

•

algorithm

•

volume

•

end

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Available on Infoscience
August 28, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/180980
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés