Infoscience

research article

Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution

Schleusing, Olaf
•
Kinnunen, Tomi
•
Story, Brad
2013
IEEE Transactions on Audio, Speech, and Language Processing

In this work, we present a joint source-filter optimization approach for separating voiced speech into vocal tract (VT) and voice source components. The presented method is pitch-synchronous and is therefore highly robust to vocal jitter, shimmer and other glottal variations while covering various voice qualities. The voice source is modeled using the Liljencrants-Fant (LF) model, which is integrated into a time-varying auto-regressive speech production model with exogenous input (ARX). The non-convex optimization problem of finding the optimal model parameters is addressed by a heuristic, evolutionary optimization method called differential evolution. The optimization method is first validated in a series of experiments with synthetic speech. Estimated glottal source and VT parameters are the criteria used for comparison with the iterative adaptive inverse filtering (IAIF) method and the linear prediction (LP) method under varying conditions such as jitter, fundamental frequency (f0), and environmental and glottal noise. The results show that the proposed method substantially reduces the bias and standard deviation of estimated VT coefficients and glottal source parameters. Furthermore, the performance of the source-filter separation is evaluated in experiments using speech generated with a physical model of speech production. The proposed method reliably estimates glottal flow waveforms and lower formant frequencies. Results obtained for higher formant frequencies indicate that research on more accurate voice source models and their interaction with the VT is necessary to improve the source-filter separation. The proposed optimization approach promises to be a useful tool for future research addressing this topic.
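The idea of fitting source and filter parameters jointly with differential evolution can be illustrated with a small toy sketch. Everything here is an assumption made for illustration and is not taken from the article: the source is a crude half-sine pulse train rather than the LF model, the filter is a fixed second-order ARX model rather than a time-varying one, and the function names, bounds, and DE settings (`pop_size`, `f`, `cr`) are hypothetical.

```python
import math
import random

def glottal_pulses(n_samples, period, open_quotient):
    """Crude stand-in for a glottal source (NOT the LF model): one half-sine
    pulse per pitch period, non-zero only during the open phase."""
    open_len = open_quotient * period
    out = []
    for n in range(n_samples):
        t = n % period
        out.append(math.sin(math.pi * t / open_len) if t < open_len else 0.0)
    return out

def arx_output(a1, a2, source):
    """Toy second-order ARX model: y[n] = -a1*y[n-1] - a2*y[n-2] + u[n]."""
    y = []
    for n, u in enumerate(source):
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(-a1 * y1 - a2 * y2 + u)
    return y

def cost(params, target, period):
    """Squared error between the target and the signal synthesized from the
    candidate filter coefficients (a1, a2) and source parameter (oq)."""
    a1, a2, oq = params
    y = arx_output(a1, a2, glottal_pulses(len(target), period, oq))
    return sum((p - q) ** 2 for p, q in zip(y, target))

def differential_evolution(objective, bounds, pop_size=30, generations=150,
                           f=0.7, cr=0.9, seed=0):
    """Basic DE/rand/1/bin with greedy selection and bound clipping."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    costs = [objective(x) for x in pop]
    history = [min(costs)]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct population members, all different from i.
            r1, r2, r3 = rng.sample([j for j in range(pop_size) if j != i], 3)
            forced = rng.randrange(dim)  # at least one mutated coordinate
            trial = []
            for d in range(dim):
                if d == forced or rng.random() < cr:
                    v = pop[r1][d] + f * (pop[r2][d] - pop[r3][d])
                    lo, hi = bounds[d]
                    v = min(max(v, lo), hi)
                else:
                    v = pop[i][d]
                trial.append(v)
            c = objective(trial)
            if c <= costs[i]:  # keep the trial only if it is no worse
                pop[i], costs[i] = trial, c
        history.append(min(costs))
    best = min(range(pop_size), key=costs.__getitem__)
    return pop[best], costs[best], history

# Synthetic target generated from known (assumed) parameters.
PERIOD = 40
TRUE_PARAMS = (-1.2, 0.8, 0.6)  # a1, a2, open quotient
target = arx_output(TRUE_PARAMS[0], TRUE_PARAMS[1],
                    glottal_pulses(160, PERIOD, TRUE_PARAMS[2]))
BOUNDS = [(-1.9, 1.9), (-0.95, 0.95), (0.3, 0.9)]
estimate, final_cost, history = differential_evolution(
    lambda p: cost(p, target, PERIOD), BOUNDS)
print("estimate:", estimate, "cost:", final_cost)
```

Because the source parameter and the filter coefficients are searched together against a single error criterion, the sketch mirrors the joint formulation in spirit only; the article's method would additionally require the LF model, pitch-synchronous framing, and time-varying coefficients.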

Type
research article
DOI
10.1109/TASL.2013.2255275
Web of Science ID

WOS:000318545200003

Author(s)
Schleusing, Olaf
•
Kinnunen, Tomi
•
Story, Brad
•
Vesin, Jean-Marc  
Date Issued

2013

Publisher

IEEE (Institute of Electrical and Electronics Engineers, Inc.)

Published in
IEEE Transactions on Audio, Speech, and Language Processing
Volume

21

Issue

8

Start page

1560

End page

1572

Subjects

  • Global optimization
  • differential evolution
  • joint source-filter optimization
  • glottal inverse filtering
  • time-varying vocal tract estimation

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
SCI-STI-JMV  
Available on Infoscience
October 1, 2013
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/95718