Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. An End-to-End Networks to Synthetize Intonation using a Generalized Command Response Model
 
conference paper

An End-to-End Networks to Synthetize Intonation using a Generalized Command Response Model

Marelli, François
•
Schnell, Bastian
•
Bourlard, Hervé
Show more
2019
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing

The generalized command response (GCR) model represents intonation as a superposition of muscle responses to spike command signals. We have previously shown that the spikes can be predicted by a two-stage system, consisting of a recurrent neural network and a post-processing procedure, but the responses themselves were fixed dictionary atoms. We propose an end-to-end neural architecture that replaces the dictionary atoms with trainable second-order recurrent elements analogous to recursive filters. We demonstrate gradient stability under modest conditions, and show that the system can be trained by imposing temporal sparsity constraints. Subjective listening tests demonstrate that the system can synthesize intonation with high naturalness, comparable to state-of-the-art acoustic models, and retains the physiological plausibility of the GCR model.

  • Files
  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICASSP.2019.8683815
Web of Science ID

WOS:000482554007055

Author(s)
Marelli, François
Schnell, Bastian
Bourlard, Hervé
Dutoit, T.
Garner, Philip N.
Date Issued

2019

Publisher

Idiap

Published in
ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing
Start page

7040

End page

7044

Subjects

Digital IIR Filters

•

Fujisaki Model

•

neural networks

•

Prosody Modelling

•

speech synthesis

URL

Related documents

http://publications.idiap.ch/downloads/reports/2018/Marelli_Idiap-RR-05-2019.pdf
Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LIDIAP  
Available on Infoscience
May 27, 2019
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/156553
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés