Convolutional Pitch Target Approximation Model for Speech Synthesis

Na, Xingyu; Garner, Philip N.

2013

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

In this paper, we investigate pitch contour modelling in speech synthesis based on segmental units. A convolutional pitch target approximation model is proposed. This model allows jointly stochastic modelling of framewise pitch and pitch contour of longer units, of which the intuitive relations are revealed by a convolutional target approximation filter. The pitch contour is stylized by a linear representation called pitch target. In synthesis stage, the likelihood of the framewise model and the pitch target model are jointly maximized using a Toeplitz matrix representing the discrete convolutional filter.

Details

Title Convolutional Pitch Target Approximation Model for Speech Synthesis

Author(s) Na, Xingyu ; Garner, Philip N.

Date 2013

Publisher Idiap

Keywords

Pitch modelling; pitch target approximation; speech synthesis

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Work produced at EPFL
Technical Reports

Record creation date 2013-12-19

Actions

Preview

Select file: