Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Implementation of VTLN for Statistical Speech Synthesis
 
conference paper

Implementation of VTLN for Statistical Speech Synthesis

Saheer, Lakshmi
•
Dines, John  
•
Garner, Philip N.
Show more
2010
7th ISCA Workshop on Speech Synthesis (SSW 7)
Proceedings of ISCA Speech Synthesis Workshop

Vocal tract length normalization is an important feature normalization technique that can be used to perform speaker adaptation when very little adaptation data is available. It was shown earlier that VTLN can be applied to statistical speech synthesis and was shown to give additive improvements to CMLLR. This paper presents an EM optimization for estimating more accurate warping factors. The EM formulation helps to embed the feature normalization in the HMM training. This helps in estimating the warping factors more efficiently and enables the use of multiple (appropriate) warping factors for different state clusters of the same speaker.

  • Files
  • Details
  • Metrics
Loading...
Thumbnail Image
Name

Saheer_ISCASPEECHSYNTHESISWORKSHOP(SSW7)_2010.pdf

Access type

openaccess

Size

105.55 KB

Format

Adobe PDF

Checksum (MD5)

2246d056839e0d23c25d2b14a4b361f7

Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés