Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment

Asaei, Afsaneh; Cernak, Milos; Bourlard, Hervé; Ram, Dhananjay

Asaei, Afsaneh; Cernak, Milos; Bourlard, Hervé; Ram, Dhananjay

2017

Formats

Format
BibTeX
MARC
MARCXML
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Speech is a complex signal produced by a highly constrained articulation machinery. Neuro and psycholinguistic theories assert that speech can be decomposed into molecules of structured atoms. Although characterization of the atoms is controversial, the experiments support the notion of invariant speech codes governing speech production and perception. We exploit deep neural network (DNN) invariant representation learning for probabilistic characterization of the phone attributes defined in terms of the phonological classes and known as the smallest-size perceptual categories. We cast speech perception as a channel for phoneme information transmission via the phone attributes. Structured sparse codes are identified from the phonological probabilities for natural speech pronunciation. We exploit the sparse codes in information transmission analysis for assessment of phoneme pronunciation. The linguists define a single binary phonological code per phoneme. In contrast, probabilistic estimation of the phonological classes enables us to capture large variation in structures of speech pronunciation. Hence, speech assessment may not be confined to the single expert knowledge based mapping between phoneme and phonological classes and it may be extended to multiple data-driven mappings observed in natural speech.

Details

Title Sparse Pronunciation Codes for Perceptual Phonetic Information Assessment

Author(s) Asaei, Afsaneh ; Cernak, Milos ; Bourlard, Hervé ; Ram, Dhananjay

Conference Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS)

Date 2017

Note Proceeding of Abstracts for Communication

Laboratories LIDIAP

Record Appears in Scientific production and competences > STI - School of Engineering > IEM - Institut d'Electricité et de Microtechnique > LIDIAP - L'IDIAP Laboratory
Scientific production and competences > Euler Center for Signal Processing
Conference Papers
Work produced at EPFL

Record creation date 2017-05-19

Actions

Preview

Select file: