PLP$^2$: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns
The temporal trajectories of the spectral energy in auditory critical bands over 250 ms segments are approximated by an all-pole model, the time-domain dual of conventional linear prediction. This quarter-second auditory spectro-temporal pattern is further smoothed by iteratively alternating spectral and temporal all-pole modeling. Just as Perceptual Linear Prediction (PLP) uses an autoregressive model in the frequency domain to estimate peaks in an auditory-like short-term spectral slice, PLP$^2$ uses all-pole modeling in both the time and frequency domains to estimate peaks of a two-dimensional spectro-temporal pattern, an approach motivated by considerations of the auditory system.
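The alternation described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `allpole_fit` and `plp2_smooth`, the model orders, the iteration count, and the bands-by-frames array layout are all assumptions made for illustration. The sketch treats any non-negative sequence (a band's energy trajectory over time, or a spectral slice over bands) as if it were a sampled power spectrum, derives autocorrelation values from it by an inverse DFT, solves the linear-prediction normal equations, and evaluates the resulting all-pole envelope on the same grid.

```python
import numpy as np

def allpole_fit(power, order):
    """Fit an all-pole envelope to a non-negative sequence.

    The sequence is treated as a power spectrum sampled at N points
    on [0, pi] (an assumption of this sketch).
    """
    power = np.maximum(np.asarray(power, dtype=float), 1e-12)  # avoid zeros
    n = power.size
    # Even-symmetric extension: one full period of a real, even spectrum.
    full = np.concatenate([power, power[-2:0:-1]])  # length 2n - 2
    # Autocorrelation lags 0..order via the inverse DFT of the spectrum.
    r = np.fft.ifft(full).real[: order + 1]
    # Normal equations R a = -r[1:], with R Toeplitz in the lags.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.concatenate([[1.0], np.linalg.solve(R, -r[1 : order + 1])])
    # Prediction-error power sets the gain of the model.
    gain = r[0] + np.dot(a[1:], r[1 : order + 1])
    # Evaluate gain / |A|^2 at the same n points on [0, pi].
    A = np.fft.fft(a, 2 * n - 2)[:n]
    return gain / np.abs(A) ** 2

def plp2_smooth(pattern, order_time, order_freq, n_iter=5):
    """Alternate temporal and spectral all-pole smoothing.

    pattern: 2-D non-negative array, rows = critical bands,
    columns = time frames (layout assumed for this sketch).
    """
    s = np.asarray(pattern, dtype=float)
    for _ in range(n_iter):
        # Temporal pass: smooth each band's trajectory along time.
        s = np.apply_along_axis(allpole_fit, 1, s, order_time)
        # Spectral pass: smooth each frame's slice along frequency.
        s = np.apply_along_axis(allpole_fit, 0, s, order_freq)
    return s
```

For example, `plp2_smooth(pattern, order_time=6, order_freq=4, n_iter=3)` returns an array of the same shape as `pattern`; the orders shown here are placeholders, since the abstract does not state the values used.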
Accepted for publication at the Workshop on Statistical and Perceptual Audio Processing (SAPA) 2004.
Record created on 2006-03-10, modified on 2016-08-08