Skip to main content

A Predictive Minimum Description Length (MDL) Approach to Speech Clustering and Segmentation

New Image

The method has been applied to the problem of statistical clustering of speech ceptral coefficients into Gaussian mixtures. When the data has a temporal order, e.g. speech frames in continues speech, the prediction error, as a function of time, can be used for segmentation, as well as an alternative cost function for statistical modeling. The average prediction error approaches a limiting function of time, asymptotically independent of the particular modeling scheme, and thus is a characteristic of the speech signal itself.