Computer Speech and Language, vol. 9, no. 3, pp. 257-288
A new quantitative model of tonal perception for continuous speech is described. The
paper illustrates its ability to stylize pitch contours automatically, with applications to
prosodic analysis and speech synthesis in mind, and evaluates it in a perception experiment.
After a discussion of the psychoacoustics of tonal perception and an overview of
existing tonal perception models and systems for automatic analysis of intonation, the
model and its computer implementation are described in detail. The implementation includes
parameter extraction, segmentation into syllables, perceptual integration of short-term pitch change,
tonal segment computation, and pitch contour stylization.
This is followed by a perception experiment in which subjects are asked to distinguish
original signals from resynthesized signals with automatically stylized pitch contours.
The aim of this experiment is to show the usefulness of the model as a basis for intonation
representation, and to study the influence of the model parameters. It is shown that the
stylization obtained with the model is an economical representation of intonation which can
be useful for speech synthesis and prosodic analysis.
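
As a rough illustration of the perceptual integration and stylization steps listed above, the following minimal sketch reduces the pitch samples of one syllable to either a static tone or a glide. It is not the paper's implementation: the function names, the per-syllable input format, and the glissando threshold of the form G = 0.16/T^2 semitones per second (a value often quoted in the psychoacoustic literature) are assumptions made for this example, since the abstract does not specify them.

```python
import math


def hz_to_semitones(f_hz, ref_hz=1.0):
    """Convert a frequency in Hz to semitones relative to ref_hz."""
    return 12.0 * math.log2(f_hz / ref_hz)


def stylize_syllable(times, f0_hz, threshold_coeff=0.16):
    """Stylize one syllable's pitch as a (start, end) pair in semitones.

    times           -- sample times in seconds (hypothetical input format)
    f0_hz           -- pitch samples in Hz at those times
    threshold_coeff -- assumed coefficient of the glissando threshold
                       G = threshold_coeff / T**2 (semitones per second)
    """
    duration = times[-1] - times[0]
    if duration <= 0:
        raise ValueError("syllable duration must be positive")

    start_st = hz_to_semitones(f0_hz[0])
    end_st = hz_to_semitones(f0_hz[-1])

    # Rate of pitch change over the syllable, in semitones per second.
    rate = abs(end_st - start_st) / duration
    glissando_threshold = threshold_coeff / duration ** 2

    if rate >= glissando_threshold:
        # Change assumed audible: keep it as a straight-line glide.
        return (start_st, end_st)

    # Change assumed inaudible: replace it with a static tone at the mean pitch.
    mean_st = sum(hz_to_semitones(f) for f in f0_hz) / len(f0_hz)
    return (mean_st, mean_st)


# Usage: a 200 ms syllable rising from 100 Hz to 110 Hz is kept as a glide,
# while a rise from 100 Hz to 101 Hz is flattened to a static tone.
glide = stylize_syllable([0.0, 0.1, 0.2], [100.0, 105.0, 110.0])
static = stylize_syllable([0.0, 0.1, 0.2], [100.0, 100.5, 101.0])
```

Working in semitones rather than Hz keeps the threshold independent of the speaker's register. The sketch uses only the first and last samples to estimate the glide, which is a simplification chosen for brevity.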