The experiments described show that RARMA (robust autoregressive moving average) analysis can be applied to the extraction of speech synthesis rules for nasal consonants that bear relationship to articulatory gestures. This is an important advantage in comparison to the results of the usual all-pole techniques. The poles and zeros resolved by RARMA could be directly translated into target frequencies for the vocal tract (anti-)resonators in this synthesis model. Inspection of short-term FFTs of natural speech utterances showed that during the realization of the nasal consonant an additional zero-per-pole is present around 800 Hz that is not resolved by RARMA. This special event rides on the steep spectral slope of the first resonance peak in the nasal consonant.
展开▼