TY - JOUR
T1 - Parameterization of vocal tract area functions by empirical orthogonal modes
AU - Story, Brad H.
AU - Titze, Ingo R.
N1 - Funding Information:
This work was supported by Grant R01 DC02532 from the National Institute on Deafness and other Communication Disorders. The author would like to thank Dr. David Berry for fruitful discussions on empirical orthogonal mode decomposition. Three reviewers are also acknowledged for their helpful suggestions on improving the original manuscript.
PY - 1998/7
Y1 - 1998/7
N2 - A set of ten vowel area functions, based on MRI measurements, has been parameterized by an "empirical orthogonal mode decomposition" which accurately represents each area function as the sum of the mean area function and proportional amounts of a series of orthogonal basis functions. The mean area function was found to possess a formant structure similar to that of a uniform tube (i.e., nearly equally spaced formants) suggesting that empirical orthogonal modes are perturbations on the mean (∼ neutral) vowel shape much like past vocal tract analyses have considered perturbations on a uniform tube. The acoustic characteristics of the two most significant empirical orthogonal modes were examined, showing that both modes tend to increase the first formant as the modal amplitude coefficients are both increased from negative to positive values. However, the second formant was found to decrease in frequency for increasing values of the first modal coefficient and to increase for increasing values of the second mode coefficient. Next, a mapping between F1-F2 formant pairs and vocal tract area functions is proposed which is largely one-to-one but was initially limited by a constant vocal tract length. A possible method to include variable vocal tract length and higher ordered orthogonal modes in the mapping is given. The mode-to-formant mapping suggested the possibility of an inverse mapping to determine physiologically realistic area functions from a speech waveform and a simple example is presented. Finally, empirical orthogonal modes for a collection of ten vowels and eight consonants were derived and showed many similarities to those for the vowel-only case.
AB - A set of ten vowel area functions, based on MRI measurements, has been parameterized by an "empirical orthogonal mode decomposition" which accurately represents each area function as the sum of the mean area function and proportional amounts of a series of orthogonal basis functions. The mean area function was found to possess a formant structure similar to that of a uniform tube (i.e., nearly equally spaced formants) suggesting that empirical orthogonal modes are perturbations on the mean (∼ neutral) vowel shape much like past vocal tract analyses have considered perturbations on a uniform tube. The acoustic characteristics of the two most significant empirical orthogonal modes were examined, showing that both modes tend to increase the first formant as the modal amplitude coefficients are both increased from negative to positive values. However, the second formant was found to decrease in frequency for increasing values of the first modal coefficient and to increase for increasing values of the second mode coefficient. Next, a mapping between F1-F2 formant pairs and vocal tract area functions is proposed which is largely one-to-one but was initially limited by a constant vocal tract length. A possible method to include variable vocal tract length and higher ordered orthogonal modes in the mapping is given. The mode-to-formant mapping suggested the possibility of an inverse mapping to determine physiologically realistic area functions from a speech waveform and a simple example is presented. Finally, empirical orthogonal modes for a collection of ten vowels and eight consonants were derived and showed many similarities to those for the vowel-only case.
UR - http://www.scopus.com/inward/record.url?scp=0032116042&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0032116042&partnerID=8YFLogxK
U2 - 10.1006/jpho.1998.0076
DO - 10.1006/jpho.1998.0076
M3 - Article
AN - SCOPUS:0032116042
SN - 0095-4470
VL - 26
SP - 223
EP - 260
JO - Journal of Phonetics
JF - Journal of Phonetics
IS - 3
ER -