TY - JOUR
T1 - A spectral filtering method for tracking formants in children's speech
AU - Story, Brad H.
AU - Bunton, Kate
N1 - Publisher Copyright:
© 2015 Acoustical Society of America.
PY - 2015/1/1
Y1 - 2015/1/1
N2 - Children's speech is typically characterized by high fundamental frequencies (e.g., 250-600 Hz) that create widely spaced harmonic components, producing an apparent undersampling of the vocal tract transfer function. The purpose of this study is to describe a formant measurement technique based on cepstral analysis that does not require modification of the cepstrum itself or transformation back to the spectral domain. Instead, the spectrum is low-pass filtered with a cutoff point (i.e., cutoff "quefrency" in the terminology of cepstral analysis) to preserve only the spectral envelope. To test the method, speech representative of a 2 to 3 year-old child was simulated with an airway modulation model of speech production. The model includes physiologically-scaled vocal folds, vocal tract, and trachea and generates sound output analogous to a microphone signal. The true formant frequencies can be calculated independently of the output signal and thus provide test cases that allow for assessing the accuracy of the formant tracking algorithm. Formant analysis will also be applied to children's natural speech samples to demonstrate the method.
AB - Children's speech is typically characterized by high fundamental frequencies (e.g., 250-600 Hz) that create widely spaced harmonic components, producing an apparent undersampling of the vocal tract transfer function. The purpose of this study is to describe a formant measurement technique based on cepstral analysis that does not require modification of the cepstrum itself or transformation back to the spectral domain. Instead, the spectrum is low-pass filtered with a cutoff point (i.e., cutoff "quefrency" in the terminology of cepstral analysis) to preserve only the spectral envelope. To test the method, speech representative of a 2 to 3 year-old child was simulated with an airway modulation model of speech production. The model includes physiologically-scaled vocal folds, vocal tract, and trachea and generates sound output analogous to a microphone signal. The true formant frequencies can be calculated independently of the output signal and thus provide test cases that allow for assessing the accuracy of the formant tracking algorithm. Formant analysis will also be applied to children's natural speech samples to demonstrate the method.
UR - http://www.scopus.com/inward/record.url?scp=84938874103&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84938874103&partnerID=8YFLogxK
U2 - 10.1121/2.0000055
DO - 10.1121/2.0000055
M3 - Conference article
AN - SCOPUS:84938874103
SN - 1939-800X
VL - 23
JO - Proceedings of Meetings on Acoustics
JF - Proceedings of Meetings on Acoustics
IS - 1
M1 - 060002
T2 - 169th Meeting of the Acoustical Society of America
Y2 - 18 May 2015 through 22 May 2015
ER -