Formant frequency estimation of high-pitched vowels using weighted linear prediction

Paavo Alku, Jouni Pohjalainen, Martti Vainio, Anne Maria Laukkanen, Brad H. Story

Research output: Contribution to journalArticlepeer-review

74 Scopus citations

Abstract

All-pole modeling is a widely used formant estimation method, but its performance is known to deteriorate for high-pitched voices. In order to address this problem, several all-pole modeling methods robust to fundamental frequency have been proposed. This study compares five such previously known methods and introduces a technique, Weighted Linear Prediction with Attenuated Main Excitation (WLP-AME). WLP-AME utilizes temporally weighted linear prediction (LP) in which the square of the prediction error is multiplied by a given parametric weighting function. The weighting downgrades the contribution of the main excitation of the vocal tract in optimizing the filter coefficients. Consequently, the resulting all-pole model is affected more by the characteristics of the vocal tract leading to less biased formant estimates. By using synthetic vowels created with a physical modeling approach, the results showed that WLP-AME yields improved formant frequencies for high-pitched sounds in comparison to the previously known methods (e.g., relative error in the first formant of the vowel [a] decreased from 11% to 3% when conventional LP was replaced with WLP-AME). Experiments conducted on natural vowels indicate that the formants detected by WLP-AME changed in a more regular manner between repetitions of different pitch than those computed by conventional LP.

Original languageEnglish (US)
Pages (from-to)1295-1313
Number of pages19
JournalJournal of the Acoustical Society of America
Volume134
Issue number2
DOIs
StatePublished - Aug 2013

ASJC Scopus subject areas

  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Formant frequency estimation of high-pitched vowels using weighted linear prediction'. Together they form a unique fingerprint.

Cite this