Automatic glottal inverse filtering with the Markov chain Monte Carlo method

Harri Auvinen, Tuomo Raitio, Manu Airaksinen, Samuli Siltanen, Brad H. Story, Paavo Alku

Research output: Contribution to journal › Article › peer-review

9 Scopus citations

Abstract

This paper presents a new glottal inverse filtering (GIF) method that utilizes a Markov chain Monte Carlo (MCMC) algorithm. First, initial estimates of the vocal tract and glottal flow are evaluated by an existing GIF method, iterative adaptive inverse filtering (IAIF). Simultaneously, the initially estimated glottal flow is synthesized using the Rosenberg-Klatt (RK) model and filtered with the estimated vocal tract filter to create a synthetic speech frame. In the MCMC estimation process, the first few poles of the initial vocal tract model and the RK excitation parameter are refined in order to minimize the error between the synthetic and original speech signals in the time and frequency domains. MCMC approximates the posterior distribution of the parameters, and the final estimate of the vocal tract is found by averaging the parameter values of the Markov chain. Experiments with synthetic vowels produced by a physical modeling approach show that the MCMC-based GIF method gives more accurate results than two known reference methods.
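
The refinement loop described in the abstract can be illustrated with a small toy sketch. It is not the authors' implementation. Everything concrete in it is an assumption made for illustration: the vocal tract is reduced to a single two-pole resonator, the RK-style pulse is a peak-normalized polynomial of the form x^2 - x^3 controlled by an open-quotient-like parameter `oq`, a hand-picked nearby starting point stands in for the IAIF estimate, and the sampler is a plain random-walk Metropolis scheme with a combined time-domain and magnitude-spectrum error. The names `rk_flow`, `synthesize`, `log_posterior`, and `metropolis` are hypothetical.

```python
import numpy as np

FS, F0, N_PERIODS = 8000, 100, 3   # toy sampling rate (Hz), pitch (Hz), frame length in periods

def rk_flow(oq):
    """Pulse train built from an RK-style polynomial pulse, peak-normalized to 1."""
    t0 = FS // F0                              # samples per pitch period
    x = np.arange(t0) / (oq * t0)              # position within the open phase
    pulse = np.where(x <= 1.0, 6.75 * (x**2 - x**3), 0.0)
    return np.tile(pulse, N_PERIODS)

def synthesize(theta, bw=100.0):
    """Differentiate the flow, then filter it with one two-pole resonator."""
    oq, freq = theta
    exc = np.diff(rk_flow(oq), prepend=0.0)    # first difference as a crude lip radiation
    r = np.exp(-np.pi * bw / FS)               # pole radius from the (fixed) bandwidth
    a1, a2 = -2.0 * r * np.cos(2.0 * np.pi * freq / FS), r * r
    y = np.zeros(len(exc) + 2)                 # two leading zeros act as the initial state
    for n, e in enumerate(exc):
        y[n + 2] = e - a1 * y[n + 1] - a2 * y[n]
    return y[2:]

def log_posterior(theta, target, sigma=0.2):
    """Flat prior on a box; Gaussian-style penalty on time and spectral error."""
    oq, freq = theta
    if not (0.3 <= oq <= 0.9 and 200.0 <= freq <= 1500.0):
        return -np.inf
    synth = synthesize(theta)
    s_mag, t_mag = np.abs(np.fft.rfft(synth)), np.abs(np.fft.rfft(target))
    time_err = np.sum((synth - target) ** 2) / np.sum(target ** 2)
    freq_err = np.sum((s_mag - t_mag) ** 2) / np.sum(t_mag ** 2)
    return -(time_err + freq_err) / (2.0 * sigma ** 2)

def metropolis(target, theta0, n_iter=3000, step=(0.01, 10.0), seed=0):
    """Random-walk Metropolis sampler; returns the full chain of parameter values."""
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float)
    logp = log_posterior(theta, target)
    chain = np.empty((n_iter, theta.size))
    for i in range(n_iter):
        proposal = theta + rng.normal(0.0, step)
        logp_new = log_posterior(proposal, target)
        if np.log(rng.uniform()) < logp_new - logp:   # accept or reject the proposal
            theta, logp = proposal, logp_new
        chain[i] = theta
    return chain

# Toy "original" frame: known parameters plus a little noise (all values assumed).
clean = synthesize((0.60, 500.0))
target = clean + np.random.default_rng(1).normal(0.0, 0.01, clean.shape)

# Assumed initial estimate, playing the role of the IAIF starting point.
chain = metropolis(target, theta0=(0.57, 460.0))

# Final estimate: average of the chain after discarding a burn-in segment.
oq_est, freq_est = chain[1000:].mean(axis=0)
print(f"open quotient ~ {oq_est:.3f}, resonance ~ {freq_est:.1f} Hz")
```

Averaging the retained samples mirrors the abstract's statement that the final estimate is the mean of the Markov chain. The burn-in length, proposal step sizes, and error weighting are arbitrary choices for this toy and would need tuning on real speech.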

Original language: English (US)
Pages (from-to): 1139-1155
Number of pages: 17
Journal: Computer Speech and Language
Volume: 28
Issue number: 5
State: Published - Sep 2014

Keywords

  • Glottal inverse filtering
  • Markov chain Monte Carlo

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Human-Computer Interaction
