Considerations in voice transformation with physiologic scaling principles

Ingo Titze, Darrell Wong, Brad Story, Russell Long

Research output: Contribution to journal › Article › peer-review

2 Scopus citations


This study begins to explore the importance of the physiological domain in voice transformation. A general approach is outlined for transforming the voice quality of sentence-level speech while maintaining the same phonetic content. Transformations will eventually include gender, age, voice quality, emotional state, disordered state, dialect or impersonation. In this paper, only a specific voice quality, twang, is described as an example. The basic question is: relative to pure signal processing, can voices be transformed more effectively if biomechanical, acoustic and anatomical scaling principles are applied? At present, two approaches are contrasted, a Linear Predictive Coding approach and a biomechanical simulation approach.

Original language: English (US)
Pages (from-to): 113-123
Number of pages: 11
Journal: Speech Communication
Issue number: 2-3
State: Published - Aug 1997


Keywords

  • Speech simulation
  • Speech synthesis
  • Voice conversion
  • Voice transformation
  • Vowel quality

ASJC Scopus subject areas

  • Software
  • Modeling and Simulation
  • Communication
  • Language and Linguistics
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

