Abstract
This study begins to explore the importance of the physiological domain in voice transformation. A general approach is outlined for transforming the voice quality of sentence-level speech while maintaining the same phonetic content. Transformations will eventually include gender, age, voice quality, emotional state, disordered state, dialect or impersonation. In this paper, only a specific voice quality, twang, is described as an example. The basic question is: relative to pure signal processing, can voices be transformed more effectively if biomechanical, acoustic and anatomical scaling principles are applied? At present, two approaches are contrasted, a Linear Predictive Coding approach and a biomechanical simulation approach.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 113-123 |
| Number of pages | 11 |
| Journal | Speech Communication |
| Volume | 22 |
| Issue number | 2-3 |
| DOIs | |
| State | Published - Aug 1997 |
| Externally published | Yes |
Keywords
- Speech simulation
- Speech synthesis
- Voice conversion
- Voice transformation
- Vowel quality
ASJC Scopus subject areas
- Software
- Modeling and Simulation
- Communication
- Language and Linguistics
- Linguistics and Language
- Computer Vision and Pattern Recognition
- Computer Science Applications