Queensland University of Technology, Brisbane, Australia - Built Environment and Engineering

Speech Synthesis


Trainable Speech Synthesis With Trended Hidden Markov Models

A trainable speech synthesis system that uses trended Hidden Markov Models to represent phonetic speech units has been implemented. Its performance has been compared with synthesis using the traditional stationary framework, and it yielded a significant improvement in informal modified rhyme tests. Some examples of the synthesised speech are provided below:

Synthesis of isolated words for modified rhyme tests: male Australian speaker

Speech synthesised using a voice from the WSJ1 corpus, speaker 453:

(Ref: J. Dines and S. Sridharan, "Trainable speech synthesis with trended Hidden Markov Models," ICASSP-2001, pp. 833-837, May 2001.)
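
As a rough illustration of the difference between the stationary and trended frameworks mentioned above, the sketch below contrasts the mean trajectory of a single HMM state under each. It assumes the common formulation in which a trended state's mean is a polynomial in the time spent in that state; this page does not give the exact trend function used by the system, so the polynomial form, the function names and the coefficient values are all hypothetical.

```python
# Illustrative sketch only: the function names and coefficient values are
# hypothetical and are not taken from the system described on this page.

def stationary_mean(mu, duration):
    """Stationary HMM state: the mean is identical for every frame in the state."""
    return [mu for _ in range(duration)]


def trended_mean(coeffs, duration):
    """Trended HMM state (assumed polynomial trend).

    The mean at frame t is sum(coeffs[k] * t**k), where t counts frames since
    the state was entered. coeffs[0] plays the role of the stationary mean.
    """
    return [
        sum(c * t ** k for k, c in enumerate(coeffs))
        for t in range(duration)
    ]


# One feature dimension, one state lasting 5 frames (made-up values).
flat = stationary_mean(1.0, 5)
ramp = trended_mean([1.0, 0.5], 5)  # linear trend: 1.0 + 0.5 * t

print(flat)
print(ramp)
```

With only a constant coefficient, the trended state reduces to the stationary one, which is why the trended model can be read as a generalisation: the extra coefficients let the synthesised parameters move within a state rather than jump between constant levels at state boundaries.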