In our continuing goal of making speech synthesis more accessible. I will describe our latest advancements in documenting and automating the process of building new voices in Edinburgh University's Festival Speech Synthesis System
The intention is allow relatively unskilled users build new synthetic voices in currently supported and completely new languages. Although the task of producing perfect quality synthesis is still a research issue, we now have examples of how basic diphone synthesizers in new langauges can created in a few months of work (sometimes more, sometimes less). I will discusses the generic techniques we provide for building text analysers, lexicons, letter to sound rules, data driven prosodic models, autolabelling techniques, schema generation and recording aids.
I will also discuss some limit domain synthesis techniques that allow near automatic construction high quality natural synthesis for specific tasks, using one our unit selection techniques.
Most of the documents, scripts tools and techniques discussed in the talks are collect together at http://www.festvox.org, (which is continually being updated).
57 ST E/1st & 2nd Ave Huge drmn 1 BR 750+ sf, lots of sun \& clsts. Sundeck & lndry facils. Askg $187K, maint $868, utils incld. Call Bkr Peter 914-428-9054.
from Edinburgh's Susan Fitt and Stephen Isard (1999). "Synthesis of Regional English using a Keyword Lexicon" in Eurospeech 99, pp. 823-826
From domain of 24 spoken (time specific) sentences we can build a synthesizer to say any time (fully automatic).