From crabapple.srv.cs.cmu.edu!cantaloupe.srv.cs.cmu.edu!das-news.harvard.edu!noc.near.net!howland.reston.ans.net!math.ohio-state.edu!magnus.acs.ohio-state.edu!usenet.ins.cwru.edu!agate!headwall.Stanford.EDU!kithrup.com!mrs Wed Aug 11 01:25:08 EDT 1993 Article: 1130 of comp.speech Xref: crabapple.srv.cs.cmu.edu comp.speech:1130 Newsgroups: comp.speech Path: crabapple.srv.cs.cmu.edu!cantaloupe.srv.cs.cmu.edu!das-news.harvard.edu!noc.near.net!howland.reston.ans.net!math.ohio-state.edu!magnus.acs.ohio-state.edu!usenet.ins.cwru.edu!agate!headwall.Stanford.EDU!kithrup.com!mrs From: mrs@kithrup.com (Mike Stump) Subject: English word database (6,100 words) for klatt Organization: Kithrup Enterprises, Ltd. Message-ID: Reply-To: mrs@cygnus.com Date: Tue, 10 Aug 1993 21:53:39 GMT Lines: 70 You can use the below program with the klatt-0.02 code, and the TIMIT.mostlikely.Z database, to obtain a 6,100 english word pronunciation database. It is not perfect, but it is pretty reasonable. The single most horrible pronunciation is ss. Info to the TIMIT.mostlikely.Z database (from news posting): If you would like to get these pronunciations, they are available via anonymous ftp from ftp.icsi.berkeley.edu in the directory pub/speech. The file is called "TIMIT.mostlikely.Z". Info to klatt-0.02 code: The package klatt-0.02.tar.Z exists on svr-ftp.eng.cam.ac.uk in directory comp.speech/sources. #!/bin/sh # This file translates Allophones as used in the TIMIT database to # phonemes used by klatt-0.02. # phoneme codes that are ok as is: # ix en sed ' s/\/rr/g; s/\/rr/g; s/\/rr/g; s/\/vv/g; s/\/bb/g; s/\/bb/g; s/\/bb/g; s/\/dd/g; s/\/dd/g; s/\/dd/g; s/\/mm/g; s/\/nn/g; s/\/zz/g; s/\//g; s/\/ll/g; s/\/tt/g; s/\/tt/g; s/\/tt/g; s/\/oxr/g; s/\/jj/g; s/\/ss/g; s/\/zz/g; s/\/ff/g; s/\/pp/g; s/\/pp/g; s/\/pp/g; s/\/pp/g; s/\/kk/g; s/\/kk/g; s/\/kk/g; s/\/uw/g; s/\//g; s/\/gg/g; s/\/ww/g; s/\/yu/g; s/\/ /g; s/\/ah/g; s/\/nn/g; s/\/ng/g; s/\/hh/g; s/\/owhh/g; s/\/gg/g; s/\/ /g; ' | sed 's/ \(.\)/\1/g;'