Newsgroups: comp.speech
Path: pavo.csi.cam.ac.uk!warwick!pipex!uunet!munnari.oz.au!metro!brutus!andrewh
From: andrewh@brutus.ee.su.oz.au (Andrew Hunt)
Subject: Re: telephony speech database
Message-ID: <1993Jul28.071433.16316@ucc.su.OZ.AU>
Keywords: Speech database
Sender: andrewh@brutus (Andrew Hunt)
Nntp-Posting-Host: brutus.ee.su.oz.au
Organization: University of Sydney, Speech Group
References:  <CAsDqn.47x@carmen.logica.co.uk>
Date: Wed, 28 Jul 1993 07:14:33 GMT
Lines: 41

In article <CAsDqn.47x@carmen.logica.co.uk>, you write:
|> I am looking for publically/commercially available telephony speech database of
|> any language. e.g. English, Russian, Mandarin, etc. So long as they have been
|> collectd over telephone lines/channels.


From the FAQ posting...

 > Center for Spoken Language Understanding (CSLU)
 >
 > 2. CSLU has a telephone speech corpus of 1000 English alphabets.  Callers
 > recite the alphabet with brief pauses between letters.  This database is
 > available to not-for-profit institutions for $100. The data base is described
 > in the proceedings of the International Conference on Spoken Language
 > Processing.  Contact vincew@cse.ogi.edu if interested.



Also, from the Linguistic Data Consortium (LDC) you can get NTIMIT
which is a telephone version of TIMIT.  Both of these are large 
speaker independent databases of read sentences.  Available on CD-ROM.  
For further information contact

  Elizabeth Hodas
  441 Williams Hall
  University of Pennsylvania
  Philadelphia, PA 19104-6305
  Phone:   (215) 898-0464
  Fax:     (215) 573-2175
  e-mail:  ehodas@walnut.ling.upenn.edu


Both these databases are in English.

I have heard there are quite a few more being collected.  If anyone
can give me information on them I will include it in the FAQ posting.

Cheers,

Andrew Hunt
<andrewh@ee.su.oz.au>
