Newsgroups: comp.speech
Path: cantaloupe.srv.cs.cmu.edu!das-news2.harvard.edu!news2.near.net!news.mathworks.com!uhog.mit.edu!bloom-beacon.mit.edu!spool.mu.edu!howland.reston.ans.net!news.sprintlink.net!EU.net!uunet!in1.uu.net!news.iij.ad.jp!wnoc-tyo-news!aist-nara!wnoc-kyo-news!atrwide!atr-la!daemon
From: andrew@itl.atr.co.jp (Andrew Hunt)
Subject: Reminder: comp.speech FAQ (Frequently Asked Questions)
Message-ID: <D9vp7n.5HB@itl.atr.co.jp>
Sender: daemon@itl.atr.co.jp (Mr Background)
Organization: ATR Interpreting Telecommunications Research Labs., Japan
Date: Thu, 8 Jun 1995 23:57:23 GMT
Lines: 257



                          COMP.SPEECH FAQ WEEKLY REMINDER

   A Frequently Asked Questions (FAQ) posting is available for the comp.speech
   newsgroup. It covers a range of speech technology issues and provides
   details on over 150 speech technology products, software packages and
   resources. Please check the FAQ before posting a request for information.
   The list of software and products and the FAQ contents are included below.

   The FAQ is posted every 4 weeks to comp.speech, comp.answers and
   news.answers. (This reminder is posted weekly to comp.speech.)

   It is also available by ftp from the comp.speech archive site:
     *  ftp://svr-ftp.eng.cam.ac.uk/pub/comp.speech/FAQ-complete

   Or from the news.answers ftp site (and its mirrors):
     *  ftp://rtfm.mit.edu/pub/usenet/comp.speech/*

   Or on the World Wide Web:
     * http://www.speech.su.oz.au/comp.speech
     * http://svr-www.eng.cam.ac.uk/comp.speech

   Or by sending email to mail-server@rtfm.mit.edu with the following line in
   the body of the message:
     * send usenet/news.answers/comp-speech-faq/*


___________________________________________________________________________

                                 LIST OF QUESTIONS

  FAQ SECTION 1: GENERAL INFORMATION ON SPEECH TECHNOLOGY

          * Q1.1: What is comp.speech?
          * Q1.2: Where are the comp.speech archives?
          * Q1.3: Common abbreviations and jargon
          * Q1.4: Related newsgroups and mailing lists
          * Q1.5: Related journals and conferences
          * Q1.6: Handicap Aids
          * Q1.7: What speech data is available?
          * Q1.8: Speech File Formats and Conversion
          * Q1.9: Speech Laboratory Environments
          * Q1.10: Miscellaneous Software and Resources

  FAQ SECTION 2: SIGNAL PROCESSING

          * Q2.1: What sampling do I need for speech?
          * Q2.2: How do I find the pitch of a speech signal?
          * Q2.3: How do I find the start and end points of a speech signal?
          * Q2.4: Where can I find FFT software?
          * Q2.5: Signal processing in speech technology
          * Q2.6: Speech sampling and signal processing hardware
          * Q2.7: How do I convert to/from mu-law format?

  FAQ SECTION 3: SPEECH CODING AND COMPRESSION

          * Q3.1: Speech compression techniques
          * Q3.2: References on coding/compression
          * Q3.3: Compression and Coding Software

  FAQ SECTION 4: NATURAL LANGUAGE PROCESSING

          * Q4.1: NLP References and Books
          * Q4.2: NLP Software

  FAQ SECTION 5: SPEECH SYNTHESIS

          * Q5.1: What is speech synthesis?
          * Q5.2: How can speech synthesis be performed?
          * Q5.3: References/Books on Synthesis
          * Q5.4: The WWW on Speech Synthesis
          * Q5.5: Speech Synthesis Software/Hardware

  FAQ SECTION 6: SPEECH RECOGNITION

          * Q6.1: What is speech recognition?
          * Q6.2: How is speech recognition performed?
          * Q6.3: How can I build a simple speech recogniser?
          * Q6.4: References & books on speech recognition
          * Q6.5: Speech Recognition Hardware/Software


___________________________________________________________________________

                       LIST OF SOFTWARE/HARDWARE/INFORMATION

   The comp.speech FAQ provides information on a range of software, hardware
   and resources.

Q1.7: Speech Data

          * BUPT Spoken Digit Database (Chinese)
          * Center for Spoken Language Understanding (CSLU)
          * Linguistic Data Consortium (about 20 corpora)
          * NOISEX
          * Oxford Acoustic Phonetic Database
          * PhonDat - Database of Spoken German
          * Phonemic Samples

Q1.9: Speech Processing Environments

          * CSRE: Canadian Speech Research Environment
          * Entropic Signal Processing System (ESPS) and Waves
          * Kay Elemetrics Computer Speech Lab
          * Khoros
          * Matlab plus Signal Processing Toolbox
          * MacSpeech Lab II
          * N!Power
          * OGI Speech Tools
          * Ptolemy
          * Signalyze 3.0

Q1.10: Miscellaneous Software and Resources

  NETWORK "PHONE" SOFTWARE

          * NEVOT (1.4v) from AT&T BL
          * Internet Phone from VocalTec

  AUDIO PROCESSING SOFTWARE

          * AF version AF3R1
          * MixViews
          * Network Audio System Release 1.1
          * NIST Sphere Library

  HUMAN AUDIO PERCEPTION

          * Auditory Modeller 1
          * Auditory Modeller 2
          * Auditory Toolbox for Matlab
          * Human Audio Perception Document

  DICTIONARIES AND OTHER LEXICAL TOOLS

          * BEEP dictionary
          * CMU dictionary
          * CUVOLAD dictionary
          * Dictionary
          * Homophone List
          * MRC database
          * Dictionaries on the WWW

  PHONETIC FONTS

          * Summer Institute of Linguistics IPA Fonts

Q2.6: Audio Hardware

          * Macintosh Audio Hardware
          * PC Audio Hardware
          * Unix Audio Hardware

Q3.3: Compression Software and Hardware

          * 32 kbps ADPCM
          * CELP 3.2a & LPC
          * 8 Kbit/s CELP on the TMS320C5x family of DSP chips
          * File format conversion
          * G.711/721/723 Compression
          * G.728 LD-CELP vocoder
          * G.728 Compression
          * GSM 06.10 Compression
          * Lernout & Hauspie Speech Coding (5 products)
          * Lernout & Hauspie Speech Coding SDK
          * shorten - a lossless compressor for speech signals
          * U.S.F.S. 1016 CELP vocoder for DSP56001

Q4.2: Natural Language Processing

          * Natural Language Software Registry (NLSR) - NLP Tools
          * Part of Speech Tagger

Q5.5: Speech Synthesis

          * AsTeR
          * TheBigMouth
          * CSRE: Canadian Speech Research Environment
          * DECTalk
          * Eloquence
          * Infovox Product Range
          * JSRU
          * Klatt-style synthesiser
          * Lernout & Hauspie Text-To-Speech (3 products)
          * Lernout & Hauspie Text-To-Speech Windows SDK
          * Mac Speech Output Applications
          * MacinTalk
          * Monologue
          * Narrator
          * TextToSpeech Kit (NeXT)
          * Orator from Bellcore
          * rsynth
          * SENSYN speech synthesizer
          * SGI Developers Toolbox Synthesiser
          * SIMTEL
          * Sound Bytes Developer's Kit
          * spchsyn.exe
          * Speak
          * Speech Manager and PlainTalk
          * Text to phoneme program 1
          * Text to phoneme program 2
          * Text to phoneme program 3
          * Tinytalk
          * TrueTalk
          * Text to speech program

Q6.5: Speech Recognition

          * BBN Hark Recogniser
          * Corona
          * Custom Voice(TM) by A&G Graphics Interface
          * D6006 Voice Control Processor
          * DATAVOX - French
          * DragonDictate version 3.0
          * DragonDictate for Windows
          * DragonVoiceTools
          * DSP Semiconductor Recognition Chip
          * EARS: Single Word Recognition Package
          * HM2007 - Speech Recognition Chip
          * Hidden Markov Model Toolkit (HTK) from Entropic 
          * IBM VoiceType Dictation
          * ICSS system from IBM
          * IN3 Voice Command
          * IN3 Voice Command for Windows
          * Kurzweil Voice for Windows
          * Lernout & Hauspie ASR (3 products)
          * Lernout & Hauspie ASR SDK
          * Listen for Windows - Verbex Voice Systems
          * Lotec Speech Recognition Package
          * Myers' Hidden Markov Model software
          * OKI VRP6679 - Speech Recognition Chip
          * Speech Systems Phonetic Engine 400 (PE400)
          * Speech Systems Phonetic Engine 500 (PE500)
          * PowerSecretary
          * recnet
          * SayIt
          * Simon Says - for NeXT
          * Speech Commander - Verbex Voice Systems
          * Voice Command Line Interface
          * Visus SpeechKit
          * Voice-Trek 2.0
          * Creative VoiceAssist
          * Voice Blaster Ver. 4.0
          * VoiceServer for Windows
          * Votan




 ---

Andrew Hunt                      andrew@itl.atr.co.jp
Interpreting Telecommunications Research Laboratories
Advanced Telecommunications Research Institute 
Hikari-dai 2-2, Seika-cho, Kyoto 619-02 Japan
Tel 07749 5 1390, Fax: 07749 5 1308
