Date: Mon, 24 Jul 95 15:57:37 EDT
From: LDC Office
Posted-Date: Mon, 24 Jul 95 15:57:37 EDT
Message-Id: <9507241957.AA17013@unagi.cis.upenn.edu>
To: ldc-members@unagi.cis.upenn.edu
Subject: Pronlex Version 0.2
COMLEX English Pronouncing Lexicon (PRONLEX)
A new edition, Version 0.2, of the LDC's COMLEX English Pronouncing
Lexicon, also known as PRONLEX, is now available by ftp to current
(1995) LDC members who have signed license agreements. (Instructions
for retrieval are provided on receipt of license agreements. If you
received the previous version since September, the same instructions
will work now.)
Version 0.2 contains 90,694 entries, adding any missing forms of the
lemmas from COMLEX Syntax to the existing coverage of WSJ30K, WSJ64K,
and Switchboard. This version also contains a number of corrections
and revisions to the previous release (0.1 in February 1995).
The PRONLEX documentation, PRONUNCIATION, which is accessible by
anonymous ftp at the address listed below, describes the principles
observed for word transcription and the treatment of pronunciation
variation. Please see the README file for instructions on providing
much-valued feedback on the lexicon.
PRONLEX Version 0.2 was created under the direction of Cynthia
McLemore at the Linguistic Data Consortium, with research assistant
Paul Kingsbury coordinating transcription activities. It is part of
the COMmon LEXical Database of English (COMLEX), a project which is
also producing or co-producing a dictionary of syntactic features
(COMLEX Syntax), a database of word senses (Wordnet, available free
from Princeton), and several annotated corpora linked to one or more
of these.
License forms for the Pronouncing and Syntax dictionaries are
available by ftp in either postscript or latex form, at
ftp.cis.upenn.edu, in the directory pub/ldc/license_forms. LDC
members receive PRONLEX free; nonmembers may purchase a research-use
license for $10000.
[Payment information may be obtained from Sarah Parnum at
ldc@unagi.cis.upenn.edu.]