CMU Artificial Intelligence Repository
Home INFO Search FAQs Repository Root

English Lexicon

areas/nlp/corpora/dicts/words/
This directory contains a list of over 100,000 English words transcribed orthographically. The list originally came from Public Brand Software. The original list contained 146,440 words, but contained thousands of duplicate words. Evan Antworth resorted the list and removed the duplicates using the Unix utility uniq. The total number of words is now 109,582. This word list includes inflected forms, such as plural nouns and the -s, -ed and -ing forms of verbs. Thus the number of lexical stems represented in the list is considerably smaller than the total number of words.
Origin:   

   /afs/umich.edu/group/itd/archive/linguistics/lexica

Version: 6-SEP-91 CD-ROM: Prime Time Freeware for AI, Issue 1-1 Contact: Evan Antworth Academic Computing Department Summer Institute of Linguistics 7500 W. Camp Wisdom Road Dallas, TX 75236 U.S.A. Tel: 214-709-2418 Fax: 214-709-3387 Keywords: Authors!Antworth, Corpora, Dictionaries!English, English Lexicon, NLP References: ?
Last Web update on Mon Feb 13 10:26:05 1995
AI.Repository@cs.cmu.edu