Newsgroups: alt.usage.english,sci.lang
Path: cantaloupe.srv.cs.cmu.edu!bb3.andrew.cmu.edu!newsfeed.pitt.edu!newsflash.concordia.ca!news.nstn.ca!ott.istar!istar.net!van.istar!west.istar!n1van.istar!van-bc!unixg.ubc.ca!info.ucla.edu!agate!howland.erols.net!torn!sq!lee
From: lee@sq.com (Liam R. E. Quin)
Subject: Re: English word frequency
Message-ID: <1996Aug11.030119.7573@sq.com>
Organization: SoftQuad Inc., Toronto, Canada
X-Feet: bare, naked, happy.  Please remove your shoes now.
References: <4uf2cv$716@yama.mcc.ac.uk> <wilbadenDvxI02.xw@netcom.com>
Date: Sun, 11 Aug 1996 03:01:19 GMT
Lines: 19

W.Baden <wilbaden@netcom.com> wrote:
> Try ftp://vaxsar.vassar.edu/pub/nlp.dir

should be
> Try ftp://vaxsar.vassar.edu/nlp/

I'm not sure of the provenance of these wordlists; they seem a little odd.
(e.g. kwc gives `said' as the 11th most common word (36419 occurrences;
it gives . as most common with 344284, and `the' with 265693; `The' is
listed separately with 45015).

Lee


-- 
Liam Quin, SoftQuad Inc    | lq-text freely available Unix text retrieval
lee@sq.com +1 416 239 4801 | FAQs: Metafont fonts, OPEN LOOK UI, OpenWindows
SGML: http://www.sq.com/   |`Consider yourself... one of the family...
The barefoot programmer    | consider yourself... At Home!' [the Artful Dodger]
