CorpusBuilder Tagalog Corpus

The documents in this corpus were collected in January 2001 by the CorpusBuilder system. They were all filtered using van Noord's TextCat language filter. A document is included if TextCat assigned Tagalog as the most probable language. Some documents may contain small amounts of English or other languages, or may be in dialects of Tagalog such as Cebuano. No manual filtering has been performed on these pages. For copyright reasons, we include here only the URLs of the pages. CorpusBuilder, by Ghani, Jones and Mladenic
Rosie Jones (rosie AT cs.cmu.edu)
Last modified: Sat Feb 24 17:04:45 EST 2001