This application builds an InvFP index for a collection of documents with properties associated with terms.
To use it, follow the general steps of running a lemur application.
The parameters are:
index
: name of the index to create (don't include extension) indexType
:the type of index to create, "key" (KeyfileIncIndex) or "inv" (InvFPIndex). default is inv memory
: memory (in bytes) of InvFPPushIndex cache (def = 96000000). stopwords
: name of file containing the stopword list. acronyms
: name of file containing the acronym list. countStopWords
: If true, count stopwords in document length. docFormat
:
stemmer
: KstemmerDir
: Path to directory of data files used by Krovetz's stemmer. arabicStemDir
: Path to directory of data files used by the Arabic stemmers. arabicStemFunc
: Which stemming algorithm to apply, one of: dataFiles
: name of file containing list of datafiles to index.