The indri applications, BuildIndriIndex, IndriDaemon, and IndriRunQuery accept parameters from either the command line or from a file. The parameter file uses an XML format. The command line uses dotted path notation. The top level element in the parameters file is named parameters.
-memory=100M on the command line. -index=/path/to/repository on the command line.
-corpus.path=/path/to/file_or_directory on the command line. -corpus.class=trecweb on the command line. The known classes are: field entry specifying the metadata fields to index, eg DOCNO. Specified as <metadata><field>fieldname</field></metadata> in the parameter file and as metadata.field=fieldname on the command line. -field.name=fieldname on the command line. true if the field contains numeric data, otherwise the symbol false, specified as <field><numeric>true</numeric></field> in the parameter file and as -field.numeric=true on the command line. This is an optional parameter, defaulting to false. Note that 0 can be used for false and 1 can be used for true.
-stemmer.name=stemmername on the command line. This is an optional parameter with the default of no stemming. -stopper.word=stopword on the command line. This is an optional parameter with the default of no stopping.
-memory=100M on the command line. -index=/path/to/repository on the command line. This element can be specified multiple times to combine Repositories. -server=hostname on the command line. The hostname can include an optional port number to connect to, using the form hostname:portnum. This element can be specified multiple times to combine servers. -count=number on the command line.
( key ":" value ) [ "," key ":" value ]*
Here's an example rule in command line format:
-rule=method:linear,collectionLambda:0.2,field:title
and in parameter file format:
<rule>method:linear,collectionLambda:0.2,field:title</rule>
This corresponds to Jelinek-Mercer smoothing with background lambda equal to 0.2, only for items in a title field.
If nothing is listed for a key, all values are assumed. So, a rule that does not specify a field matches all fields. This makes -rule=method:linear,collectionLambda:0.2 a valid rule.
Valid keys:
Valid methods:
-stopper.word=stopword on the command line. This is an optional parameter with the default of no stopping.
-queryOffset=number on the command line. -runID=someID on the command line. true to produce TREC scorable output, otherwise the symbol false. Specified as <trecFormat>true</trecFormat> in the parameter file and as -trecFormat=true on the command line. Note that 0 can be used for false, and 1 can be used for true.
-fbDocs=number on the command line. -fbTerms=number on the command line. -fbMu=number on the command line. -fbOrigWeight=number on the command line.
-memory=100M on the command line. -index=/path/to/repository on the command line. -port=number on the command line.
1.2.18