This application (StructQueryEval.cpp) runs retrieval experiments to evaluate the performance of the structured query model using the inquery retrieval method. Feedback is implemented as a WSUM of the original query combined with terms selected using the Rocchio implementation of the TFIDF retrieval method. The expanded query has the form:
wsum( (1-a) <original query> a*w1 t1 a*w2 t2 ... a*wN tN )
feedbackPosCoeff
.
Scoring is either done over a working set of documents (essentially re-ranking), or over the whole collection. This is indicated by the parameter "useWorkingSet". When "useWorkingSet" has a non-zero (integer) value, scoring will be on a working set specified in a file given by "workSetFile". The file should have three columns. The first is the query id; the second the document id; and the last a numerical value, which is ignored. By default, scoring is on the whole collection.
The parameters are:
index
: The complete name of the index table-of-content file for the database index.
QuerySet
: the query text stream parsed by ParseInQuery
resultFile
: the result file
resultCount
: the number of documents to return as result for each query DefaultBelief
: The default belief for a document: Default=0.4 feedbackDocCount
: the number of docs to use for pseudo-feedback (0 means no-feedback)
feedbackTermCount
: the number of terms to add to a query when doing feedback. feedbackPosCoeff
: the coefficient for positive terms in (positive) Rocchio feedback, as implemented for TFIDF.