Main Page   Namespace List   Class Hierarchy   Compound List   File List   Namespace Members   Compound Members   File Members   Related Pages  

Structured Query Evaluation Application

This application (StructQueryEval.cpp) runs retrieval experiments to evaluate the performance of the structured query model using the inquery retrieval method. Feedback is implemented as a WSUM of the original query combined with terms selected using the Rocchio implementation of the TFIDF retrieval method. The expanded query has the form:

wsum( (1-a) <original query>
      a*w1  t1
      a*w2  t2
      ...
      a*wN  tN
      )


where a is the value of the parameter feedbackPosCoeff.

Scoring is either done over a working set of documents (essentially re-ranking), or over the whole collection. This is indicated by the parameter "useWorkingSet". When "useWorkingSet" has a non-zero (integer) value, scoring will be on a working set specified in a file given by "workSetFile". The file should have three columns. The first is the query id; the second the document id; and the last a numerical value, which is ignored. By default, scoring is on the whole collection.

The parameters are:

  1. index: The complete name of the index table-of-content file for the database index.

  2. QuerySet: the query text stream parsed by ParseInQuery

  3. resultFile: the result file

  4. resultCount: the number of documents to return as result for each query
  5. DefaultBelief: The default belief for a document: Default=0.4
  6. feedbackDocCount: the number of docs to use for pseudo-feedback (0 means no-feedback)

  7. feedbackTermCount: the number of terms to add to a query when doing feedback.
  8. feedbackPosCoeff: the coefficient for positive terms in (positive) Rocchio feedback, as implemented for TFIDF.

Generated on Mon Sep 30 14:14:32 2002 for LEMUR by doxygen1.2.18