This application ( GenL2Norm.cpp
) generates a support file for retrieval using the cosine similarity. The file contains the L2 norms for each document, used to speed up the retrieval process.
To run the application, follow the general steps of running a lemur application and set the following variables in the parameter file:
(1) index
: the table-of-content (TOC) record file of the index (e.g., the .bsc
file created by BuildBasicIndex
or the .ifp
file created by PushIndexer
. )
(2) L2File
: file path for the support file (e.g., /usr0/mydata/index.L2
)
This application is also a good example of using the doc index (i.e., doc->term index).