Main Page   Namespace List   Class Hierarchy   Compound List   File List   Namespace Members   Compound Members   File Members   Related Pages  

IncPassagePushIndex Class Reference

Incremental passage push index. More...

#include <IncPassagePushIndex.hpp>

Inheritance diagram for IncPassagePushIndex:

IncFPPushIndex InvFPPushIndex InvPushIndex PushIndex List of all members.

Public Methods

 IncPassagePushIndex (char *prefix="DefaultIndex", int psgSize=50, int cachesize=128000000, long maxfilesize=2100000000)
 IncPassagePushIndex (int psgSize)
virtual ~IncPassagePushIndex ()
void setPassageSize (int n)
 Set the number of terms per passage.

virtual bool beginDoc (DocumentProps *dp)
 the beginning of a new document, returns true if initiation was successful

virtual bool addTerm (Term &t)
 adding a term to the current document, returns true if term was added successfully.

virtual void doendDoc (DocumentProps *dp, int mgrid)
 signify the end of current document


Private Attributes

int passageSize
 size of passages, in terms.

int passageEnd
 passageSize/2 for overlapping passages.

int psgCounter
 number of passages so far this document.

DocumentPropscurDoc
 the current document

char * curDocName
 the current document name


Detailed Description

Incremental passage push index.

This class builds a incremental passage index assigning termids, docids, tracking locations of term within passages (stopwords not counted). Documents are segmented into passages of size passageSize with an overlap of passageSize/2 terms per passage. It expects that all stopping and stemming (if any) occurs before the term is passed in.


Constructor & Destructor Documentation

IncPassagePushIndex::IncPassagePushIndex char *    prefix = "DefaultIndex",
int    psgSize = 50,
int    cachesize = 128000000,
long    maxfilesize = 2100000000
 

IncPassagePushIndex::IncPassagePushIndex int    psgSize
 

IncPassagePushIndex::~IncPassagePushIndex   [virtual]
 


Member Function Documentation

bool IncPassagePushIndex::addTerm Term   t [virtual]
 

adding a term to the current document, returns true if term was added successfully.

Reimplemented from InvFPPushIndex.

bool IncPassagePushIndex::beginDoc DocumentProps   dp [virtual]
 

the beginning of a new document, returns true if initiation was successful

Reimplemented from InvPushIndex.

void IncPassagePushIndex::doendDoc DocumentProps   dp,
int    mgrid
[virtual]
 

signify the end of current document

Reimplemented from InvFPPushIndex.

void IncPassagePushIndex::setPassageSize int    n
 

Set the number of terms per passage.


Member Data Documentation

DocumentProps* IncPassagePushIndex::curDoc [private]
 

the current document

char* IncPassagePushIndex::curDocName [private]
 

the current document name

int IncPassagePushIndex::passageEnd [private]
 

passageSize/2 for overlapping passages.

int IncPassagePushIndex::passageSize [private]
 

size of passages, in terms.

int IncPassagePushIndex::psgCounter [private]
 

number of passages so far this document.


The documentation for this class was generated from the following files:
Generated on Fri Feb 6 07:12:01 2004 for LEMUR by doxygen1.2.16