#include <SimpleKLDocModel.hpp>
Inheritance diagram for SimpleKLDocModel:

Public Methods | |
| SimpleKLDocModel (int docID, UnigramLM &collectLM) | |
| ~SimpleKLDocModel () | |
| virtual double | termWeight (int termID, DocInfo *info) |
| term weighting function, weight(w) = p_seen(w)/p_unseen(w) | |
| virtual double | scoreConstant () |
| doc-specific constant term in the scoring formula | |
| virtual double | unseenCoeff ()=0 |
| a(d) | |
| virtual double | seenProb (double termFreq, int termID)=0 |
| p(w|d), w seen | |
Protected Attributes | |
| UnigramLM & | refLM |
abstract interface of doc representation for smoothed document unigram model
adapt a smoothed document language model interface to a DocumentRep interface
p(w|d) = q(w|d) if w seen
= a(d) * Pc(w) if w unseen
where, a(d) controls the probability mass allocated to all unseen words and Pc(w) is the collection language model
|
||||||||||||
|
|
|
|
|
|
|
doc-specific constant term in the scoring formula
Implements DocumentRep. |
|
||||||||||||
|
p(w|d), w seen
Implemented in JelinekMercerDocModel, DirichletPriorDocModel, AbsoluteDiscountDocModel, and TwoStageDocModel. |
|
||||||||||||
|
term weighting function, weight(w) = p_seen(w)/p_unseen(w)
Implements DocumentRep. |
|
|
a(d)
Implemented in JelinekMercerDocModel, DirichletPriorDocModel, AbsoluteDiscountDocModel, and TwoStageDocModel. |
|
|
|
1.2.18