java.lang.Object
  java.lang.Thread
    info.ephyra.search.searchers.Searcher
      info.ephyra.search.searchers.KnowledgeMiner
        info.ephyra.search.searchers.IndriKM

public class IndriKM extends KnowledgeMiner

A KnowledgeMiner that uses the Indri IR system to search a local text corpus. The search results are paragraphs.
Each instance runs as a separate thread, so several queries can be performed in parallel.
This class extends the class KnowledgeMiner.

| Nested Class Summary |
|---|

| Nested classes/interfaces inherited from class java.lang.Thread |
|---|
| java.lang.Thread.State, java.lang.Thread.UncaughtExceptionHandler |

| Field Summary | |
|---|---|
| private static java.lang.String | FORBIDDEN_CHAR: Regular expression that matches characters that cause problems in Indri queries and thus should be removed from query strings. |
| private java.lang.String[] | indriDirs: Directories of Indri indices. |
| private java.lang.String[] | indriUrls: URLs of Indri servers. |
| private static int | MAX_DOCS: Maximum number of documents fetched at a time. |
| private static int | MAX_RESULTS_PERQUERY: Maximum number of search results per query. |
| private static int | MAX_RESULTS_TOTAL: Maximum total number of search results. |

| Fields inherited from class info.ephyra.search.searchers.KnowledgeMiner |
|---|
| firstResult, maxResults |

| Fields inherited from class info.ephyra.search.searchers.Searcher |
|---|
| query, results |

| Fields inherited from class java.lang.Thread |
|---|
| MAX_PRIORITY, MIN_PRIORITY, NORM_PRIORITY |

| Constructor Summary | |
|---|---|
| IndriKM(java.lang.String[] locations, boolean isServers) | Creates a new Indri knowledge miner and sets the directories of indices or the URLs of servers. |

| Method Summary | |
|---|---|
| protected Result[] | doSearch(): Queries the Indri indices or servers and returns an array containing up to MAX_RESULTS_PERQUERY search results. |
| KnowledgeMiner | getCopy(): Returns a new instance of IndriKM. |
| static java.lang.String[][] | getIndriIndices(): Gets a list of all Indri index directories that have been specified with environment variables 'INDRI_INDEX', 'INDRI_INDEX2', 'INDRI_INDEX3' etc. |
| static java.lang.String[][] | getIndriServers(): Gets a list of all Indri server URLs that have been specified with environment variables 'INDRI_SERVER', 'INDRI_SERVER2', 'INDRI_SERVER3' etc. |
| protected int | getMaxResultsPerQuery(): Returns the maximum number of search results per query. |
| protected int | getMaxResultsTotal(): Returns the maximum total number of search results. |
| static java.lang.String | transformQueryString(java.lang.String qs): Returns a representation of the query string that is suitable for Indri. |

| Methods inherited from class info.ephyra.search.searchers.KnowledgeMiner |
|---|
| getResults, getResults, start, start |

| Methods inherited from class info.ephyra.search.searchers.Searcher |
|---|
| run |

| Methods inherited from class java.lang.Thread |
|---|
| activeCount, checkAccess, countStackFrames, currentThread, destroy, dumpStack, enumerate, getAllStackTraces, getContextClassLoader, getDefaultUncaughtExceptionHandler, getId, getName, getPriority, getStackTrace, getState, getThreadGroup, getUncaughtExceptionHandler, holdsLock, interrupt, interrupted, isAlive, isDaemon, isInterrupted, join, join, join, resume, setContextClassLoader, setDaemon, setDefaultUncaughtExceptionHandler, setName, setPriority, setUncaughtExceptionHandler, sleep, sleep, start, stop, stop, suspend, toString, yield |

| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |

| Field Detail |
|---|

private static final int MAX_RESULTS_TOTAL
Maximum total number of search results.

private static final int MAX_RESULTS_PERQUERY
Maximum number of search results per query.

private static final int MAX_DOCS
Maximum number of documents fetched at a time.

private static final java.lang.String FORBIDDEN_CHAR
Regular expression that matches characters that cause problems in Indri queries and thus should be removed from query strings.
Indri allows the following characters:

private java.lang.String[] indriDirs
Directories of Indri indices.

private java.lang.String[] indriUrls
URLs of Indri servers.

| Constructor Detail |
|---|
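
public IndriKM(java.lang.String[] locations, boolean isServers)
Creates a new Indri knowledge miner and sets the directories of indices or the URLs of servers.

Parameters:
locations - directories of indices or URLs of servers
isServers - true iff the first parameter provides URLs of servers

| Method Detail |
|---|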
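
public static java.lang.String[][] getIndriIndices()
Gets a list of all Indri index directories that have been specified with environment variables 'INDRI_INDEX', 'INDRI_INDEX2', 'INDRI_INDEX3' etc.

public static java.lang.String[][] getIndriServers()
Gets a list of all Indri server URLs that have been specified with environment variables 'INDRI_SERVER', 'INDRI_SERVER2', 'INDRI_SERVER3' etc.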
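
The numbered-variable convention used by getIndriIndices() and getIndriServers() could be read along the following lines. This is only a sketch, not the Ephyra implementation: the class IndriEnvSketch, the method numberedValues, the example paths, and the rule of stopping at the first missing variable are all assumptions, and the sketch returns a flat list rather than the String[][] declared above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Ephyra.
final class IndriEnvSketch {

    /**
     * Collects the values of prefix, prefix2, prefix3, ... from the given map.
     * Assumption: scanning stops at the first missing or empty variable.
     */
    static List<String> numberedValues(Map<String, String> env, String prefix) {
        List<String> values = new ArrayList<>();
        for (int i = 1; ; i++) {
            // The first variable has no numeric suffix; later ones are suffixed 2, 3, ...
            String name = (i == 1) ? prefix : prefix + i;
            String value = env.get(name);
            if (value == null || value.isEmpty()) {
                break;
            }
            values.add(value);
        }
        return values;
    }

    public static void main(String[] args) {
        // Example paths are made up; pass System.getenv() to read the real environment.
        Map<String, String> env = new HashMap<>();
        env.put("INDRI_INDEX", "/data/index1");
        env.put("INDRI_INDEX2", "/data/index2");
        System.out.println(numberedValues(env, "INDRI_INDEX"));
    }
}
```

Taking the environment as a Map keeps the helper testable; System.getenv() returns a Map<String, String> and can be passed in directly.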

public static java.lang.String transformQueryString(java.lang.String qs)
Returns a representation of the query string that is suitable for Indri.

Parameters:
qs - query string
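
A regex-based cleanup of the kind described for FORBIDDEN_CHAR might look like the sketch below. The pattern is a placeholder: the real value of FORBIDDEN_CHAR is not shown on this page, so the sketch simply treats everything except letters, digits, and whitespace as forbidden. The class QueryCleanerSketch and the method clean are hypothetical, and replacing a match with a space (instead of deleting it) is a design choice made here so that neighbouring terms do not fuse.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Ephyra.
final class QueryCleanerSketch {

    // Placeholder pattern (assumption): anything that is not a letter, digit, or whitespace.
    // The actual FORBIDDEN_CHAR expression is not documented here.
    private static final Pattern FORBIDDEN_CHAR_PLACEHOLDER =
            Pattern.compile("[^\\p{L}\\p{N}\\s]");

    /** Replaces forbidden characters with spaces, then normalizes whitespace. */
    static String clean(String qs) {
        String stripped = FORBIDDEN_CHAR_PLACEHOLDER.matcher(qs).replaceAll(" ");
        return stripped.trim().replaceAll("\\s+", " ");
    }

    public static void main(String[] args) {
        System.out.println(clean("Who wrote \"Hamlet\"?"));
    }
}
```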
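
protected int getMaxResultsTotal()
Returns the maximum total number of search results.
getMaxResultsTotal in class KnowledgeMiner

protected int getMaxResultsPerQuery()
Returns the maximum number of search results per query.
getMaxResultsPerQuery in class KnowledgeMiner

protected Result[] doSearch()
Queries the Indri indices or servers and returns an array containing up to MAX_RESULTS_PERQUERY search results.
doSearch in class Searcher

public KnowledgeMiner getCopy()
Returns a new instance of IndriKM. A new instance is created for each query.
getCopy in class KnowledgeMiner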
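
The copy-per-query, thread-per-searcher pattern described for getCopy() can be illustrated with plain java.lang.Thread. The sketch below does not use the Ephyra API: SketchSearcher, copyFor, and searchAll are hypothetical names, and the "search" is a placeholder string. It only shows how a fresh instance per query lets several queries run in parallel.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration, not part of Ephyra.
final class ParallelSearchSketch {

    /** Stand-in for a thread-based searcher that handles exactly one query. */
    static class SketchSearcher extends Thread {
        private final String query;
        private volatile String result;

        SketchSearcher(String query) {
            this.query = query;
        }

        /** Creates a fresh instance for a new query, mirroring the copy-per-query idea. */
        SketchSearcher copyFor(String newQuery) {
            return new SketchSearcher(newQuery);
        }

        @Override
        public void run() {
            result = "results for: " + query; // placeholder for the actual search
        }

        String result() {
            return result;
        }
    }

    /** Starts one searcher thread per query and collects the results in query order. */
    static List<String> searchAll(List<String> queries) {
        SketchSearcher prototype = new SketchSearcher("");
        List<SketchSearcher> running = new ArrayList<>();
        for (String q : queries) {
            SketchSearcher s = prototype.copyFor(q);
            s.start();
            running.add(s);
        }
        List<String> results = new ArrayList<>();
        for (SketchSearcher s : running) {
            try {
                s.join(); // wait for this query to finish
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("interrupted while waiting for a search", e);
            }
            results.add(s.result());
        }
        return results;
    }

    public static void main(String[] args) {
        System.out.println(searchAll(Arrays.asList("indri", "ephyra")));
    }
}
```

Because each query gets its own instance, no query state is shared between threads, which is one plausible reason for creating a new instance per query.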