|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectinfo.ephyra.indexing.AQUAINT2Preprocessor
public class AQUAINT2Preprocessor
A preprocessor for the AQUAINT-2 corpus:
| Field Summary | |
|---|---|
private static java.lang.String |
dir
Directory of the AQUAINT corpus |
| Constructor Summary | |
|---|---|
AQUAINT2Preprocessor()
|
|
| Method Summary | |
|---|---|
private static boolean |
addParagraphTags()
Adds paragraph tags to documents of type 'multi', 'advis' and 'other'. |
private static boolean |
convertToTrectext()
Converts the documents to the 'trectext' format required by Indri. |
static void |
main(java.lang.String[] args)
Entry point of the program. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
private static java.lang.String dir
| Constructor Detail |
|---|
public AQUAINT2Preprocessor()
| Method Detail |
|---|
private static boolean addParagraphTags()
private static boolean convertToTrectext()
public static void main(java.lang.String[] args)
Entry point of the program.
Preprocesses the AQUAINT-2 corpus.
args - argument 1: directory of the AQUAINT-2 corpus
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||