|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectinfo.ephyra.indexing.AQUAINTPreprocessor
public class AQUAINTPreprocessor
A preprocessor for the AQUAINT corpus:
| Field Summary | |
|---|---|
private static java.lang.String |
dir
Directory of the AQUAINT corpus |
| Constructor Summary | |
|---|---|
AQUAINTPreprocessor()
|
|
| Method Summary | |
|---|---|
private static boolean |
addParagraphTags()
Adds paragraph tags if missing. |
static void |
main(java.lang.String[] args)
Entry point of the program. |
private static boolean |
splitParagraphs()
Splits paragraphs, e.g. to separate publisher details. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
private static java.lang.String dir
| Constructor Detail |
|---|
public AQUAINTPreprocessor()
| Method Detail |
|---|
private static boolean addParagraphTags()
private static boolean splitParagraphs()
public static void main(java.lang.String[] args)
Entry point of the program.
Preprocesses the AQUAINT corpus.
args - argument 1: directory of the AQUAINT corpus
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||