textminer.text
Class DictionaryMaker

java.lang.Object
  |
  +--textminer.text.DictionaryMaker
All Implemented Interfaces:
InterOutcomes

public final class DictionaryMaker
extends java.lang.Object
implements InterOutcomes

The DictionaryMaker class is responsible for generating the unique term dictionary of the given text document collection.

Since:
0.1
Version:
TextMiner 1.1
Author:
Young-Woo Seo (ywseo@cs.cmu.edu)

Field Summary
 
Fields inherited from interface textminer.core.InterOutcomes
ext_condensed_index_file, ext_corpus_stat_file, ext_dvec_file, ext_fsmethod_file, ext_index_file, ext_judgment_file, ext_lexicon_file, ext_matrix_file, ext_model_file, ext_output_file, ext_result_file, ext_termdic_file, ext_vec_index_file
 
Constructor Summary
DictionaryMaker(CorpusIndex index, java.util.Vector activeclasses, java.lang.String data_repository, java.lang.String result_repository, java.lang.String alias, boolean verbose)
          Constructor of DictionaryMaker
 
Method Summary
 boolean isExistTdicfiles()
          Return true if term dictionary files exist
 void readIndex(int option)
          Read instances from CorpusIndex
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DictionaryMaker

public DictionaryMaker(CorpusIndex index,
                       java.util.Vector activeclasses,
                       java.lang.String data_repository,
                       java.lang.String result_repository,
                       java.lang.String alias,
                       boolean verbose)
Constructor of DictionaryMaker

Parameters:
index - a copy of index object for a given data set
activeclasses - true target lables
data_repository - path of data directory
result_repository - path of result directory
alias - task alias
verbose -
Method Detail

isExistTdicfiles

public boolean isExistTdicfiles()
Return true if term dictionary files exist


readIndex

public void readIndex(int option)
Read instances from CorpusIndex

Parameters:
option - specify a particular data set, 0: financial data set