Index of /afs/cs.cmu.edu/academic/class/15440-f11/code/class17/wordcount
Name Last modified Size Description
Parent Directory -
WordCountDriver.java 16-Jul-2007 13:30 1.0K
WordCountMapper.java 16-Jul-2007 13:42 957
WordCountReducer.java 13-Jul-2007 16:18 780
This directory contains a Hadoop implementation of wordcount:
Generating a list of all words in a set of documents, along with
counts of the number of times each occurs.
The main program, WordCountDriver, takes as arguments the pathname of
a directory containing input files, and a pathname of a directory
where it should store its result files.
The code was written for Hadoop version 0.13.0. It is not guaranteed
to work with more recent versions of Hadoop.
Files:
WordCountDriver.java: The main program
WordCountMapper.java: Implements the map function
WordCountReducer.java: Implements the reduce function