#include <ReutersParser.hpp>
Inheritance diagram for ReutersParser:
Public Methods | |
ReutersParser () | |
void | parseFile (char *filename) |
Parse a file. | |
void | parseBuffer (char *buf, int len) |
Parse a buffer. | |
long | fileTell () |
return the current byte position of the file being parsed | |
Private Methods | |
void | doParse () |
Actual parsing action flow. | |
Private Attributes | |
int | state |
The state of the parser. |
U.S.A., USA's, and USAs are converted to USA. Does not recognize acronyms with numbers.
The following fields are parsed: text, headline, title
|
|
|
Actual parsing action flow.
|
|
return the current byte position of the file being parsed
Implements Parser. |
|
Parse a buffer.
Implements Parser. |
|
Parse a file.
Implements Parser. |
|
The state of the parser.
|