org.htmlparser.visitors
Class TextExtractingVisitor
java.lang.Object
org.htmlparser.visitors.NodeVisitor
org.htmlparser.visitors.TextExtractingVisitor
- public class TextExtractingVisitor
- extends NodeVisitor
Extracts text from a web page.
Usage:
Parser parser = new Parser(...);
TextExtractingVisitor visitor = new TextExtractingVisitor();
parser.visitAllNodesWith(visitor);
String textInPage = visitor.getExtractedText();
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
TextExtractingVisitor
public TextExtractingVisitor()
getExtractedText
public java.lang.String getExtractedText()
visitStringNode
public void visitStringNode(Text stringNode)
- Description copied from class:
NodeVisitor
- Called for each
StringNode visited.
- Overrides:
visitStringNode in class NodeVisitor
- Parameters:
stringNode - The string node being visited.
visitTag
public void visitTag(Tag tag)
- Description copied from class:
NodeVisitor
- Called for each
Tag visited.
- Overrides:
visitTag in class NodeVisitor
- Parameters:
tag - The tag being visited.
visitEndTag
public void visitEndTag(Tag tag)
- Description copied from class:
NodeVisitor
- Called for each
Tag visited that is an end tag.
- Overrides:
visitEndTag in class NodeVisitor
- Parameters:
tag - The end tag being visited.