Class TextExtractingVisitor

java.lang.Object
org.htmlparser.visitors.NodeVisitor
org.htmlparser.visitors.TextExtractingVisitor

public class TextExtractingVisitor extends NodeVisitor
Extracts text from a web page. Usage: Parser parser = new Parser(...); TextExtractingVisitor visitor = new TextExtractingVisitor(); parser.visitAllNodesWith(visitor); String textInPage = visitor.getExtractedText();
  • Constructor Details

    • TextExtractingVisitor

      public TextExtractingVisitor()
  • Method Details

    • getExtractedText

      public String getExtractedText()
    • visitStringNode

      public void visitStringNode(Text stringNode)
      Description copied from class: NodeVisitor
      Called for each StringNode visited.
      Overrides:
      visitStringNode in class NodeVisitor
      Parameters:
      stringNode - The string node being visited.
    • visitTag

      public void visitTag(Tag tag)
      Description copied from class: NodeVisitor
      Called for each Tag visited.
      Overrides:
      visitTag in class NodeVisitor
      Parameters:
      tag - The tag being visited.
    • visitEndTag

      public void visitEndTag(Tag tag)
      Description copied from class: NodeVisitor
      Called for each Tag visited that is an end tag.
      Overrides:
      visitEndTag in class NodeVisitor
      Parameters:
      tag - The end tag being visited.