TagSoupDocumentParser

public final class TagSoupDocumentParser


Uses TagSoup to parse html into Documents.

Summary

Public methods

static TagSoupDocumentParser
Document
parse(String html)

Parses the given html into an Document.

Public methods

newInstance

public static TagSoupDocumentParser newInstance()
Throws
org.xml.sax.SAXNotRecognizedException org.xml.sax.SAXNotRecognizedException
org.xml.sax.SAXNotSupportedException org.xml.sax.SAXNotSupportedException

parse

public Document parse(String html)

Parses the given html into an Document.

Throws
org.xml.sax.SAXException org.xml.sax.SAXException
java.io.IOException java.io.IOException