java.lang.Object
  org.apache.xerces.util.ParserConfigurationSettings
      org.cyberneko.html.HTMLConfiguration
public class HTMLConfiguration
An XNI-based parser configuration for parsing HTML documents. It can be used directly to parse HTML documents, or in conjunction with any XNI-based tools, such as the Xerces2 implementation.
This configuration recognizes the following features:
This configuration recognizes the following properties:
For complete usage information, refer to the documentation.
See Also:
HTMLScanner, HTMLTagBalancer, HTMLErrorReporter
Nested Class Summary | |
---|---|
protected class | HTMLConfiguration.ErrorReporter - Defines an error reporter for reporting HTML errors. |
Field Summary | |
---|---|
protected static java.lang.String | AUGMENTATIONS - Include infoset augmentations. |
protected static java.lang.String | BALANCE_TAGS - Balance tags. |
protected static java.lang.String | ERROR_DOMAIN - Error domain. |
protected static java.lang.String | ERROR_REPORTER - Error reporter. |
protected boolean | fCloseStream - Stream opened by parser. |
protected org.apache.xerces.xni.XMLDocumentHandler | fDocumentHandler - Document handler. |
protected HTMLScanner | fDocumentScanner - Document scanner. |
protected org.apache.xerces.xni.XMLDTDContentModelHandler | fDTDContentModelHandler - DTD content model handler. |
protected org.apache.xerces.xni.XMLDTDHandler | fDTDHandler - DTD handler. |
protected org.apache.xerces.xni.parser.XMLEntityResolver | fEntityResolver - Entity resolver. |
protected org.apache.xerces.xni.parser.XMLErrorHandler | fErrorHandler - Error handler. |
protected HTMLErrorReporter | fErrorReporter - Error reporter. |
protected java.util.Vector | fHTMLComponents - Components. |
protected static java.lang.String | FILTERS - Pipeline filters. |
protected java.util.Locale | fLocale - Locale. |
protected NamespaceBinder | fNamespaceBinder - Namespace binder. |
protected HTMLTagBalancer | fTagBalancer - HTML tag balancer. |
protected static java.lang.String | NAMES_ATTRS - Modify HTML attribute names: { "upper", "lower", "default" }. |
protected static java.lang.String | NAMES_ELEMS - Modify HTML element names: { "upper", "lower", "default" }. |
protected static java.lang.String | NAMESPACES - Namespaces. |
protected static java.lang.String | REPORT_ERRORS - Report errors. |
protected static java.lang.String | SIMPLE_ERROR_FORMAT - Simple report format. |
protected static boolean | XERCES_2_0_0 - Parser version is Xerces 2.0.0. |
protected static boolean | XERCES_2_0_1 - Parser version is Xerces 2.0.1. |
protected static boolean | XML4J_4_0_x - Parser version is XML4J 4.0.x. |
Fields inherited from class org.apache.xerces.util.ParserConfigurationSettings |
---|
fFeatures, fParentSettings, fProperties, fRecognizedFeatures, fRecognizedProperties, PARSER_SETTINGS |
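The NAMES_ELEMS and NAMES_ATTRS entries above list three settings for modifying names: "upper", "lower", and "default". The following is a minimal sketch of what such a setting could do to a single name. NameCaseSketch and modifyName are hypothetical names that are not part of NekoHTML, and the sketch assumes that "default" (or any unrecognized value) leaves the name exactly as written in the document.

```java
/** Hypothetical helper illustrating the "upper", "lower", "default" settings. */
final class NameCaseSketch {

    private NameCaseSketch() {
    }

    /**
     * Returns the name modified according to the given setting.
     * Assumption: any value other than "upper" or "lower" means "no change".
     */
    static String modifyName(String name, String setting) {
        if ("upper".equals(setting)) {
            // Locale.ROOT keeps the result independent of the platform locale.
            return name.toUpperCase(java.util.Locale.ROOT);
        }
        if ("lower".equals(setting)) {
            return name.toLowerCase(java.util.Locale.ROOT);
        }
        return name;
    }
}
```

Under these assumptions, `NameCaseSketch.modifyName("Body", "upper")` yields `BODY`, while the "default" setting returns `Body` unchanged.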
Constructor Summary | |
---|---|
HTMLConfiguration() | Default constructor. |
Method Summary | |
---|---|
protected void | addComponent(HTMLComponent component) - Adds a component. |
void | cleanup() - If the application decides to terminate parsing before the XML document is fully parsed, it should call this method to free any resources allocated during parsing. |
org.apache.xerces.xni.XMLDocumentHandler | getDocumentHandler() - Returns the document handler. |
org.apache.xerces.xni.XMLDTDContentModelHandler | getDTDContentModelHandler() - Returns the DTD content model handler. |
org.apache.xerces.xni.XMLDTDHandler | getDTDHandler() - Returns the DTD handler. |
org.apache.xerces.xni.parser.XMLEntityResolver | getEntityResolver() - Returns the entity resolver. |
org.apache.xerces.xni.parser.XMLErrorHandler | getErrorHandler() - Returns the error handler. |
java.util.Locale | getLocale() - Returns the locale. |
boolean | parse(boolean complete) - Parses the document in a pull parsing fashion. |
void | parse(org.apache.xerces.xni.parser.XMLInputSource source) - Parses a document. |
void | pushInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource) - Pushes an input source onto the current entity stack. |
protected void | reset() - Resets the parser configuration. |
void | setDocumentHandler(org.apache.xerces.xni.XMLDocumentHandler handler) - Sets the document handler. |
void | setDTDContentModelHandler(org.apache.xerces.xni.XMLDTDContentModelHandler handler) - Sets the DTD content model handler. |
void | setDTDHandler(org.apache.xerces.xni.XMLDTDHandler handler) - Sets the DTD handler. |
void | setEntityResolver(org.apache.xerces.xni.parser.XMLEntityResolver resolver) - Sets the entity resolver. |
void | setErrorHandler(org.apache.xerces.xni.parser.XMLErrorHandler handler) - Sets the error handler. |
void | setFeature(java.lang.String featureId, boolean state) - Sets a feature. |
void | setInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource) - Sets the input source for the document to parse. |
void | setLocale(java.util.Locale locale) - Sets the locale. |
void | setProperty(java.lang.String propertyId, java.lang.Object value) - Sets a property. |
Methods inherited from class org.apache.xerces.util.ParserConfigurationSettings |
---|
addRecognizedFeatures, addRecognizedProperties, checkFeature, checkProperty, getFeature, getProperty |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Methods inherited from interface org.apache.xerces.xni.parser.XMLParserConfiguration |
---|
addRecognizedFeatures, addRecognizedProperties, getFeature, getProperty |
Field Detail |
---|
protected static final java.lang.String NAMESPACES
protected static final java.lang.String AUGMENTATIONS
protected static final java.lang.String REPORT_ERRORS
protected static final java.lang.String SIMPLE_ERROR_FORMAT
protected static final java.lang.String BALANCE_TAGS
protected static final java.lang.String NAMES_ELEMS
protected static final java.lang.String NAMES_ATTRS
protected static final java.lang.String FILTERS
protected static final java.lang.String ERROR_REPORTER
protected static final java.lang.String ERROR_DOMAIN
protected org.apache.xerces.xni.XMLDocumentHandler fDocumentHandler
protected org.apache.xerces.xni.XMLDTDHandler fDTDHandler
protected org.apache.xerces.xni.XMLDTDContentModelHandler fDTDContentModelHandler
protected org.apache.xerces.xni.parser.XMLErrorHandler fErrorHandler
protected org.apache.xerces.xni.parser.XMLEntityResolver fEntityResolver
protected java.util.Locale fLocale
protected boolean fCloseStream
protected final java.util.Vector fHTMLComponents
protected final HTMLScanner fDocumentScanner
protected final HTMLTagBalancer fTagBalancer
protected final NamespaceBinder fNamespaceBinder
protected final HTMLErrorReporter fErrorReporter
protected static boolean XERCES_2_0_0
protected static boolean XERCES_2_0_1
protected static boolean XML4J_4_0_x
Constructor Detail |
---|
public HTMLConfiguration()
Method Detail |
---|
public void pushInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource)
Hint: To use this feature to insert the output of <SCRIPT> tags, remember to buffer the entire output of the processed instructions before pushing a new input source. Otherwise, events may appear out of sequence.
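The hint can be illustrated with a toy model of an input-source stack. This is a sketch only, not NekoHTML's scanner: InputStackSketch and its methods are hypothetical, and plain java.io.Reader objects stand in for XMLInputSource. It assumes that the most recently pushed source is read to its end before scanning resumes in the source beneath it.

```java
/** Toy model of a stack of input sources; not part of NekoHTML. */
final class InputStackSketch {

    // The innermost (most recently pushed) source sits at the head of the deque.
    private final java.util.Deque<java.io.Reader> sources = new java.util.ArrayDeque<>();

    /** Pushes a new source; reading continues from it until it is exhausted. */
    void pushInputSource(java.io.Reader source) {
        sources.push(source);
    }

    /** Reads one character, popping exhausted sources; returns -1 when all are done. */
    int read() {
        try {
            while (!sources.isEmpty()) {
                int c = sources.peek().read();
                if (c != -1) {
                    return c;
                }
                sources.pop().close();
            }
            return -1;
        } catch (java.io.IOException e) {
            throw new java.io.UncheckedIOException(e);
        }
    }

    /** Pushes fully buffered script output on top of the remaining document text. */
    static String demo() {
        InputStackSketch input = new InputStackSketch();
        input.pushInputSource(new java.io.StringReader("<P>after</P>"));
        // Buffer the entire script output first, then push it as a single source.
        String scriptOutput = "<B>one</B>" + "<B>two</B>";
        input.pushInputSource(new java.io.StringReader(scriptOutput));
        StringBuilder scanned = new StringBuilder();
        for (int c = input.read(); c != -1; c = input.read()) {
            scanned.append((char) c);
        }
        return scanned.toString();
    }
}
```

In this model the buffered script output is consumed in full before the remainder of the document, which is the ordering the hint aims to preserve.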
Parameters:
inputSource - The new input source to start scanning.
public void setFeature(java.lang.String featureId, boolean state) throws org.apache.xerces.xni.parser.XMLConfigurationException
Specified by:
setFeature in interface org.apache.xerces.xni.parser.XMLParserConfiguration
Overrides:
setFeature in class org.apache.xerces.util.ParserConfigurationSettings
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException
public void setProperty(java.lang.String propertyId, java.lang.Object value) throws org.apache.xerces.xni.parser.XMLConfigurationException
Specified by:
setProperty in interface org.apache.xerces.xni.parser.XMLParserConfiguration
Overrides:
setProperty in class org.apache.xerces.util.ParserConfigurationSettings
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException
public void setDocumentHandler(org.apache.xerces.xni.XMLDocumentHandler handler)
Specified by:
setDocumentHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.XMLDocumentHandler getDocumentHandler()
Specified by:
getDocumentHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setDTDHandler(org.apache.xerces.xni.XMLDTDHandler handler)
Specified by:
setDTDHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.XMLDTDHandler getDTDHandler()
Specified by:
getDTDHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setDTDContentModelHandler(org.apache.xerces.xni.XMLDTDContentModelHandler handler)
Specified by:
setDTDContentModelHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.XMLDTDContentModelHandler getDTDContentModelHandler()
Specified by:
getDTDContentModelHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setErrorHandler(org.apache.xerces.xni.parser.XMLErrorHandler handler)
Specified by:
setErrorHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.parser.XMLErrorHandler getErrorHandler()
Specified by:
getErrorHandler in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setEntityResolver(org.apache.xerces.xni.parser.XMLEntityResolver resolver)
Specified by:
setEntityResolver in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public org.apache.xerces.xni.parser.XMLEntityResolver getEntityResolver()
Specified by:
getEntityResolver in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void setLocale(java.util.Locale locale)
Specified by:
setLocale in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public java.util.Locale getLocale()
Specified by:
getLocale in interface org.apache.xerces.xni.parser.XMLParserConfiguration
public void parse(org.apache.xerces.xni.parser.XMLInputSource source) throws org.apache.xerces.xni.XNIException, java.io.IOException
Specified by:
parse in interface org.apache.xerces.xni.parser.XMLParserConfiguration
Throws:
org.apache.xerces.xni.XNIException
java.io.IOException
public void setInputSource(org.apache.xerces.xni.parser.XMLInputSource inputSource) throws org.apache.xerces.xni.parser.XMLConfigurationException, java.io.IOException
Specified by:
setInputSource in interface org.apache.xerces.xni.parser.XMLPullParserConfiguration
Parameters:
inputSource - The document's input source.
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException - Thrown if there is a configuration error when initializing the parser.
java.io.IOException - Thrown on I/O error.
See Also:
parse(boolean)
public boolean parse(boolean complete) throws org.apache.xerces.xni.XNIException, java.io.IOException
Specified by:
parse in interface org.apache.xerces.xni.parser.XMLPullParserConfiguration
Parameters:
complete - True if the pull parser should parse the remaining document completely.
Throws:
org.apache.xerces.xni.XNIException - Any XNI exception, possibly wrapping another exception.
java.io.IOException - An IO exception from the parser, possibly from a byte stream or character stream supplied by the parser.
See Also:
setInputSource(org.apache.xerces.xni.parser.XMLInputSource)
public void cleanup()
Specified by:
cleanup in interface org.apache.xerces.xni.parser.XMLPullParserConfiguration
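The three pull-parsing methods above (setInputSource, parse(boolean), and cleanup) are typically driven from a loop. The following is a minimal sketch of that calling pattern against a toy stand-in rather than the real class. PullLoopSketch, PullConfiguration, and TokenConfiguration are hypothetical, a String replaces XMLInputSource, and the sketch assumes that parse(false) returns true while more of the document remains to be parsed.

```java
/** Sketch of a pull-parsing driver loop; all types here are hypothetical. */
final class PullLoopSketch {

    /** Stand-in mirroring the three pull-parsing methods described above. */
    interface PullConfiguration {
        void setInputSource(String document);

        /** Assumption: returns true while more of the document remains. */
        boolean parse(boolean complete);

        void cleanup();
    }

    /** Toy configuration: each parse(false) call consumes one whitespace-separated token. */
    static final class TokenConfiguration implements PullConfiguration {
        private java.util.Iterator<String> tokens = java.util.Collections.emptyIterator();
        final java.util.List<String> events = new java.util.ArrayList<>();
        boolean cleanedUp;

        public void setInputSource(String document) {
            tokens = java.util.Arrays.asList(document.trim().split("\\s+")).iterator();
            cleanedUp = false;
        }

        public boolean parse(boolean complete) {
            do {
                if (!tokens.hasNext()) {
                    return false;
                }
                events.add(tokens.next());
            } while (complete); // complete == true drains the remaining input
            return tokens.hasNext();
        }

        public void cleanup() {
            tokens = java.util.Collections.emptyIterator();
            cleanedUp = true;
        }
    }

    /** Pulls one step at a time; stops early and cleans up once stopAt has been seen. */
    static java.util.List<String> parseUntil(TokenConfiguration config, String document, String stopAt) {
        config.setInputSource(document);
        boolean more = true;
        while (more) {
            more = config.parse(false);
            if (config.events.contains(stopAt)) {
                // Terminating before the document is fully parsed: free resources.
                config.cleanup();
                break;
            }
        }
        return config.events;
    }
}
```

The early exit is the case in which cleanup() matters: the loop stops before the document has been fully parsed, so the application releases whatever was allocated during parsing.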
protected void addComponent(HTMLComponent component)
protected void reset() throws org.apache.xerces.xni.parser.XMLConfigurationException
Throws:
org.apache.xerces.xni.parser.XMLConfigurationException