JAXP SAX: Difference between revisions

From NovaOrdis Knowledge Base
Jump to navigation Jump to search
Line 19: Line 19:
When an XML tag is recognized, the parser invokes the corresponding methods (<tt>startDocument()</tt>, <tt>startElement()</tt>), ...) on the <tt>ContentHandler</tt> implementation.  
When an XML tag is recognized, the parser invokes the corresponding methods (<tt>startDocument()</tt>, <tt>startElement()</tt>), ...) on the <tt>ContentHandler</tt> implementation.  


The <tt>ErrorHandler</tt> implementation is messaged on various parsing errors. The default implementation is rudimentary, if a more nuanced behavior is necessary, the handler must be implemented. Note that [[JAXP DOM]] and SAX parsers handle errors in a similar manner, the same exceptions are generated so the error handling code is virtually identical.
<span in="error_handling">The <tt>ErrorHandler</tt> implementation</span> is messaged on various parsing errors. The default implementation is rudimentary, if a more nuanced behavior is necessary, the handler must be implemented. Note that [[JAXP DOM]] and SAX parsers handle errors in a similar manner, the same exceptions are generated so the error handling code is virtually identical.


SAX parsers have low memory requirements, as they don't construct an internal representation of the XML document.
SAX parsers have low memory requirements, as they don't construct an internal representation of the XML document.

Revision as of 02:25, 11 November 2016

External

Internal

Overview

A SAX parser implements event-driven, serial-access push parsing. It uses a streaming model.

SAX parsers can only be used for state-independent processing, where the handling of an element does not depend on elements that came before, unlike StAX, which can be used for state-dependent processing.

SAX is a read-only API, XML documents can only be read with SAX, not written.

Start by generating a parser instance with SAXParserFactory. The actual implementation of the parser implements the SAXParser interface and is determined by the value of javax.xml.parsers.SAXParserFactory system property. Then call parsers's parse() method. The parser contains a SAXReader instance, which invokes callback methods the application must implement. The methods are defined by the ContentHandler, ErrorHandler, DTDHandler and EntityResolver interfaces.

When an XML tag is recognized, the parser invokes the corresponding methods (startDocument(), startElement()), ...) on the ContentHandler implementation.

The ErrorHandler implementation is messaged on various parsing errors. The default implementation is rudimentary, if a more nuanced behavior is necessary, the handler must be implemented. Note that JAXP DOM and SAX parsers handle errors in a similar manner, the same exceptions are generated so the error handling code is virtually identical.

SAX parsers have low memory requirements, as they don't construct an internal representation of the XML document.

The SAX parser provide access to the original document location information (line and column), via the Locator injected into the ContentHandler. For an example, see Location in an XML document.

For a working example of SAX parsing, see SAX Examples below.

Difference between Pull Parsing and Push Parsing

Difference between Pull Parsing and Push Parsing

SAX Examples

SAX Examples

Component Packages