SunQuest
 
       Python
  Home arrow Python arrow Page 2 - Parsing XML with SAX and Python
Dev Shed Forums 
Administration  
AJAX  
Apache  
BrainDump  
DHTML  
Flash  
Java  
JavaScript  
Multimedia  
MySQL  
Oracle  
Perl  
PHP  
Practices  
Python  
Reviews  
Security  
Style-Sheets  
Web Services  
XML  
Zend  
Zope  
Forums Sitemap 
IBM® developerWorks 
Sun Developer Network 
Dedicated Servers 
E-Commerce Hosting 
Linux Web Hosting 
Managed Hosting 
Small Business Hosting 
Actuate Whitepapers 
VeriSign Whitepapers 
VPS Hosting 
Weekly Newsletter

 
Developer Updates  
Free Website Content 
SunQuest
 RSS  Articles
 RSS  Forums
 RSS  All Feeds
Write For Us Get Paid 
Request Media Kit
Contact Us 
Site Map 
Privacy Policy 
Support 
 USERNAME
 
 PASSWORD
 
 
  >>> SIGN UP!  
  Lost Password? 
PYTHON

Parsing XML with SAX and Python
By: Nadia Poulou
  • Search For More Articles!
  • Disclaimer
  • Author Terms
  • Rating: 4 stars4 stars4 stars4 stars4 stars / 12
    2003-01-21

    Table of Contents:
  • Parsing XML with SAX and Python
  • The xml.sax Package
  • Our SAX Parser
  • Homework
  • Conclusion

  • Rate this Article: Poor Best 
      ADD THIS ARTICLE TO:
      Del.ici.ous Digg
      Blink Simpy
      Google Spurl
      Y! MyWeb Furl
    Email Me Similar Content When Posted
    Add Developer Shed Article Feed To Your Site
    Email Article To Friend
    Print Version Of Article
    PDF Version Of Article
     
     
    ADVERTISEMENT

    AT&T devCentral & BlackBerry(r) Webcast Series: BlackBerry and GPS -Build Location Awareness into your BlackBerry Applications, July 10th -1:00PM EST. Register Today!

    Parsing XML with SAX and Python - The xml.sax Package


    (Page 2 of 5 )

    SAX is a simple API for XML. The package xml.sax and its sub packages provide a Python implementation of the SAX interface.

    The structure of a SAX application should include one or more input sources, parser and handler objects. The idea is as follows: a parser reads the bytes or characters from the input source and fires a sequence of events on the handler. In this document and in the Python documentation the term ‘reader’ is preferred over ‘parser’.

    The SAX API defines four basic interfaces. Since Python does not support interfaces, these SAX interfaces are implemented in the xml.sax.handler module as the following Python classes:
    1. ContentHandler: this implements the main SAX interface for handling document events. It is also the interface which we will use in the example of the next section
    2. DTDHandler: class for handling DTD events
    3. EntityResolver: class for resolving external entities
    4. ErrorHandler: as the name suggests, this class is used for reporting all errors and warnings.
    I would like to mention here the presence of the DefaultHandler class from the xml.sax.saxutils package that inherits from all four interfaces above. An application needs to implement only the interfaces it needs, as will be shown by the following example.

    Now we have checked out the interfaces, it’s time to see the basic methods of the xml.sax package. These are:

    make_parser() - This will create and return an SAX XMLReader object. Notice that the xml.sax readers are non-validating.

    parse(filename, handler) - This will create a parser and parse the given document (which can be passed either as a file object or as a stream). The handler is one of the SAX interfaces we mentioned above.

    A reader and a handler can be connected with the appropriate method (for example setContentHandler() for a ContentHandler object). Once this happens, the reader will notify of parsing events through the methods of the handler. In the following example, the methods startElement(), endElement() and characters() of the ContentHandler illustrate this procedure.

    We will not go into error handling details in this document, but xml.sax provides enough exception classes for your programming needs. In the Python reference documentation you may find more details.

    Enough with the theory, let’s move on to a hands-on example.

    More Python Articles
    More By Nadia Poulou


     

       

    PYTHON ARTICLES

    - SSH with Twisted
    - Mobile Programming in Python using PyS60: UI...
    - Python: Count on It
    - Python Strings: Spinning Yarns
    - Python: More Fun with Strings
    - Python: Stringing You Along
    - Python Operators
    - Bluetooth Programming in Python: Network Pro...
    - Python Sets
    - Python Conditionals, Lists, Dictionaries, an...
    - Python: Input and Variables
    - Introduction to Python Programming
    - Mobile Programming in Python using PyS60: Ge...
    - Bluetooth Programming using Python
    - Finishing the PyMailGUI Client: User Help To...

    IBM developerWorks




    © 2003-2008 by Developer Shed. All rights reserved. DS Cluster 2 hosted by Hostway