XML/HTML parsers
 
Description

The HTML parser converts HTML to a format we call lhtml which is a natural encoding of HTML in Lisp lists. lhtml can be converted back into HTML using the htmlgen package that's part of AllegroServe.

The XML parser is a non-validating parser that converts XML into lxml, a format similar to lhtml.

Author
Steve Jacobson and John Foderaro, with additional work by Steve Haflich, and others at Franz Inc.
Author's Comments
The XML parser is evolving and in the future may differ substantially from the current version.
Release History
Documentation
References
Platform
The HTML parser works on Allegro Common Lisp versions 5.0.1 and later although if your HTML code includes 16-bit character entities the parser will only be able to represent these characters in the International version of Allegro Common Lisp version 6.0 or later.

The XML parser works on Allegro Common Lisp versions 6.0 or later as it requires Unicode support.

Building
The HTML parser need only be compiled and loaded into Lisp.

To build the XML parser, load build.cl.

Files

Download the source code:

Download the binaries for Allegro CL 6.0:

Use sys:update-allegro to download the latest version for your platform.

License

The xmlutils source code is licensed under the terms of the Lisp Lesser GNU Public License, known as the LLGPL. The LLGPL consists of a preamble and the LGPL. Where these conflict, the preamble takes precedence. xmlutils is referenced in the preamble as the LIBRARY.

Open Source
This project is hosted on the http://opensource.franz.com site. There is an informal community support and development mailing list (opensource@franz.com) for these open source projects. We encourage you to take advantage by subscribing to the list. Click here to subscribe or unsubscribe. Once you're subscribed, send email to opensource@franz.com with your questions, comments, suggestions, and patches.