XML and SGML Resources

General Information about XML and SGML
Including information about computing resources at the University. There is also a PDF version of this document (without anchors) that is suitable for printing.

Reference Card
Tools that are presently locally available on the campus Unix network.

XML site at W3C
The World Wide Web Consortium XML site is the location for the latest and most authoritative information on many XML topics. For a quick read see XML in 10 Points, http://www.w3.org/XML/1999/XML-in-10-points.

Various kinds of information about XML sponsored by the publisher O'Reilly & Associates, Inc.

A compact index of XML resources

XML at 5
An article by Dave Hollander and C. M. Sperberg-McQueen, cochairs of the W3C XML Coordination Group, February 10, 2003.

XML4Lib: A Discussion List about XML for Librarians
Begun February, 2001.

DocBook XML and Debian Stuff
Well done site by Mark Johnson, hosted by the Duke University Linux User Group with a focus on DocBook.

XML and the Second Generation Web
An article in the May 1999 issue of Scientific American by Jon Bosak of Sun Microsystems and Tim Bray of Textuality, both pioneers.

Languages for the World Wide Web
At the Web Developers Virtual Library. Includes a tutorial introduction to XML.

Haskell and XML: Generic Combinators or Type-Based Translation?
Article by Malcolm Wallace and Colin Runciman discussing a general framework for XML processing. Accompanied by software.

XML Processing in Python

Cost: an SGML translator
Quoted summary: Cost is a structure-controlled SGML application programming tool. It is implemented as a Tcl extension, and works in conjunction with the nsgmls, sgmls, and/or expat parsers (all courtesy James Clark). Cost is free software.

Scholarly Technology Group, Brown University
A humanities-slanted support service at Brown University with expertise in the use of SGML and XML. For technical aids see the utilities and tools section.

Re: explain by example
A posting saved from the UseNet newsgroup comp.text.xml. Its author was responding to a request for justification by examples of the value of XML.

Death to Sleepy Stock Data
An article in Wired reports that the large EDGAR data base at the U.S. Securities and Exchange Commission is going to be made more accessible via XML (April 1, 1999).

Using SGML for EDI
A question with two responses in the spring of 1999 from the UseNet newsgroup comp.text.sgml.

XML in Excelsis
An article from the column Internet Basics in Computer Currents Interactive, National Edition, March 9, 1999.

Documentation Project Tutorial at the FreeBSD Organization

This excellent tutorial, under development in mid 1999 and last revised in 2009, contains material on general SGML markup as well as tutorials on two specific languages under SGML: DocBook and HTML.

Both the FreeBSD Documentation Project and the Linux Documentation Project appear to be fully committed to migration from the older, more limited LinuxDoc language, to the more powerful DocBook language.

OASIS : Organization for the Advancement of Structured Information Standards

Document Structure Description (DSD)
A February 2000 proposal from AT&T and BRICS, University of Aarhus, that offers an alternative to XML schemas and XML DTDs.

The SGML Forum of New York
An organization based in New York City that meets monthly.

Frequently Asked Questions About SGML
Very short and basic. From the UseNet newsgroup comp.text.sgml.

Frequently Asked Questions About XML
An extensive FAQ by a team of authors. This web site is hosted by University College, Cork, Ireland. Access from the U.S. is sometimes slow.

Two versions of the same thing: valid and invalid HTML.
Here are sample outputs from running a validator based on James Clark's SP on these files. Note: Although some browsers are very good at compensating for HTML errors, it is possible for HTML errors to cause even the best browsers to fail in an effort to render.

W3C HTML Validation Service
The W3C service will not give a clean report unless the HTML file submitted has an SGML Document Type Declaration. For common HTML pages this may be done by adding as first line the following case-sensitive string:

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">

(The lower case string "html" for the root tag under HTML 4.01 is not actually case sensitive but this and all other tag names under the corresponding XML version of HTML must be lower case.)

If you are logged to the campus Unix network, you should use the program /usr/local/xml/bin/validhtml rather than frequently banging on the W3C validator (which might then decide to boycott you).

UP  |  TOP  |  COMMENT   --   Last change: 28 Jan 2009