You are viewing a plain text version of this content. The canonical link for it is here.
Posted to c-dev@xerces.apache.org by Clint Phillips <cp...@apriva.com> on 2001/02/26 19:36:03 UTC
XML4C vs Xeries-C
Sorry for the newbie question, but I wanting to know what the differences
between XML4C and Xercies are. I see that we have IBM engineers on the list,
so maybe they are the same thing. Also, I'm wanting to contribute any
significant additions that I may create in the future(like DOM serialization
as I need to have it for a project), but I currently develop only on the
windows platfrom. As Xerices is cross platform, how would I test on other
platforms? Or is there a core team that writes code and is just looking for
feedback?
Thanks, Clint
Re: XML4C vs Xeries-C
Posted by Bob Kline <bk...@rksystems.com>.
On Mon, 26 Feb 2001, Clint Phillips wrote:
> Sorry for the newbie question, but I wanting to know what the
> differences between XML4C and Xercies are.
The online documentation [1] says:
IBM's XML for C++ parser (XML4C) is based on Apache's Xerces-C XML
parser, which is a validating XML parser written in a portable subset
of C++. XML4C integrates the Xerces-C parser with IBM's International
Components for Unicode (ICU) and extends the number of encodings
supported to over 150.
Also [2]:
On November 9, 1999, the Apache Software Foundation announced the
creation of the xml.apache.org project for Open Source XML solutions.
As part of that announcment, IBM donated its XML4C parser to the
xml.apache.org project along with XML4J and Lotus XSL. The parsing
technologies have been renamed Xerces, and the LotusXSL technology has
been renamed Xalan. IBM is shifting its XML parsing development
resources to work on the Xerces parsers, with the intent of using the
Xerces code base as the foundation for XML4C.
XML4C Version 3.1 is based on Xerces-C Version 1.1. XML4C is Xerces-C
augmented with IBM's International Components for Unicode (ICU).
Xerces-C has intrinsic support for ASCII, UTF-8, UTF-16 (Big/Small
Endian), UCS4 (Big/Small Endian), EBCDIC-US and ISO-8859-1 (aka
Latin1) encodings. This means that it can parse input XML files in
these above mentioned encodings.
However, if you wish to parse XML files in any other encodings, say in
Shift-JIS, Big5 etc., then you cannot use Xerces-C. XML4C addresses
this need. It combines Xerces-C and International Components for
Unicode (ICU) and provides support for over 100 different encodings.
[1] http://www.alphaworks.ibm.com/tech/xml4c
[2] file:///usr/share/xml4c/doc/html/index.html
--
Bob Kline
mailto:bkline@rksystems.com
http://www.rksystems.com