You are viewing a plain text version of this content. The canonical link for it is here.
Posted to c-dev@xerces.apache.org by Clint Phillips <cp...@apriva.com> on 2001/02/26 19:36:03 UTC

XML4C vs Xeries-C

Sorry for the newbie question, but I wanting to know what the differences
between XML4C and Xercies are. I see that we have IBM engineers on the list,
so maybe they are the same thing. Also, I'm wanting to contribute any
significant additions that I may create in the future(like DOM serialization
as I need to have it for a project), but I currently develop only on the
windows platfrom. As Xerices is cross platform, how would I test on other
platforms? Or is there a core team that writes code and is just looking for
feedback?

Thanks, Clint


Re: XML4C vs Xeries-C

Posted by Bob Kline <bk...@rksystems.com>.
On Mon, 26 Feb 2001, Clint Phillips wrote:

> Sorry for the newbie question, but I wanting to know what the
> differences between XML4C and Xercies are.

The online documentation [1] says:

  IBM's XML for C++ parser (XML4C) is based on Apache's Xerces-C XML
  parser, which is a validating XML parser written in a portable subset
  of C++. XML4C integrates the Xerces-C parser with IBM's International
  Components for Unicode (ICU) and extends the number of encodings
  supported to over 150. 

Also [2]:

  On November 9, 1999, the Apache Software Foundation announced the
  creation of the xml.apache.org project for Open Source XML solutions.
  As part of that announcment, IBM donated its XML4C parser to the
  xml.apache.org project along with XML4J and Lotus XSL. The parsing
  technologies have been renamed Xerces, and the LotusXSL technology has
  been renamed Xalan. IBM is shifting its XML parsing development
  resources to work on the Xerces parsers, with the intent of using the
  Xerces code base as the foundation for XML4C.

  XML4C Version 3.1 is based on Xerces-C Version 1.1. XML4C is Xerces-C
  augmented with IBM's International Components for Unicode (ICU).

  Xerces-C has intrinsic support for ASCII, UTF-8, UTF-16 (Big/Small
  Endian), UCS4 (Big/Small Endian), EBCDIC-US and ISO-8859-1 (aka
  Latin1) encodings. This means that it can parse input XML files in
  these above mentioned encodings. 

  However, if you wish to parse XML files in any other encodings, say in
  Shift-JIS, Big5 etc., then you cannot use Xerces-C. XML4C addresses
  this need. It combines Xerces-C and International Components for
  Unicode (ICU) and provides support for over 100 different encodings.

[1] http://www.alphaworks.ibm.com/tech/xml4c
[2] file:///usr/share/xml4c/doc/html/index.html

-- 
Bob Kline 
mailto:bkline@rksystems.com
http://www.rksystems.com