Posted to j-users@xerces.apache.org by "WATKIN-JONES,ADAM (HP-UnitedKingdom,ex1)" <ad...@hp.com> on 2001/11/14 17:35:29 UTC

Xerces 1.4.3 - possible bug

Hi!

I've noticed some odd behaviour with Xerces 1.4.3.  Take this XML file,
saved as ASCII:

<?xml version="1.0" encoding="utf-8"?>
<root>some nice  content</root>

i.e., I've put in a character that is invalid for the declared encoding.

Xerces 1.4.3 loads this into a JAXP Document without reporting an error.
Xerces 2.0.0 beta 3 throws the error 'Exception in thread "main"
java.io.UTFDataFormatException: invalid byte 2 of 2-byte UTF-8 sequence
(0x20)'.
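
For anyone who wants to try this against their own parser, here is a
minimal sketch of the test using plain JAXP on a current JDK. The class
name BadUtf8Check and the stray byte 0xC9 are my own assumptions: I picked
0xC9 only because it is a UTF-8 two-byte lead byte, so following it with a
space should give the same kind of "invalid byte 2 of 2-byte" failure. It
is not necessarily the byte that was in the original file.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;

public class BadUtf8Check {

    // Builds the sample document as raw bytes. The stray byte 0xC9 is an
    // assumption: it is a UTF-8 two-byte lead byte, and here it is followed
    // by a space (0x20), which is not a valid continuation byte.
    public static byte[] sample() {
        byte[] head = ("<?xml version=\"1.0\" encoding=\"utf-8\"?>\n"
                + "<root>some nice ").getBytes(StandardCharsets.US_ASCII);
        byte[] tail = " content</root>\n".getBytes(StandardCharsets.US_ASCII);
        byte[] doc = new byte[head.length + 1 + tail.length];
        System.arraycopy(head, 0, doc, 0, head.length);
        doc[head.length] = (byte) 0xC9;
        System.arraycopy(tail, 0, doc, head.length + 1, tail.length);
        return doc;
    }

    // Returns true if whichever JAXP parser is on the classpath loads the
    // bytes into a Document, false if it throws while parsing.
    public static boolean parses(byte[] xml) {
        try {
            DocumentBuilder builder =
                    DocumentBuilderFactory.newInstance().newDocumentBuilder();
            builder.parse(new ByteArrayInputStream(xml));
            return true;
        } catch (Exception e) {
            System.err.println("Parse failed: " + e);
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(parses(sample()) ? "accepted" : "rejected");
    }
}
```

A parser that decodes the input strictly as UTF-8 should report
"rejected"; one that behaves as 1.4.3 does for me would report "accepted".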

Looking in the bug database, I think that bug 1793 describes the same
problem.  However, it was opened on 2001-05-17 00:04 and I cannot see any
activity since then, even though it has a 'High' priority.  The Xerces
version given there is 1.3.1.
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=1793

Unfortunately, I can't get to the list archives right now
(http://archive.covalent.net/ doesn't respond for me).

Have I spotted a bug?  If so, is it the same bug as 1793?  And why (apart
from the Xerces developers being busy enough as it is!) has 1793 languished
unattended?

Finally, are there plans for any further bug-fix releases of the 1.* series?

Many thanks,
Adam


