Posted to xindice-users@xml.apache.org by Martin Holz <ho...@fiz-chemie.de> on 2002/02/06 22:48:27 UTC

Bug in XMLResource.getContentAsSAX ?

Hi,

XMLResource.getContentAsSAX seems to handle namespace URIs for attributes
incorrectly. I have added an XHTML document with a default namespace declaration
using a DOM. If I retrieve the document with the command line tool,
everything looks fine:
+------------------
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta content="HTML Tidy, see www.w3.org" name="generator" />
    <title>
   ......
+-------------------

However, when retrieving the resource with XMLResource.getContentAsSAX,
I get SAX events like these:
+------------------------
......
[startElement] uri=http://www.w3.org/1999/xhtml,local=meta,raw=meta
[            ] 1. uri=http://www.w3.org/1999/xhtml,local=content,qname=content,type=,value=HTML Tidy, see www.w3.org
[            ] 2. uri=http://www.w3.org/1999/xhtml,local=name,qname=name,type=,value=generator
[endElement] uri=http://www.w3.org/1999/xhtml,local=meta,qname=meta
.......
+-------------------------
The namespace URI for the "meta" element is okay. However, its attributes
should have no namespace URI, because a default namespace declaration does
not apply to unprefixed attributes. (See
http://www.w3.org/TR/1999/REC-xml-names-19990114/#defaulting)
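
For comparison, here is a minimal sketch of what a namespace-aware JAXP SAX parser reports for the same kind of markup. It does not use Xindice at all. The class name AttributeNamespaceCheck and the method describe are hypothetical, chosen only for this illustration, and the sample markup is a shortened version of the document above.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of Xindice or the XML:DB API.
final class AttributeNamespaceCheck {

    /** Parses xml with namespace support and lists each element and attribute with its URI. */
    static List<String> describe(String xml) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true); // without this, no namespace URIs are reported

        List<String> events = new ArrayList<>();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void startElement(String uri, String local, String qName, Attributes atts) {
                events.add("element " + local + " uri=" + uri);
                for (int i = 0; i < atts.getLength(); i++) {
                    // Unprefixed attributes are expected to report an empty URI here.
                    events.add("attribute " + atts.getLocalName(i) + " uri=" + atts.getURI(i));
                }
            }
        };
        factory.newSAXParser().parse(new InputSource(new StringReader(xml)), handler);
        return events;
    }

    public static void main(String[] args) throws Exception {
        String xml = "<html xmlns=\"http://www.w3.org/1999/xhtml\"><head>"
                + "<meta content=\"HTML Tidy, see www.w3.org\" name=\"generator\" />"
                + "</head></html>";
        for (String line : describe(xml)) {
            System.out.println(line);
        }
    }
}
```

With a conforming parser, the "meta" element should carry the XHTML namespace URI while its "content" and "name" attributes carry an empty one.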

My current workaround is to retrieve the document as a string
and parse it on the client side.
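
A rough sketch of that workaround, assuming the document text is already available as a String (for example from the resource's getContent() method in the XML:DB API, which is not called here so that the sketch stays self-contained). The class name ClientSideSax and the method parse are hypothetical names for this illustration only.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.ContentHandler;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of Xindice or the XML:DB API.
final class ClientSideSax {

    /** Parses the given document text locally and forwards the SAX events to handler. */
    static void parse(String xml, ContentHandler handler) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true); // report namespace URIs for elements and attributes
        XMLReader reader = factory.newSAXParser().getXMLReader();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) throws Exception {
        // Assumption: in real use this string would be the content retrieved from the database.
        String xml = "<html xmlns=\"http://www.w3.org/1999/xhtml\"><head>"
                + "<meta content=\"HTML Tidy, see www.w3.org\" name=\"generator\" />"
                + "</head></html>";

        // The same handler that would otherwise be passed to getContentAsSAX.
        parse(xml, new DefaultHandler() {
            @Override
            public void startElement(String uri, String local, String qName, Attributes atts) {
                System.out.println("[startElement] uri=" + uri + ",local=" + local);
                for (int i = 0; i < atts.getLength(); i++) {
                    System.out.println("  uri=" + atts.getURI(i) + ",local=" + atts.getLocalName(i));
                }
            }
        });
    }
}
```

The cost of this approach is that the document is serialized on the server and parsed a second time on the client.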

Regards
	Martin 

--
Martin Holz  <ho...@fiz-chemie.de>  phone: 0049-30-39977 218 
FIZ CHEMIE BERLIN