You are viewing a plain text version of this content. The canonical link for it is here.
Posted to xindice-users@xml.apache.org by Martin Holz <ho...@fiz-chemie.de> on 2002/02/06 22:48:27 UTC
Bug in XMLResource.getContentAsSAX ?
Hi,
XMLResource.getContentAsSAX seems to handle namespace uris for attributes
wrong. I have added a XHTML document with a default namesspace declaration
using a DOM. If a retrieve the document with the command line tool,
everything looks fine:
+------------------
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta content="HTML Tidy, see www.w3.org" name="generator" />
<title>
......
+-------------------
However retrieving the resource with XMLResource.getContentAsSAX ,
I get this kind of SAX events:
+------------------------
......
[startElement] uri=http://www.w3.org/1999/xhtml,local=meta,raw=meta
[ ] 1.
uri=http://www.w3.org/1999/xhtml,local=content,qname=content,type=,value=HTML
Tidy, see www.w3.org
[ ] 2.
uri=http://www.w3.org/1999/xhtml,local=name,qname=name,type=,value=generator
[endElement] uri=http://www.w3.org/1999/xhtml,local=meta,qname=meta
.......
+-------------------------
The namespace uri for the "meta" element is okay. However its attributes
should have no namespace uri. (See
http://www.w3.org/TR/1999/REC-xml-names-19990114/#defaulting)
My current workaraound is to retrieve the document as string
and parse it on the client side.
Regards
Martin
--
Martin Holz <ho...@fiz-chemie.de> phone: 0049-30-39977 218
FIZ CHEMIE BERLIN