You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/10/24 18:15:33 UTC

[jira] Closed: (NUTCH-110) OpenSearchServlet outputs illegal xml characters

     [ http://issues.apache.org/jira/browse/NUTCH-110?page=all ]

Sami Siren closed NUTCH-110.
----------------------------


> OpenSearchServlet outputs illegal xml characters
> ------------------------------------------------
>
>                 Key: NUTCH-110
>                 URL: http://issues.apache.org/jira/browse/NUTCH-110
>             Project: Nutch
>          Issue Type: Bug
>          Components: searcher
>    Affects Versions: 0.8
>         Environment: linux, jdk 1.5
>            Reporter: stack@archive.org
>         Assigned To: Sami Siren
>             Fix For: 0.8
>
>         Attachments: fixIllegalXmlChars.patch, fixIllegalXmlChars08-v2.patch, fixIllegalXmlChars08-v3.patch, fixIllegalXmlChars08-v4.patch, fixIllegalXmlChars08-v5.patch, fixIllegalXmlChars08.patch, NUTCH-110-version2.patch
>
>
> OpenSearchServlet does not check text-to-output for illegal xml characters; dependent on  search result, its possible for OSS to output xml that is not well-formed.  For example, if text has the character FF character in it -- -- i.e. the ascii character at position (decimal) 12 --  the produced XML will show the FF character as '&#12;' The character/entity '&#12;' is not legal in XML according to http://www.w3.org/TR/2000/REC-xml-20001006#NT-Char.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira