You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by bu...@apache.org on 2004/08/12 16:41:20 UTC
DO NOT REPLY [Bug 30617] New: -
HTMLParser doesn't parse hexadecimal character references
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=30617>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND
INSERTED IN THE BUG DATABASE.
http://issues.apache.org/bugzilla/show_bug.cgi?id=30617
HTMLParser doesn't parse hexadecimal character references
Summary: HTMLParser doesn't parse hexadecimal character
references
Product: Lucene
Version: 1.0.2
Platform: All
OS/Version: All
Status: NEW
Severity: Normal
Priority: Other
Component: Examples
AssignedTo: lucene-dev@jakarta.apache.org
ReportedBy: Dave.Sparks@teamware.co.uk
I recently inherited a project from an ex-colleague; it uses Lucene and in
particular the HTML Parser. I've found that she had made an amendment to the
parser to allow it to parse and decode hexadecimal character references, which
we depend on, but had not reported a bug. If she had, someone might have
pointed out that her correction was wrong ...
I don't seem to be able to attach the (fairly trivial) patch to an initial bug
report (and in any case I've failed to find the instructions for generating a
diff file in the right format, even though I'm sure I've seen it somewhere).
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org