You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@commons.apache.org by bu...@apache.org on 2006/02/08 15:03:40 UTC

DO NOT REPLY [Bug 38569] New: - [lang] StringEscapeUtils.unescapeHtml skips first entity after standalone ampersand

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG�
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=38569>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND�
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=38569

           Summary: [lang] StringEscapeUtils.unescapeHtml skips first entity
                    after standalone ampersand
           Product: Commons
           Version: 2.1 Final
          Platform: PC
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Lang
        AssignedTo: commons-dev@jakarta.apache.org
        ReportedBy: duy.dao@azonline.ch


StringEscapeUtils.unescapeHtml skips the first entity after a standalone ampersand.

Code:
System.out.println(
StringEscapeUtils.unescapeHtml("test & &auml; &ouml;")
);

Output:
test & &auml; �

Expected output:
test & � �

-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


DO NOT REPLY [Bug 38569] - [lang] StringEscapeUtils.unescapeHtml skips first entity after standalone ampersand

Posted by bu...@apache.org.
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG�
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=38569>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND�
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=38569





------- Additional Comments From trejkaz@trypticon.org  2006-03-06 01:14 -------
Created an attachment (id=17838)
 --> (http://issues.apache.org/bugzilla/attachment.cgi?id=17838&action=view)
Patch to work around this issue

This patch works around spaces being between the '&' and the ';'.  To be more
correct, it should probably ensure that all characters between the '&' and ';'
are valid entity characters, but in most cases the offending character is a
space.

-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


DO NOT REPLY [Bug 38569] - [lang] StringEscapeUtils.unescapeHtml skips first entity after standalone ampersand

Posted by bu...@apache.org.
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG�
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=38569>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND�
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=38569


trejkaz@trypticon.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |trejkaz@trypticon.org




-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org


DO NOT REPLY [Bug 38569] - [lang] StringEscapeUtils.unescapeHtml skips first entity after standalone ampersand

Posted by bu...@apache.org.
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG�
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=38569>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND�
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=38569


bayard@apache.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
         AssignedTo|commons-                    |bayard@apache.org
                   |dev@jakarta.apache.org      |




------- Additional Comments From bayard@apache.org  2006-04-18 06:41 -------
Recreated issue with unit test. The proposed solution is too simple - we need to
be able to handle all situations, not just times when there is a space after the &. 

-- 
Configure bugmail: http://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

---------------------------------------------------------------------
To unsubscribe, e-mail: commons-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: commons-dev-help@jakarta.apache.org