You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@commons.apache.org by "Hans Aikema (Jira)" <ji...@apache.org> on 2021/09/08 15:00:00 UTC

[jira] [Commented] (TEXT-193) HTML4 unEscape should parse $

    [ https://issues.apache.org/jira/browse/TEXT-193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17411986#comment-17411986 ] 

Hans Aikema commented on TEXT-193:
----------------------------------

&dollar; is a valid HTML5 named entity per [https://dev.w3.org/html5/html-author/charref,] but it's not a valid HTML4 entity [https://www.w3.org/TR/WD-html40-970708/sgml/entities.html]

So in my view unescapeHtml4 rightfully does not unescape &dollar; and you'd like to see an extension of the StringEscapeUtils so that it can also unescape HTML5.

> HTML4 unEscape should parse &dollar;
> ------------------------------------
>
>                 Key: TEXT-193
>                 URL: https://issues.apache.org/jira/browse/TEXT-193
>             Project: Commons Text
>          Issue Type: Wish
>            Reporter: Ajay Kumar Singh
>            Priority: Major
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> I have a string which has &dollar; but I don't see that getting unescaped by commons



--
This message was sent by Atlassian Jira
(v8.3.4#803005)