Posted to dev@sling.apache.org by "Jason E Bailey (JIRA)" <ji...@apache.org> on 2017/04/20 12:45:04 UTC

[jira] [Created] (SLING-6783) update for org.apache.commons.html

Jason E Bailey created SLING-6783:
-------------------------------------

             Summary: update for org.apache.commons.html
                 Key: SLING-6783
                 URL: https://issues.apache.org/jira/browse/SLING-6783
             Project: Sling
          Issue Type: Improvement
          Components: Commons
            Reporter: Jason E Bailey
            Priority: Minor
         Attachments: sling.patch

The attached patch makes the following updates:

Updated the TagSoup library to 1.2.1, which includes the following changes:

* DOCTYPE is now recognized even in lower case.
* We make sure to buffer the reader, eliminating a long-standing bug that would crash on certain inputs, such as & followed by CR+LF.
* The HTML scanner's table is precompiled at run time for efficiency, causing a 4x speedup on large input documents.
* ]] within a CDATA section no longer causes input to be discarded.
* Remove bogus newline after printing children of the root element.
* Allow the noscript element anywhere, the same as the script element.
* Updated to the 2011 edition of the W3C character entity list.

Additionally:

* Updated the license with the new home page for TagSoup.
* Replaced the existing annotations with OSGi annotations.
* Added the ability to specify additional features/properties for the parser.
* Documented the available settings.
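
For readers unfamiliar with SAX-style parser settings, the following is a minimal sketch of how configured features and properties could be applied to a parser. It is not taken from sling.patch: the class name ParserSettings and the method apply are hypothetical, and the JDK's built-in XMLReader stands in for the TagSoup parser so the example runs without extra dependencies. TagSoup's org.ccil.cowan.tagsoup.Parser implements the same org.xml.sax.XMLReader interface, so the same setFeature/setProperty calls should apply to it.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXException;
import org.xml.sax.XMLReader;

// Hypothetical helper, not part of the patch: applies name/value settings
// to any SAX XMLReader (TagSoup's Parser is one such implementation).
public class ParserSettings {

    static void apply(XMLReader reader,
                      Map<String, Boolean> features,
                      Map<String, Object> properties) throws SAXException {
        // Features are boolean switches identified by a URI.
        for (Map.Entry<String, Boolean> e : features.entrySet()) {
            reader.setFeature(e.getKey(), e.getValue());
        }
        // Properties carry arbitrary objects identified by a URI.
        for (Map.Entry<String, Object> e : properties.entrySet()) {
            reader.setProperty(e.getKey(), e.getValue());
        }
    }

    public static void main(String[] args) throws Exception {
        // Stand-in reader from the JDK; a TagSoup Parser could be used instead.
        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();

        Map<String, Boolean> features = new LinkedHashMap<>();
        // Standard SAX feature name that every XMLReader must recognize.
        features.put("http://xml.org/sax/features/namespaces", true);

        apply(reader, features, new LinkedHashMap<>());
        System.out.println("namespaces = "
                + reader.getFeature("http://xml.org/sax/features/namespaces"));
    }
}
```

An unrecognized or unsupported name makes setFeature or setProperty throw a SAXException subclass, so a real service would probably want to log and skip such entries rather than fail outright; that choice is left open here.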



