You are viewing a plain text version of this content. The canonical link for it is here.
Posted to j-dev@xerces.apache.org by "Richard Kelly (JIRA)" <xe...@xml.apache.org> on 2009/07/12 10:49:14 UTC

[jira] Updated: (XERCESJ-1383) Adding Unicode Normalization support to Xerces2-J

     [ https://issues.apache.org/jira/browse/XERCESJ-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Richard Kelly updated XERCESJ-1383:
-----------------------------------

    Attachment: CharacterNormalizer.java
                CharacterNormalizer.patch

This is an initial set of patches to implement character normalization in Xerces2-J.

CharacterNormalizer.patch represents the changes that need to be made to existing code.
CharacterNormalizer.java is the XNI component that actually implements character normalization.

These patches are currently still under development, but I welcome any feedback or suggestions on ways to improve them.

> Adding Unicode Normalization support to Xerces2-J 
> --------------------------------------------------
>
>                 Key: XERCESJ-1383
>                 URL: https://issues.apache.org/jira/browse/XERCESJ-1383
>             Project: Xerces2-J
>          Issue Type: New Feature
>          Components: DOM (Level 3 Core), SAX
>    Affects Versions: 2.9.1
>         Environment: All
>            Reporter: Richard Kelly
>         Attachments: CharacterNormalizer.java, CharacterNormalizer.patch
>
>
> This feature will add support for Unicode character normalization and normalization checking to Xerces.  Applications that use Xerces will be able to produce fully normalized XML documents and verify that any XML documents they process are fully normalised. 
> Adding this functionality will allow Xerces to meet the XML 1.1 W3C Recommendation regarding character normalization and allow it to implement the optional character normalization and normalization checking features specified in the DOM Level 3 Core and SAX2.
> More specifically, the features to be implemented are:
> DOM Level 3 Core: "normalize-characters" [1]
> DOM Level 3 Core: "check-character-normalization" [2]
> SAX2: "unicode-normalization-checking" [3]
> [1] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-normalize-characters
> [2] http://www.w3.org/TR/DOM-Level-3-Core/core.html#parameter-check-character-normalization
> [3] http://www.saxproject.org/apidoc/org/xml/sax/package-summary.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: j-dev-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-dev-help@xerces.apache.org