You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@archiva.apache.org by "Wendy Smoak (JIRA)" <ji...@codehaus.org> on 2009/01/20 20:43:19 UTC

[jira] Created: (MRM-1069) Archiva should clean bad UTF-8 data in poms

Archiva should clean bad UTF-8 data in poms
-------------------------------------------

                 Key: MRM-1069
                 URL: http://jira.codehaus.org/browse/MRM-1069
             Project: Archiva
          Issue Type: Improvement
    Affects Versions: 1.1.2
            Reporter: Wendy Smoak


Archiva should have a mechanism to clean the repository of bad poms that contain improper UTF8 data.

Possible places for this to occur:

1) During the repository scan for content.
   Have a process that reads the xml data as a stream (not as an xml document) and process out any improper UTF8 characters with appropriate XML latin entities. (example: changing "ø" to "%oslash;")

2) During pom download, check the pom and clean it inline on the way down to the user.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] Updated: (MRM-1069) Archiva should clean bad UTF-8 data in poms

Posted by "Brett Porter (JIRA)" <ji...@codehaus.org>.
     [ http://jira.codehaus.org/browse/MRM-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brett Porter updated MRM-1069:
------------------------------

    Fix Version/s:     (was: 1.4)

> Archiva should clean bad UTF-8 data in poms
> -------------------------------------------
>
>                 Key: MRM-1069
>                 URL: http://jira.codehaus.org/browse/MRM-1069
>             Project: Archiva
>          Issue Type: Improvement
>    Affects Versions: 1.1.2
>            Reporter: Wendy Smoak
>            Assignee: Brett Porter
>
> Archiva should have a mechanism to clean the repository of bad poms that contain improper UTF8 data.
> Possible places for this to occur:
> 1) During the repository scan for content.
>    Have a process that reads the xml data as a stream (not as an xml document) and process out any improper UTF8 characters with appropriate XML latin entities. (example: changing "ø" to "%oslash;")
> 2) During pom download, check the pom and clean it inline on the way down to the user.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] Closed: (MRM-1069) Archiva should clean bad UTF-8 data in poms

Posted by "Brett Porter (JIRA)" <ji...@codehaus.org>.
     [ http://jira.codehaus.org/browse/MRM-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brett Porter closed MRM-1069.
-----------------------------

    Resolution: Won't Fix
      Assignee: Brett Porter

AFAICT, Maven works with these UTF characters, and it is really not valid XML to use &oslash; without defining it (Maven cheats by adding these HTML characters into the parser itself). There are some characters that trouble certain IBM JDKs, but if that's a scenario someone is faced with I think it'd warrant a separate plugin now.

> Archiva should clean bad UTF-8 data in poms
> -------------------------------------------
>
>                 Key: MRM-1069
>                 URL: http://jira.codehaus.org/browse/MRM-1069
>             Project: Archiva
>          Issue Type: Improvement
>    Affects Versions: 1.1.2
>            Reporter: Wendy Smoak
>            Assignee: Brett Porter
>             Fix For: 1.4
>
>
> Archiva should have a mechanism to clean the repository of bad poms that contain improper UTF8 data.
> Possible places for this to occur:
> 1) During the repository scan for content.
>    Have a process that reads the xml data as a stream (not as an xml document) and process out any improper UTF8 characters with appropriate XML latin entities. (example: changing "ø" to "%oslash;")
> 2) During pom download, check the pom and clean it inline on the way down to the user.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] Updated: (MRM-1069) Archiva should clean bad UTF-8 data in poms

Posted by "Maria Odea Ching (JIRA)" <ji...@codehaus.org>.
     [ http://jira.codehaus.org/browse/MRM-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Maria Odea Ching updated MRM-1069:
----------------------------------

    Fix Version/s:     (was: 1.3)
                   1.3.1

> Archiva should clean bad UTF-8 data in poms
> -------------------------------------------
>
>                 Key: MRM-1069
>                 URL: http://jira.codehaus.org/browse/MRM-1069
>             Project: Archiva
>          Issue Type: Improvement
>    Affects Versions: 1.1.2
>            Reporter: Wendy Smoak
>             Fix For: 1.3.1
>
>
> Archiva should have a mechanism to clean the repository of bad poms that contain improper UTF8 data.
> Possible places for this to occur:
> 1) During the repository scan for content.
>    Have a process that reads the xml data as a stream (not as an xml document) and process out any improper UTF8 characters with appropriate XML latin entities. (example: changing "ø" to "%oslash;")
> 2) During pom download, check the pom and clean it inline on the way down to the user.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] Updated: (MRM-1069) Archiva should clean bad UTF-8 data in poms

Posted by "Brett Porter (JIRA)" <ji...@codehaus.org>.
     [ http://jira.codehaus.org/browse/MRM-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brett Porter updated MRM-1069:
------------------------------

    Fix Version/s: 1.2

> Archiva should clean bad UTF-8 data in poms
> -------------------------------------------
>
>                 Key: MRM-1069
>                 URL: http://jira.codehaus.org/browse/MRM-1069
>             Project: Archiva
>          Issue Type: Improvement
>    Affects Versions: 1.1.2
>            Reporter: Wendy Smoak
>             Fix For: 1.2
>
>
> Archiva should have a mechanism to clean the repository of bad poms that contain improper UTF8 data.
> Possible places for this to occur:
> 1) During the repository scan for content.
>    Have a process that reads the xml data as a stream (not as an xml document) and process out any improper UTF8 characters with appropriate XML latin entities. (example: changing "ø" to "%oslash;")
> 2) During pom download, check the pom and clean it inline on the way down to the user.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] Updated: (MRM-1069) Archiva should clean bad UTF-8 data in poms

Posted by "Brett Porter (JIRA)" <ji...@codehaus.org>.
     [ http://jira.codehaus.org/browse/MRM-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brett Porter updated MRM-1069:
------------------------------

    Fix Version/s:     (was: 1.2)
                   1.3

> Archiva should clean bad UTF-8 data in poms
> -------------------------------------------
>
>                 Key: MRM-1069
>                 URL: http://jira.codehaus.org/browse/MRM-1069
>             Project: Archiva
>          Issue Type: Improvement
>    Affects Versions: 1.1.2
>            Reporter: Wendy Smoak
>             Fix For: 1.3
>
>
> Archiva should have a mechanism to clean the repository of bad poms that contain improper UTF8 data.
> Possible places for this to occur:
> 1) During the repository scan for content.
>    Have a process that reads the xml data as a stream (not as an xml document) and process out any improper UTF8 characters with appropriate XML latin entities. (example: changing "ø" to "%oslash;")
> 2) During pom download, check the pom and clean it inline on the way down to the user.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira