Posted to dev@maven.apache.org by "Eirik Maus (JIRA)" <ji...@codehaus.org> on 2005/08/31 10:20:02 UTC

[jira] Created: (MAVEN-1679) maven 1.1-beta-1 chokes on XML Entities for non-US characters in project.xml

maven 1.1-beta-1 chokes on XML Entities for non-US characters in project.xml
----------------------------------------------------------------------------

         Key: MAVEN-1679
         URL: http://jira.codehaus.org/browse/MAVEN-1679
     Project: Maven
        Type: Bug
  Components: model
    Versions: 1.1-beta-1
    Reporter: Eirik Maus



To keep project.xml readable across operating systems and parsers (even when the site plugin turns it into HTML), we use entities for non-US characters in project.xml. The XML parser used in Maven 1.1 chokes on the use of these entities (but not on the entity definition). This is very unfortunate, as using entities for abbreviations and symbols is perfectly legal XML.

Example that does not work with 1.1:

<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE project [
    <!ENTITY OSlash "&#248;">
    <!ENTITY CapitalOSlash "&#216;">
]>
<project>
    <pomVersion>3</pomVersion>
 ...
    <developers>
        <developer>
            <name>Marit Finne J&OSlash;rgensen</name>
            <id>mfj</id>
        </developer>
    </developers>
....
</project>


Example: workaround for 1.1 (a literal character instead of the entity, which brings cross-system compatibility issues):

<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE project [
    <!ENTITY OSlash "&#248;">
    <!ENTITY CapitalOSlash "&#216;">
]>
<project>
    <pomVersion>3</pomVersion>
 ...
    <developers>
        <developer>
            <name>Marit Finne Jørgensen</name>
            <id>mfj</id>
        </developer>
    </developers>
....
</project>


The XML parser chokes on the usage of the XML entity inside 'J&OSlash;rgensen', not on the definition.
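
For comparison, here is a minimal sketch of how a conforming parser treats the failing example. It uses the JDK's standard JAXP DocumentBuilder, not the parser that Maven 1.1 actually uses; the class name EntityCheck, the helper developerName and the shortened project.xml string are made up for this illustration. With a default JDK configuration, the internal entity is expected to be expanded when the document is read.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;

// Hypothetical helper class, for illustration only; not part of Maven.
public class EntityCheck {

    // Shortened version of the failing project.xml from the report above.
    static final String POM =
        "<?xml version=\"1.0\" encoding=\"ISO-8859-1\"?>\n"
        + "<!DOCTYPE project [\n"
        + "    <!ENTITY OSlash \"&#248;\">\n"
        + "]>\n"
        + "<project><developers><developer>"
        + "<name>Marit Finne J&OSlash;rgensen</name>"
        + "</developer></developers></project>";

    // Parses the XML with the JDK's default JAXP parser and returns the
    // text of the first <name> element, with entity references expanded.
    static String developerName(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        DocumentBuilder builder = factory.newDocumentBuilder();
        byte[] bytes = xml.getBytes(StandardCharsets.ISO_8859_1);
        Document doc = builder.parse(new ByteArrayInputStream(bytes));
        return doc.getElementsByTagName("name").item(0).getTextContent();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(developerName(POM));
    }
}
```

If the returned name contains the expanded character rather than an error or a dropped reference, the document itself is fine and the problem lies in how the 1.1 parser handles entities declared in the internal DTD subset.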



-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@maven.apache.org
For additional commands, e-mail: dev-help@maven.apache.org


[jira] Commented: (MAVEN-1679) maven 1.1-beta-1 chokes on XML Entities for non-US characters in project.xml

Posted by "Trygve Laugstol (JIRA)" <ji...@codehaus.org>.
    [ http://jira.codehaus.org/browse/MAVEN-1679?page=comments#action_45546 ] 

Trygve Laugstol commented on MAVEN-1679:
----------------------------------------

All the standard HTML entities should be supported, so use &oslash; for 'ø'.



[jira] Commented: (MAVEN-1679) maven 1.1-beta-1 chokes on XML Entities for non-US characters in project.xml

Posted by "Brett Porter (JIRA)" <ji...@codehaus.org>.
    [ http://jira.codehaus.org/browse/MAVEN-1679?page=comments#action_48105 ] 

Brett Porter commented on MAVEN-1679:
-------------------------------------

We do need to support a Xerces-based parser for backwards compatibility, though...
