Posted to commits@empire-db.apache.org by "Sebb (JIRA)" <em...@incubator.apache.org> on 2012/08/09 22:02:19 UTC

[jira] [Created] (EMPIREDB-156) DOAP file has encoding error

Sebb created EMPIREDB-156:
-----------------------------

             Summary: DOAP file has encoding error
                 Key: EMPIREDB-156
                 URL: https://issues.apache.org/jira/browse/EMPIREDB-156
             Project: Empire-DB
          Issue Type: Bug
            Reporter: Sebb


The project build is failing with:

/usr/local/bin/xsltproc  /x1/home/apsite/wrkdir/projects/templates/projectName.xsl /x1/home/apsite/wrkdir/projects/work/doap_temp_3.rdf
/x1/home/apsite/wrkdir/projects/work/doap_temp_3.rdf:46: parser error : Input is not proper UTF-8, indicate encoding !
Bytes: 0xF6 0x62 0x65 0x6C
        <foaf:name>Rainer Döbele</foaf:name>
                           ^
unable to parse /x1/home/apsite/wrkdir/projects/work/doap_temp_3.rdf

I've disabled the entry:

URL: http://svn.apache.org/viewvc?rev=1371420&view=rev
Log:
empire-db has encoding error

Modified:
    infrastructure/site-tools/trunk/projects/files.xml

Please fix the DOAP, and you can then re-enable the entry.
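
For reference, the bytes in the parser error can be inspected directly. The following is only a small illustrative sketch in Python (not part of the site-tools build); it assumes nothing beyond the four bytes quoted in the error message.

```python
# The four bytes reported by the xsltproc parser error.
reported = bytes([0xF6, 0x62, 0x65, 0x6C])

# Read as ISO-8859-1 (Latin-1) they are the tail of the name: "öbel".
as_latin1 = reported.decode("iso-8859-1")

# Read as UTF-8 they are invalid, because 0xF6 cannot start a UTF-8 sequence.
try:
    reported.decode("utf-8")
    valid_utf8 = True
except UnicodeDecodeError:
    valid_utf8 = False

# In UTF-8, "ö" is the two-byte sequence 0xC3 0xB6.
utf8_o_umlaut = "ö".encode("utf-8")

print(as_latin1, valid_utf8, utf8_o_umlaut.hex())
```

So the local copy that xsltproc read most likely held the name in a single-byte encoding such as ISO-8859-1 rather than UTF-8.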

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] [Commented] (EMPIREDB-156) DOAP file has encoding error

Posted by "Francis De Brabandere (JIRA)" <em...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13432146#comment-13432146 ] 

Francis De Brabandere commented on EMPIREDB-156:
------------------------------------------------

Are you sure the file is not UTF-8?

http://empire-db.apache.org/doap_Empire-db.rdf

Just checked with http://www.w3.org/RDF/Validator/ and everything seems to be ok
                

[jira] [Assigned] (EMPIREDB-156) DOAP file has encoding error

Posted by "Francis De Brabandere (JIRA)" <em...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Francis De Brabandere reassigned EMPIREDB-156:
----------------------------------------------

    Assignee: Francis De Brabandere
    

[jira] [Commented] (EMPIREDB-156) DOAP file has encoding error

Posted by "Francis De Brabandere (JIRA)" <em...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13434955#comment-13434955 ] 

Francis De Brabandere commented on EMPIREDB-156:
------------------------------------------------

Sebb, are you sure this issue is on our side?
                

[jira] [Commented] (EMPIREDB-156) DOAP file has encoding error

Posted by "Francis De Brabandere (JIRA)" <em...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13432161#comment-13432161 ] 

Francis De Brabandere commented on EMPIREDB-156:
------------------------------------------------

isutf8 doap_Empire-db.rdf 
echo $?
0


"If the file is valid UTF-8, the exit status is zero. If the file is not valid UTF-8, or there is some error, the exit status is non-zero."
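
Where isutf8 is not installed, a similar check can be approximated in a few lines of Python. This is a rough sketch and not the isutf8 implementation; the names is_valid_utf8 and exit_status_for are made up for illustration, and the 0/1 return value only mirrors the exit-status rule quoted above.

```python
from pathlib import Path


def is_valid_utf8(data: bytes) -> bool:
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def exit_status_for(path: str) -> int:
    """Hypothetical helper: 0 if the file is valid UTF-8, 1 otherwise."""
    # Read raw bytes so that no implicit decoding happens before the check.
    return 0 if is_valid_utf8(Path(path).read_bytes()) else 1
```

Note that this only tests the bytes on disk. It says nothing about what charset an HTTP server declares when serving the file.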
                

[jira] [Commented] (EMPIREDB-156) DOAP file has encoding error

Posted by "Sebb (JIRA)" <em...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435260#comment-13435260 ] 

Sebb commented on EMPIREDB-156:
-------------------------------

I've looked into this further, and the problem is that the HTTP response does not specify the charset.
The default HTTP charset is ISO-8859-1, not UTF-8, so in a way there is a problem at your end.

The script currently attempts to decode the response content before writing it to the local file copy; this causes the problem as the resulting file is not UTF-8.

I suspect the content decoding is unnecessary - we just want a binary copy of the file - so I'll try with that.

If that causes problems elsewhere, a workaround would be to move the DOAP to SVN and set the appropriate content-type there.

cf., for example:

http://svn.apache.org/repos/asf/harmony/standard/doap_Harmony.rdf
http://svn.apache.org/repos/asf/webservices/commons/trunk/modules/axiom/etc/axiom.rdf

both of the above responses are UTF-8
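
To illustrate the difference between decoding the content and taking a binary copy, here is a minimal Python sketch. It is not the actual site-tools script, whose code is not shown in this thread; the function names and the particular charsets passed in are assumptions chosen to reproduce the 0xF6 byte from the error message.

```python
# Sample body as a server would send it: the DOAP line encoded as UTF-8.
original = "<foaf:name>Rainer Döbele</foaf:name>".encode("utf-8")


def copy_with_decoding(body: bytes, assumed_charset: str, output_charset: str) -> bytes:
    """Hypothetical lossy path: decode to text, then re-encode on write."""
    text = body.decode(assumed_charset)
    return text.encode(output_charset)


def copy_binary(body: bytes) -> bytes:
    """Charset-agnostic path: the bytes are passed through untouched."""
    return body


# Assumption for illustration: the text ends up written as ISO-8859-1.
lossy = copy_with_decoding(original, "utf-8", "iso-8859-1")
exact = copy_binary(original)

print(b"\xf6bel" in lossy)   # the single-byte "ö" from the parser error
print(exact == original)     # the binary copy is still valid UTF-8
```

Whatever the exact decode and encode steps were in the real script, a binary copy avoids the question entirely because the local file keeps the bytes the server sent.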

                

[jira] [Commented] (EMPIREDB-156) DOAP file has encoding error

Posted by "Francis De Brabandere (JIRA)" <em...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435402#comment-13435402 ] 

Francis De Brabandere commented on EMPIREDB-156:
------------------------------------------------

The file is also available in SVN:
http://svn.apache.org/repos/asf/empire-db/site/doap_Empire-db.rdf

I set the content-type to application/rdf+xml

Is this enough?

curl --head http://svn.apache.org/repos/asf/empire-db/site/doap_Empire-db.rdf
HTTP/1.1 200 OK
Date: Wed, 15 Aug 2012 18:48:16 GMT
Server: Apache/2.2.17 (Unix) mod_ssl/2.2.17 OpenSSL/1.0.0c DAV/2 mod_wsgi/3.1 Python/2.6.6 SVN/1.7.0
Last-Modified: Wed, 15 Aug 2012 18:48:07 GMT
ETag: "1373564//empire-db/site/doap_Empire-db.rdf"
Accept-Ranges: bytes
Content-Length: 2954
Vary: Accept-Encoding
Content-Type: application/rdf+xml

                

[jira] [Commented] (EMPIREDB-156) DOAP file has encoding error

Posted by "Sebb (JIRA)" <em...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435450#comment-13435450 ] 

Sebb commented on EMPIREDB-156:
-------------------------------

I've updated the script; hopefully it is now charset-agnostic. It ran OK for me locally.

As to the SVN content-type: no, that would not be enough.
Try curling the other SVN examples I gave and you'll see what they return.
                

[jira] [Commented] (EMPIREDB-156) DOAP file has encoding error

Posted by "Francis De Brabandere (JIRA)" <em...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435855#comment-13435855 ] 

Francis De Brabandere commented on EMPIREDB-156:
------------------------------------------------

I suppose this can be closed now:

curl --head http://svn.apache.org/repos/asf/empire-db/site/doap_Empire-db.rdf
HTTP/1.1 200 OK
Date: Thu, 16 Aug 2012 09:15:38 GMT
Server: Apache/2.2.17 (Unix) mod_ssl/2.2.17 OpenSSL/1.0.0c DAV/2 mod_wsgi/3.1 Python/2.6.6 SVN/1.7.0
Last-Modified: Thu, 16 Aug 2012 08:38:17 GMT
ETag: "1373756//empire-db/site/doap_Empire-db.rdf"
Accept-Ranges: bytes
Content-Length: 2954
Vary: Accept-Encoding
Content-Type: application/rdf+xml; charset=UTF-8

For reference:
svn propset svn:mime-type "application/rdf+xml; charset=UTF-8" doap_Empire-db.rdf
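
The difference between the two curl responses in this thread is the charset parameter on the Content-Type header. As a small sketch, a client could extract it with Python's standard email.message module; the helper name charset_from_content_type is hypothetical, and this is not how the site-tools script is known to work.

```python
from email.message import Message
from typing import Optional


def charset_from_content_type(value: str) -> Optional[str]:
    """Return the lower-cased charset parameter of a Content-Type value, or None."""
    msg = Message()
    msg["Content-Type"] = value
    return msg.get_content_charset()


# Header from the first curl response (before the svn:mime-type change).
print(charset_from_content_type("application/rdf+xml"))
# Header from the second curl response (after the change).
print(charset_from_content_type("application/rdf+xml; charset=UTF-8"))
```

With the first header no charset is declared, so a client has to fall back on some default; with the second it is explicit.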

                

[jira] [Resolved] (EMPIREDB-156) DOAP file has encoding error

Posted by "Francis De Brabandere (JIRA)" <em...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/EMPIREDB-156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Francis De Brabandere resolved EMPIREDB-156.
--------------------------------------------

    Resolution: Fixed
    