Posted to dev@pdfbox.apache.org by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/08/30 14:47:55 UTC

[jira] Created: (PDFBOX-806) Failure to extract dc:description when the value is the node text

Failure to extract dc:description when the value is the node text
-----------------------------------------------------------------

                 Key: PDFBOX-806
                 URL: https://issues.apache.org/jira/browse/PDFBOX-806
             Project: PDFBox
          Issue Type: Bug
          Components: JempBox
    Affects Versions: 1.2.1
            Reporter: Staffan Olsson


I cannot get the dc:description from the following xmpmeta, where the value is the plain text of the node:
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0">
   <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
      <rdf:Description rdf:about=""
            xmlns:exif="http://ns.adobe.com/exif/1.0/">
         <exif:UserComment>exif</exif:UserComment>
      </rdf:Description>
      <rdf:Description rdf:about=""
            xmlns:dc="http://purl.org/dc/elements/1.1/">
         <dc:description>from XnViewMP 0.26</dc:description>
      </rdf:Description>
   </rdf:RDF>
</x:xmpmeta>

But it works with this one, where the value is wrapped in an rdf:Alt:
<x:xmpmeta xmlns:x="adobe:ns:meta/" x:xmptk="XMP Core 4.4.0-Exiv2">
 <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
  <rdf:Description rdf:about=""
    xmlns:dc="http://purl.org/dc/elements/1.1/">
   <dc:description>
    <rdf:Alt>
     <rdf:li xml:lang="x-default">dc:description</rdf:li>
    </rdf:Alt>
   </dc:description>
  </rdf:Description>
 </rdf:RDF>
</x:xmpmeta>

Both of them are valid according to http://www.w3.org/RDF/Validator/. My interpretation of http://dublincore.org/documents/dcmes-xml/ is that the first format should be ok for Dublin Core too.
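
To make the difference between the two forms concrete, here is a minimal sketch of a reader that accepts both. It uses only the JDK's DOM API and is not JempBox code. The class name DcDescriptionReader, the method readDescription and the constants PLAIN and ALT are hypothetical names made up for this illustration. They are not taken from the attached test or from any fix. The sketch assumes that the first dc:description element in the packet is the one wanted and that x-default is the preferred language.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.XMLConstants;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

public class DcDescriptionReader {
    static final String DC = "http://purl.org/dc/elements/1.1/";
    static final String RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#";

    // Shortened versions of the two packets from this report.
    static final String PLAIN =
        "<x:xmpmeta xmlns:x=\"adobe:ns:meta/\">"
        + "<rdf:RDF xmlns:rdf=\"" + RDF + "\">"
        + "<rdf:Description rdf:about=\"\" xmlns:dc=\"" + DC + "\">"
        + "<dc:description>from XnViewMP 0.26</dc:description>"
        + "</rdf:Description></rdf:RDF></x:xmpmeta>";

    static final String ALT =
        "<x:xmpmeta xmlns:x=\"adobe:ns:meta/\">"
        + "<rdf:RDF xmlns:rdf=\"" + RDF + "\">"
        + "<rdf:Description rdf:about=\"\" xmlns:dc=\"" + DC + "\">"
        + "<dc:description><rdf:Alt>"
        + "<rdf:li xml:lang=\"x-default\">dc:description</rdf:li>"
        + "</rdf:Alt></dc:description>"
        + "</rdf:Description></rdf:RDF></x:xmpmeta>";

    /** Returns the first dc:description value, or null if there is none. */
    public static String readDescription(String xmp) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // required for the *NS lookups below
        Document doc = factory.newDocumentBuilder()
            .parse(new ByteArrayInputStream(xmp.getBytes(StandardCharsets.UTF_8)));

        NodeList found = doc.getElementsByTagNameNS(DC, "description");
        if (found.getLength() == 0) {
            return null;
        }
        Element description = (Element) found.item(0);

        NodeList items = description.getElementsByTagNameNS(RDF, "li");
        if (items.getLength() == 0) {
            // First form: the value is the text of the node itself.
            return description.getTextContent().trim();
        }

        // Second form: prefer the x-default entry, else fall back to the first one.
        Element chosen = (Element) items.item(0);
        for (int i = 0; i < items.getLength(); i++) {
            Element li = (Element) items.item(i);
            if ("x-default".equals(li.getAttributeNS(XMLConstants.XML_NS_URI, "lang"))) {
                chosen = li;
                break;
            }
        }
        return chosen.getTextContent().trim();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(readDescription(PLAIN));
        System.out.println(readDescription(ALT));
    }
}
```

The point of the sketch is the fallback: if there are no rdf:li entries under dc:description, the text of the node itself is treated as the value instead of being dropped.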

See attached unit test.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PDFBOX-806) Failure to extract dc:description when the value is the node text

Posted by "Staffan Olsson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PDFBOX-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12915752#action_12915752 ] 

Staffan Olsson commented on PDFBOX-806:
---------------------------------------

Fixed in my GitHub fork: http://github.com/solsson/pdfbox/commit/8815bfae058dd4469222f9aeba3312636d887c0b



[jira] Commented: (PDFBOX-806) Failure to extract dc:description when the value is the node text

Posted by "Staffan Olsson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PDFBOX-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12915754#action_12915754 ] 

Staffan Olsson commented on PDFBOX-806:
---------------------------------------

Added Apache license header to the new test in http://github.com/solsson/pdfbox/commit/a6df04c97e0f129f06884ebe55e959950db41d8d



[jira] Updated: (PDFBOX-806) Failure to extract dc:description when the value is the node text

Posted by "Staffan Olsson (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Staffan Olsson updated PDFBOX-806:
----------------------------------

    Attachment: JempboxTest.java
