You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Eric Leleu (JIRA)" <ji...@apache.org> on 2013/08/18 15:28:47 UTC

[jira] [Resolved] (PDFBOX-1674) Preflight doesn't correctly parse PDF if obj identifier not followed by line terminator

     [ https://issues.apache.org/jira/browse/PDFBOX-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eric Leleu resolved PDFBOX-1674.
--------------------------------

    Resolution: Fixed
      Assignee: Eric Leleu

Fix in revision r1515101
                
> Preflight doesn't correctly parse PDF if obj identifier not followed by line terminator
> ---------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1674
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1674
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Preflight
>    Affects Versions: 2.0.0
>         Environment: Win 7
>            Reporter: Johan van der Knijff
>            Assignee: Eric Leleu
>            Priority: Minor
>             Fix For: 2.0.0
>
>
> For some test files on the Adobe Acrobat Engineering website, Preflight output looks like this:
> <preflight name="Disney-Flash.pdf">
>   <executionTimeMS>210</executionTimeMS>
>   <isValid type="">false</isValid>
>   <errors count="3">
>     <error count="1">
>       <code>1.0</code>
>       <details>Syntax error, Expected pattern 'obj but missed at character 'o'</details>
>     </error>
>     <error count="1">
>       <code>1.2.1</code>
>       <details>Body Syntax error, Expected pattern 'obj but missed at character 'o'</details>
>     </error>
>     <error count="1">
>       <code>1.2.1</code>
>       <details>Body Syntax error, Single space expected</details>
>     </error>
>   </errors>
> </preflight>
> Which suggests that Preflight doesn't correctly parse the objects. This is confirmed by a look at some of the offending PDFs in a hex editor, which reveals that the object identifiers in them are not terminated by any EOL character(s). AFAIK this is allowed in both PDF and PDF/A-1. More details + links to test files here ('Multimedia' table and below):
> http://www.openplanetsfoundation.org/blogs/2013-07-25-identification-pdf-preservation-risks-sequel

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira