Posted to dev@pdfbox.apache.org by "Timo Boehme (JIRA)" <ji...@apache.org> on 2012/06/13 23:53:42 UTC

[jira] [Updated] (PDFBOX-1099) Only parsing object streams if they are referenced by the xref table / stream

     [ https://issues.apache.org/jira/browse/PDFBOX-1099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Timo Boehme updated PDFBOX-1099:
--------------------------------

    Attachment: 2012-06-13_COSDocument_xrefObjStream.patch

The patch adds objects from object streams even if an object with the same number already exists, provided the object from the stream is referenced in the xref table.
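
A minimal sketch of the merge rule described in this comment, for illustration only. The class, field, and method names are hypothetical and are not taken from the attached patch or from the PDFBox API; a String stands in for a parsed COS object. How unreferenced objects are handled when no object with that number exists yet is an assumption of this sketch.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical model of the merge rule; names are not taken from the patch. */
final class ObjectStreamMergeSketch {
    // object number -> parsed object (a String stands in for a COS object)
    final Map<Long, String> pool = new HashMap<>();
    // object number -> number of the object stream that the xref table
    // or xref stream names as its container
    final Map<Long, Long> xrefCompressed = new HashMap<>();

    /** Returns true if the object read from the stream was stored in the pool. */
    boolean addFromObjectStream(long streamNumber, long objectNumber, String value) {
        Long owner = xrefCompressed.get(objectNumber);
        boolean referenced = owner != null && owner == streamNumber;
        // Assumption: an object the xref does not reference is kept only
        // when nothing else has claimed that object number yet.
        if (referenced || !pool.containsKey(objectNumber)) {
            pool.put(objectNumber, value);
            return true;
        }
        return false;
    }
}
```

In this model, an object that the xref table places in the stream replaces an earlier object with the same number, while an unreferenced duplicate is ignored.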
                
> Only parsing object streams if they are referenced by the xref table / stream
> -----------------------------------------------------------------------------
>
>                 Key: PDFBOX-1099
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1099
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>            Reporter: Thomas Chojecki
>            Assignee: Timo Boehme
>         Attachments: 2012-06-13_COSDocument_xrefObjStream.patch
>
>
> Some PDF documents contain object streams that are not referenced through the xref table / stream. To prevent the stream parser from dereferencing such object streams, we need to implement the type 2 part (case 2) inside the PDFXRefStreamParser and store the objects inside a map. This will take some load off the stream parser (see PDFBOX-1098) and cause fewer failures while parsing a document.
> A sample PDF can be obtained from issue PDFBOX-1098, and a patch is coming soon.
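
The idea in the description can be sketched as follows. This is a rough illustration, not the PDFXRefStreamParser code: the class and method names are hypothetical, and the sketch assumes the xref stream data is already decoded and covers a single /Index subsection. It relies on the PDF cross-reference stream layout, in which a type 2 entry stores the number of the containing object stream in its second field.

```java
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

/** Hypothetical helper; not part of the PDFBox API. */
final class XrefStreamSketch {

    /** Reads a big-endian unsigned integer of the given width (width 0 yields 0). */
    static long readField(byte[] data, int offset, int width) {
        long value = 0;
        for (int i = 0; i < width; i++) {
            value = (value << 8) | (data[offset + i] & 0xFF);
        }
        return value;
    }

    /**
     * Collects type 2 entries as: object number -> number of the object
     * stream that holds it. w is the /W array of the xref stream and
     * firstObject is the first object number of the subsection.
     */
    static Map<Long, Long> compressedObjects(byte[] data, int[] w, long firstObject) {
        int entrySize = w[0] + w[1] + w[2];
        if (entrySize <= 0) {
            throw new IllegalArgumentException("invalid /W array");
        }
        Map<Long, Long> result = new LinkedHashMap<>();
        for (int pos = 0, n = 0; pos + entrySize <= data.length; pos += entrySize, n++) {
            // A zero-width type field means the entry type defaults to 1.
            long type = (w[0] == 0) ? 1 : readField(data, pos, w[0]);
            if (type == 2) {
                long objStreamNumber = readField(data, pos + w[0], w[1]);
                result.put(firstObject + n, objStreamNumber);
            }
        }
        return result;
    }

    /** Object streams worth parsing: those named by at least one type 2 entry. */
    static Set<Long> referencedObjectStreams(Map<Long, Long> compressed) {
        return new HashSet<>(compressed.values());
    }
}
```

A parser built this way could skip any object stream whose number is absent from the returned set, which is the load reduction the description refers to.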
