Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2017/10/03 14:15:00 UTC

[jira] [Commented] (PDFBOX-3951) Pages lost

    [ https://issues.apache.org/jira/browse/PDFBOX-3951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16189753#comment-16189753 ] 

Andreas Lehmkühler commented on PDFBOX-3951:
--------------------------------------------

The attached PDF is truncated. The document was incrementally updated, but many of the updated objects are lost. 2.0.7 wasn't able to read the updated objects and simply read only the original version of the PDF. The current code reads all readable objects and updates most of the objects, including the page dictionaries. Unfortunately, most of the page content streams are gone, which leads to the lost pages.

That said, 2.0.7 produces a seemingly correct result, but it isn't correct, and 2.0.8 gets the best out of the truncated PDF. IMHO, this isn't a regression but an improvement, however silly that might sound.

Evince on Linux gives the same result, and Acrobat (the old Linux version 9) can't open/repair the PDF.
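
For anyone who wants a rough check of whether a file is affected in the same way, here is a minimal sketch in plain Java. It is not PDFBox code and not what the parser does internally; the class RevisionScan and its methods are hypothetical names made up for this illustration. It relies on two assumptions: each saved revision of an incrementally updated PDF normally ends with its own "%%EOF" marker, and a complete file has "%%EOF" as its last non-whitespace bytes. Since "%%EOF" can in principle also occur inside stream data, treat the result as a heuristic only.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper for illustration only; not part of PDFBox.
final class RevisionScan {
    private static final byte[] EOF_MARKER =
            "%%EOF".getBytes(StandardCharsets.ISO_8859_1);

    // Counts "%%EOF" markers. Assumption: one marker per saved revision,
    // so a count above 1 hints at incremental updates.
    static int countEofMarkers(byte[] data) {
        int count = 0;
        for (int i = 0; i + EOF_MARKER.length <= data.length; i++) {
            if (matchesAt(data, i)) {
                count++;
                i += EOF_MARKER.length - 1; // skip past this marker
            }
        }
        return count;
    }

    // Returns true when the last non-whitespace bytes are "%%EOF".
    // A file cut off in the middle of an update fails this check.
    static boolean endsWithEofMarker(byte[] data) {
        int end = data.length;
        while (end > 0 && isWhitespace(data[end - 1])) {
            end--;
        }
        int start = end - EOF_MARKER.length;
        return start >= 0 && matchesAt(data, start);
    }

    private static boolean matchesAt(byte[] data, int offset) {
        for (int j = 0; j < EOF_MARKER.length; j++) {
            if (data[offset + j] != EOF_MARKER[j]) {
                return false;
            }
        }
        return true;
    }

    private static boolean isWhitespace(byte b) {
        return b == ' ' || b == '\t' || b == '\r' || b == '\n' || b == '\f' || b == 0;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java RevisionScan <file.pdf>");
            return;
        }
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        System.out.println("%%EOF markers found: " + countEofMarkers(data));
        System.out.println("ends with %%EOF: " + endsWithEofMarker(data));
    }
}
```

A file that reports at least one marker but does not end with "%%EOF" was probably cut off inside a later revision, which would match the situation described above. The sketch says nothing about which objects survived.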


> Pages lost
> ----------
>
>                 Key: PDFBOX-3951
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3951
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.8
>            Reporter: Tilman Hausherr
>              Labels: regression
>         Attachments: FIHUZWDDL2VGPOE34N6YHWSIGSH5LVGZ.pdf
>
>
> Pages that were present in 2.0.7 are lost; they are now blank. This starts with page 20.
> Same with files NJTRIAYPQAAG3CYVDRVG34PC6R367X7F,
> CRX4MZIDRTB4C5N5F4DATJX2PBAGW6ES and
> QSRWIZTTYRM2DV7IP6THTSHS74SFQH3V



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
