You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Mario Jerome Kofler (Jira)" <ji...@apache.org> on 2021/12/23 14:21:00 UTC

[jira] [Commented] (PDFBOX-5317) Splitter: Problematic /Info causes big files

    [ https://issues.apache.org/jira/browse/PDFBOX-5317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17464609#comment-17464609 ] 

Mario Jerome Kofler commented on PDFBOX-5317:
---------------------------------------------

Unfortunately, there are still PDFs for which the Splitter does not work correctly,  i.e. the splitted documents have the same (full) size as the original document. I identified two cases:
 * PDFs created via the online PDF converter [https://online2pdf.com/]
 * PDFs created using the PHP library fpdf [http://www.fpdf.org/]

In these two cases (and possibly even more  PDF converter) the split pages still have the full size of the original document. Attached you find the original attachment for this report converted to PDF again using the [https://online2pdf.com/] tool for testing.

> Splitter: Problematic /Info causes big files
> --------------------------------------------
>
>                 Key: PDFBOX-5317
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5317
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.24
>            Reporter: Oliver Schmidtmer
>            Assignee: Tilman Hausherr
>            Priority: Major
>             Fix For: 2.0.25, 3.0.0 PDFBox
>
>         Attachments: graustufen 200dp1i-1.pdf, graustufen 200dp1i.pdf
>
>
> The attached pdf uses the same object for /Root and /Info, so /Info also contains the /Pages reference.
> When splitting this document, /Info gets a copy of the original object, which still contains all pages.
> I would propose to create a new /Info entry instead of just copying the old one.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org