Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2015/04/03 16:31:53 UTC

[jira] [Commented] (PDFBOX-2742) PDFSplit ignores global resources

    [ https://issues.apache.org/jira/browse/PDFBOX-2742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14394478#comment-14394478 ] 

Tilman Hausherr commented on PDFBOX-2742:
-----------------------------------------

We have a choice between two evils:
1. Ignore global resources, as has been done since PDFBOX-28 (the behavior this issue reports).
2. Keep global resources, as is done in 2.0. File 239665 from PDFBOX-2101 then produces extremely large output, because in that file everything is a global resource: the original and each split file are 4 MB in size.

Possible solution for version 2.0: use code like that in the ExtractImages class to find out which resources are actually used. However, I don't know whether that code would also "visit" all nested streams (e.g. in forms, annotations, etc.).
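
To illustrate the idea of keeping only the resources a page really uses, here is a minimal sketch in plain Java. It does not use the PDFBox API and is not the ExtractImages code: Resource and UsedResourceFinder are hypothetical stand-ins, content streams are modeled as plain strings, and every /Name token is treated as a resource reference. A real implementation would have to parse operators properly, keep the resource categories (fonts, XObjects, etc.) apart, and follow each form's own resource dictionary as well as annotation appearance streams.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical stand-in for a PDF resource; not a PDFBox class. */
final class Resource {
    // Content stream text for a form-like resource, or null when the
    // resource has no nested content (e.g. a font or an image).
    final String content;

    Resource(String content) {
        this.content = content;
    }
}

/** Hypothetical helper that collects the resource names a page actually references. */
final class UsedResourceFinder {

    /**
     * Returns the names in globalResources that are referenced by pageContent,
     * either directly or through the content of a nested resource.
     */
    static Set<String> findUsed(String pageContent, Map<String, Resource> globalResources) {
        Set<String> used = new LinkedHashSet<>();
        Deque<String> pending = new ArrayDeque<>();
        pending.push(pageContent);

        while (!pending.isEmpty()) {
            String stream = pending.pop();
            for (String token : stream.trim().split("\\s+")) {
                if (!token.startsWith("/")) {
                    continue; // not a name token
                }
                String name = token.substring(1);
                Resource resource = globalResources.get(name);
                // used.add() returns false for names already seen, which also
                // prevents endless loops on self-referencing forms.
                if (resource != null && used.add(name) && resource.content != null) {
                    pending.push(resource.content); // visit the nested stream too
                }
            }
        }
        return used;
    }
}
```

With a page content of "BT /F1 12 Tf (Hi) Tj ET /Fm1 Do" and a form Fm1 whose content draws /Im1, the sketch reports F1, Fm1 and Im1 as used, while an unreferenced global resource such as Im2 is left out and would not have to be copied into the split file.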



> PDFSplit ignores global resources
> ---------------------------------
>
>                 Key: PDFBOX-2742
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2742
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Utilities, Writing
>    Affects Versions: 1.8.9, 1.8.10
>         Environment: Linux+Java (Oracle)
>            Reporter: Wolfgang S. Kechel
>             Fix For: 1.8.10
>
>         Attachments: original.pdf
>
>
> When using PDFSplit to split the attached one-page document into single pages, the resulting page is incomplete.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
