You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Michael Klink (Jira)" <ji...@apache.org> on 2022/01/04 21:12:00 UTC

[jira] [Commented] (PDFBOX-5355) remove page from pdf with image violate conformance level pdf1.7

    [ https://issues.apache.org/jira/browse/PDFBOX-5355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17468853#comment-17468853 ] 

Michael Klink commented on PDFBOX-5355:
---------------------------------------

Your instruction {{boxDocument.removePage(0)}} merely removes an object from the page tree. It does _not_ look for references to that page and clean them up, though.

In this case there are references to that page in the document structure tree which remain after your instruction. Thus, they do not point anymore to an actual document page anymore. (Similarly links to that page etc. are not removed...)

Please be aware that many PDFBox instructions are low-level instructions and leave clean-up tasks to you.

> remove page from pdf with image violate conformance level pdf1.7
> ----------------------------------------------------------------
>
>                 Key: PDFBOX-5355
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5355
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 2.0.20, 3.0.0 JBIG2
>            Reporter: lappa-lappa
>            Priority: Major
>         Attachments: pdf_result.pdf, with_image.pdf
>
>
> open [https://www.pdf-online.com/osa/validate.aspx] and upload "with_image.pdf" document, validation passed
> Execute following code (update absolute paths to files):
> {{{}byte[] withImage = readFile("C:/r/{}}}{{{}pdf/{}}}{{{}with_image.pdf");{}}}
> {{try (PDDocument boxDocument = Loader.loadPDF(withImage)) {}}
> {{  boxDocument.removePage(0);}}
>     try (ByteArrayOutputStream bos = new ByteArrayOutputStream()) {
>         boxDocument.save(bos);
> {{    byte[] pdfBytes = bos.toByteArray();}}
> {{{}    Files.write(Path.of("C:/r/{}}}{{{}pdf/{}}}{{{}pdf_result.pdf"), pdfBytes);{}}}
>     }
> {{} catch (IOException e) {}}
> {{{}  e.printS{}}}tackTrace();
> {{}}}
> {{upload pdf_result.pdf into [https://www.pdf-online.com/osa/validate.aspx] , validation is not passed.}}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org