Posted to dev@pdfbox.apache.org by "Michael Klink (Jira)" <ji...@apache.org> on 2021/06/16 16:20:00 UTC

[jira] [Commented] (PDFBOX-5216) Is there a way to optimize by cleaning up duplicate objects?

    [ https://issues.apache.org/jira/browse/PDFBOX-5216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17364364#comment-17364364 ] 

Michael Klink commented on PDFBOX-5216:
---------------------------------------

Have you tried something along the lines of [https://stackoverflow.com/a/53544377/1729265]? (I haven't tested that code with newer versions of PDFBox, so it may need some cleanup, particularly with the PDFBox 3.0 pre-releases, but it should be a start.)
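
To illustrate the general idea of finding duplicate objects, here is a minimal sketch in plain Java. It is not the code from the linked answer and it uses no PDFBox API. It assumes the bodies of the candidate objects have already been extracted as byte arrays, and the names DuplicateFinder and findDuplicates are made up for this example. It groups the bodies by SHA-256 digest, confirms each match byte for byte, and returns a map from each duplicate's index to the index of the first identical entry.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Base64;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper for illustration only; not part of PDFBox.
final class DuplicateFinder {

    /**
     * Returns a map from the index of each duplicate body to the index
     * of the first body with identical content.
     */
    static Map<Integer, Integer> findDuplicates(List<byte[]> bodies) {
        MessageDigest sha;
        try {
            sha = MessageDigest.getInstance("SHA-256");
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("SHA-256 not available", e);
        }

        // Digest (Base64 text) -> indices of distinct bodies with that digest.
        Map<String, List<Integer>> seen = new HashMap<>();
        Map<Integer, Integer> replacements = new TreeMap<>();

        for (int i = 0; i < bodies.size(); i++) {
            byte[] body = bodies.get(i);
            String key = Base64.getEncoder().encodeToString(sha.digest(body));
            List<Integer> candidates = seen.computeIfAbsent(key, k -> new ArrayList<>());

            // Compare the actual bytes so a digest collision cannot merge different bodies.
            Integer match = null;
            for (int candidate : candidates) {
                if (Arrays.equals(bodies.get(candidate), body)) {
                    match = candidate;
                    break;
                }
            }
            if (match == null) {
                candidates.add(i);          // first time this content is seen
            } else {
                replacements.put(i, match); // i duplicates an earlier entry
            }
        }
        return replacements;
    }
}
```

In a real PDF the remaining work is to point every reference to a duplicate at the retained object and drop the now unused copies. That part depends on the PDF library and is not shown here.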

 

> Is there a way to optimize by cleaning up duplicate objects?
> ------------------------------------------------------------
>
>                 Key: PDFBOX-5216
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5216
>             Project: PDFBox
>          Issue Type: Wish
>            Reporter: yoonho
>            Priority: Major
>         Attachments: 스크린샷 2021-06-15 오후 2.02.21.png
>
>
> Is there a way to clean up duplicate objects using PDFBox?
> [http://gofile.me/4hSqO/Cis33w0Sa] - Original
> [http://gofile.me/4hSqO/7XKmWqUBB]  - Clean version
> I applied Adobe DC's Optimize option (see the attached file). As a result, the 48 MB PDF file was reduced to 19 MB. I think this is due to cleaning up duplicate objects in the PDF.
> Am I right? I would like to implement this process with PDFBox. How should I approach it?


