Posted to dev@pdfbox.apache.org by "Michael Klink (Jira)" <ji...@apache.org> on 2021/06/16 16:20:00 UTC
[jira] [Commented] (PDFBOX-5216) Is there a way to optimize by cleaning up duplicate objects?
[ https://issues.apache.org/jira/browse/PDFBOX-5216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17364364#comment-17364364 ]
Michael Klink commented on PDFBOX-5216:
---------------------------------------
Have you tried something along the lines of [https://stackoverflow.com/a/53544377/1729265]? (I haven't tested that code with newer versions of PDFBox, so it may need some cleanup, in particular for the PDFBox 3.0 pre-releases, but it should be a start.)
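To illustrate the general idea of duplicate cleanup, here is a minimal plain-JDK sketch. It is not the code from the linked answer and it uses no PDFBox API: the class DuplicateObjectSketch and its methods are hypothetical names, and it assumes you have already obtained a serialized byte form of each indirect object, keyed by object number. It only shows the detection step, which groups objects by a content hash and maps each duplicate to the first object seen with the same bytes.

```java
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, NOT part of PDFBox: hash-based duplicate detection
// over already-serialized objects (object number -> bytes).
final class DuplicateObjectSketch {

    /**
     * Returns a map from each duplicate object number to the object number
     * that was encountered first (in the input map's iteration order) with
     * identical bytes. Objects without duplicates do not appear in the result.
     */
    static Map<Integer, Integer> findDuplicates(Map<Integer, byte[]> objects) {
        Map<String, Integer> canonicalByDigest = new HashMap<>();
        Map<Integer, Integer> replacements = new LinkedHashMap<>();
        for (Map.Entry<Integer, byte[]> entry : objects.entrySet()) {
            String digest = sha256Hex(entry.getValue());
            // putIfAbsent returns the earlier object number if this digest was seen before.
            Integer canonical = canonicalByDigest.putIfAbsent(digest, entry.getKey());
            if (canonical != null) {
                replacements.put(entry.getKey(), canonical);
            }
        }
        return replacements;
    }

    /** Hex-encoded SHA-256 digest of the given bytes. */
    static String sha256Hex(byte[] data) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            StringBuilder sb = new StringBuilder();
            for (byte b : md.digest(data)) {
                sb.append(String.format("%02x", b));
            }
            return sb.toString();
        } catch (NoSuchAlgorithmException ex) {
            // SHA-256 must be supported by every Java platform implementation.
            throw new IllegalStateException(ex);
        }
    }
}
```

In a real document, every reference to a duplicate would then have to be redirected to its canonical object before the duplicates can be dropped. Objects that differ only in which duplicate they reference become identical after that step, so repeating the pass until nothing changes may find more. A stricter variant would also compare the bytes directly instead of trusting the digest alone.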
> Is there a way to optimize by cleaning up duplicate objects?
> ------------------------------------------------------------
>
> Key: PDFBOX-5216
> URL: https://issues.apache.org/jira/browse/PDFBOX-5216
> Project: PDFBox
> Issue Type: Wish
> Reporter: yoonho
> Priority: Major
> Attachments: 스크린샷 2021-06-15 오후 2.02.21.png
>
>
> Is there a way to clean up duplicate objects using PDFBox?
> [http://gofile.me/4hSqO/Cis33w0Sa] - Original
> [http://gofile.me/4hSqO/7XKmWqUBB] - Clean version
> I applied Adobe DC's Optimize option (see the attached screenshot). As a result, a 48 MB PDF file was reduced to 19 MB. I think this is due to cleaning up duplicate objects in the PDF.
> Am I right? I would like to implement this process with PDFBox. How should I approach it?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)