You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "yoonho (Jira)" <ji...@apache.org> on 2021/06/15 05:12:00 UTC

[jira] [Created] (PDFBOX-5216) Is there a way to optimize by cleaning up duplicate objects?

yoonho created PDFBOX-5216:
------------------------------

             Summary: Is there a way to optimize by cleaning up duplicate objects?
                 Key: PDFBOX-5216
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5216
             Project: PDFBox
          Issue Type: Wish
            Reporter: yoonho
         Attachments: 스크린샷 2021-06-15 오후 2.02.21.png

Is there a way to clean up duplicate objects using PDFBox?

[http://gofile.me/4hSqO/Cis33w0Sa] - Original

[http://gofile.me/4hSqO/7XKmWqUBB]  - Clean version

I applied the Adobe DC's Optimize option (there is a picture of it). As a result, a 48mb PDF file was reduced to 19mb. I think this is due to cleaning up duplicate objects in the PDF.

Am I right? I would like to implement this process with PDFBox. How should I approach it?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org