You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "John Logan (JIRA)" <ji...@apache.org> on 2015/12/05 00:45:11 UTC

[jira] [Created] (PDFBOX-3153) Direct JPEG extraction results in invalid images in 2.0.0 releases.

John Logan created PDFBOX-3153:
----------------------------------

             Summary: Direct JPEG extraction results in invalid images in 2.0.0 releases.
                 Key: PDFBOX-3153
                 URL: https://issues.apache.org/jira/browse/PDFBOX-3153
             Project: PDFBox
          Issue Type: Bug
          Components: PDModel
    Affects Versions: 2.0.0
         Environment: Observed on both Linux and Mac
            Reporter: John Logan


When I run pdfbox-app ExtractImages on a PDF containing an image with a DeviceRGB colorspace, the resulting JPEG file is very large (5.3MB, while the source PDF is 320KB).

I see this with the 2.0.0-RC2 release, I also encounter the problem with a build from today's trunk.

If I modify the code to force usage of ImageIO, a valid JPEG file results.

The image extracts properly in the 1.8.10 version.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org