You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2014/05/27 17:31:02 UTC

[jira] [Commented] (PDFBOX-1075) Can't get images from a PDF

    [ https://issues.apache.org/jira/browse/PDFBOX-1075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14009797#comment-14009797 ] 

Tilman Hausherr commented on PDFBOX-1075:
-----------------------------------------

Did anybody keep the mentioned file? The original link is broken; there are several files with that name on the web, including one in Thai.

> Can't get images from a PDF
> ---------------------------
>
>                 Key: PDFBOX-1075
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1075
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.6.0, 1.7.0
>            Reporter: Antoni Mylka
>            Assignee: Andreas Lehmkühler
>             Fix For: 1.7.0
>
>
> This is a regression. In 1.4.0 I was able to extract images from a PDF file. In 1.6 and the current trunk I get exceptions:
> SEVERE: java.lang.IllegalArgumentException: Raster BytePackedRaster: width = 1000 height = 32 #channels 1 xOff = 0 yOff = 0 is incompatible with ColorModel IndexColorModel: #pixelBits = 1 numComponents = 3 color space = java.awt.color.ICC_ColorSpace@1050169 transparency = 1 transIndex   = -1 has alpha = false isAlphaPre = false
> java.lang.IllegalArgumentException: Raster BytePackedRaster: width = 1000 height = 32 #channels 1 xOff = 0 yOff = 0 is incompatible with ColorModel IndexColorModel: #pixelBits = 1 numComponents = 3 color space = java.awt.color.ICC_ColorSpace@1050169 transparency = 1 transIndex   = -1 has alpha = false isAlphaPre = false
> 	at java.awt.image.BufferedImage.<init>(BufferedImage.java:611)
> 	at org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:252)
> 	at org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap.write2OutputStream(PDPixelMap.java:289)
> 	at org.apache.pdfbox.TestGovdocs148902.test148902(TestGovdocs148902.java:58)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> 	at java.lang.reflect.Method.invoke(Method.java:597)
> 	at junit.framework.TestCase.runTest(TestCase.java:168)
> 	at junit.framework.TestCase.runBare(TestCase.java:134)
> 	at junit.framework.TestResult$1.protect(TestResult.java:110)
> 	at junit.framework.TestResult.runProtected(TestResult.java:128)
> 	at junit.framework.TestResult.run(TestResult.java:113)
> 	at junit.framework.TestCase.run(TestCase.java:124)
> 	at junit.framework.TestSuite.runTest(TestSuite.java:232)
> 	at junit.framework.TestSuite.run(TestSuite.java:227)
> 	at org.junit.internal.runners.JUnit38ClassRunner.run(JUnit38ClassRunner.java:83)
> 	at org.eclipse.jdt.internal.junit4.runner.JUnit4TestReference.run(JUnit4TestReference.java:49)
> 	at org.eclipse.jdt.internal.junit.runner.TestExecution.run(TestExecution.java:38)
> 	at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:467)
> 	at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.runTests(RemoteTestRunner.java:683)
> 	at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.run(RemoteTestRunner.java:390)
> 	at org.eclipse.jdt.internal.junit.runner.RemoteTestRunner.main(RemoteTestRunner.java:197)
> http://domex.nps.edu/corp/files/govdocs1/148/148902.pdf
> I pasted it into the src/test/resources/pdfparser folder and run a test case like this:
> public class TestGovdocs148902 extends TestCase
> {
>     public void test148902() throws IOException {
>         PDDocument doc = PDDocument.load( "src/test/resources/pdfparser/148902.pdf");
>         int imageCounter = 0;
>         COSDocument cosDoc = doc.getDocument();
>         List<COSObject> list = cosDoc.getObjectsByType(COSName.XOBJECT);
>         
>         for (COSObject cosOb : list) {
>             COSBase baseObject = cosOb.getObject();
>             if (baseObject != null && baseObject instanceof COSStream) {
>                 COSStream st = (COSStream)baseObject;
>                 String subtype = st.getNameAsString(COSName.SUBTYPE);
>                 if (subtype != null && subtype.equalsIgnoreCase("image")) {
>                     PDXObjectImage ximage = (PDXObjectImage)PDXObject.createXObject( st );
>                     if (ximage != null && ximage.getWidth() >= 5 && ximage.getHeight() >= 5) {
>                         ByteArrayOutputStream baos = new ByteArrayOutputStream();
>                         ximage.write2OutputStream(baos);
>                         byte [] bytes = baos.toByteArray();
>                         if (bytes.length > 0) {
>                             imageCounter++;
>                         }
>                     }
>                 }
>             }
>         }
>         
>         assertEquals(32, imageCounter);
>     }
> }
> The test cases passes in 1.4.



--
This message was sent by Atlassian JIRA
(v6.2#6252)