Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2011/05/29 19:33:47 UTC

[jira] [Issue Comment Edited] (PDFBOX-1018) Remove imageIO dependency (was: PDPage convertToImage bug creates white images from black and white pdf files.)

    [ https://issues.apache.org/jira/browse/PDFBOX-1018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13040661#comment-13040661 ] 

Andreas Lehmkühler edited comment on PDFBOX-1018 at 5/29/11 5:33 PM:
---------------------------------------------------------------------

I agree with Roland concerning the error message. We should change it to something more meaningful.

Due to the license issue we can't bundle pdfbox with the mentioned jar, so it is a good idea to use something else. IMO it doesn't make sense to add a new dependency only to use a very small piece of it. More importantly, AFAIK Sanselan doesn't support the needed CCITTFaxDecoder.

I'm experimenting with the TIFFFaxDecoder, which is part of Apache XMLGraphics [1]. It works with your PDF but fails with others. I have to dig deeper into it and try to understand the decoding algorithm.

[1] http://svn.apache.org/viewvc/xmlgraphics/commons/trunk/src/java/org/apache/xmlgraphics/image/codec/tiff/TIFFFaxDecoder.java?view=markup

      was (Author: lehmi):
    I agree with Roland concerning the error message. We should change it to something more meaningful.

Due to the license issue we can't bundle pdfbox with the mentioned jar, so it is a good idea to use something else. IMO it doesn't make sense to add a new dependency only to use a very small piece of it. More importantly, AFAIK Sanselan doesn't support the needed CCITTFaxDecoder.

I'm experimenting with the TIFFFaxDecoder, which is part of Apache XMLGraphics [1]. It works with your PDF but fails with others. I have to dig deeper into it and try to understand the decoding algorithm.

[1] http://svn.apache.org/viewvc/xmlgraphics/commons/trunk/src/java/org/apache/xmlgraphics/image/codec/tiff/TIFFImageEncoder.java?view=markup
  
> Remove imageIO dependency (was: PDPage convertToImage bug creates white images from black and white pdf files.)
> ---------------------------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1018
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1018
>             Project: PDFBox
>          Issue Type: Improvement
>    Affects Versions: 1.2.0, 1.2.1, 1.3.1, 1.4.0, 1.5.0
>         Environment: JDK 1.6.0_22
>            Reporter: Roland Quast
>            Assignee: Andreas Lehmkühler
>            Priority: Critical
>              Labels: pdfbox
>         Attachments: BlackAndWhiteBug.java, ColorWorks.java, PDFBOX1018-black_and_white1.png, black_and_white.pdf, color.pdf
>
>
> This bug has been reported in several earlier tickets, all of which were marked invalid. I am attempting to prove conclusively that it is a real issue that needs attention.
> I have attached a video showing very basic code that reproduces the issue. I have also attached the code that causes the issue, as well as a PDF file that works (a color one) and a black and white PDF file that doesn't.
> The main issue is that when reading a black and white PDF file (see the attached black and white PDF file), the following message is displayed and the output image is completely white.
> 26/05/2011 3:20:14 PM org.apache.pdfbox.util.operator.pagedrawer.Invoke process
> WARNING: getRGBImage returned NULL
> We use PDFBox in our program for reading PDF files, and at least 50 percent of our customers' PDF files (from different scanners) cannot be read because of this issue. This is a complete show stopper, and I'd be more than happy to help in any way I can to resolve it.
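
The symptom described in the report (a rendered page that is completely white) can be checked programmatically. The following is only a minimal sketch and is not taken from the attached BlackAndWhiteBug.java: the class and method names are hypothetical, and it assumes the rendered page is already available as a java.awt.image.BufferedImage. It uses only the JDK, so it does not depend on any particular PDFBox version.

```java
import java.awt.image.BufferedImage;

// Hypothetical helper for illustration only; not part of PDFBox.
final class BlankImageCheck {

    /** Returns true if every pixel is white (the alpha channel is ignored). */
    static boolean isAllWhite(BufferedImage image) {
        for (int y = 0; y < image.getHeight(); y++) {
            for (int x = 0; x < image.getWidth(); x++) {
                // getRGB returns ARGB; mask off alpha and compare the color bits.
                if ((image.getRGB(x, y) & 0xFFFFFF) != 0xFFFFFF) {
                    return false;
                }
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Stand-in for a rendered page: a small image filled with white.
        BufferedImage page = new BufferedImage(4, 4, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < page.getHeight(); y++) {
            for (int x = 0; x < page.getWidth(); x++) {
                page.setRGB(x, y, 0xFFFFFF);
            }
        }
        System.out.println(isAllWhite(page)); // prints "true"

        // A single black pixel means some content was drawn.
        page.setRGB(1, 1, 0x000000);
        System.out.println(isAllWhite(page)); // prints "false"
    }
}
```

A check like this could flag affected files in a batch run instead of relying on the "getRGBImage returned NULL" warning in the log, although a page that really is blank would be flagged as well.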
