You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2011/02/02 22:37:30 UTC
[jira] Issue Comment Edited: (PDFBOX-955) Can't extract b/w images
from PDF
[ https://issues.apache.org/jira/browse/PDFBOX-955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12989806#comment-12989806 ]
Tilman Hausherr edited comment on PDFBOX-955 at 2/2/11 9:36 PM:
----------------------------------------------------------------
Yes we use CCITT Fax4 a lot. I did now add jai_core.jar and jai_codec.jar (from JAI 1.3 at http://java.sun.com/javase/technologies/desktop/media/jai/) to the lib list. However this didn't change anything. Then I removed both libs and took your URL (which is outdated) and added jai_imageio_windows-i586.jar to the lib list instead. This didn't help either :-(
was (Author: tilman):
Yes we use CCITT Fax4 a lot. I did now add jai_core.jar and jai_codec.jar (from JAI 1.3 at http://java.sun.com/javase/technologies/desktop/media/jai/) to the lib list. However this didn't change anything. Then I removed both libs and took your URL (which is outdated) and added jai_imageio_windows-i586.jar to the lib list. This didn't help either :-(
> Can't extract b/w images from PDF
> ---------------------------------
>
> Key: PDFBOX-955
> URL: https://issues.apache.org/jira/browse/PDFBOX-955
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 1.4.0
> Environment: Windows XP prof, Java 1.6.0_22, Netbeans 6.9.1
> Reporter: Tilman Hausherr
> Priority: Blocker
> Labels: extract
> Attachments: ExtractImages.java, d0000040-01.png, d0000040.pdf
>
>
> I wrote a test application using org.apache.pdfbox.ExtractImages to... extract images as PNG. (This is the start of something bigger, which involves making a statistic about the content of over a million pages within PDF files) However all images I get are all black or all white when I test on our own PDF files. I did get correct images from a file that had color images. To extract, I tried page.convertToImage() and then writing with ImageIO.write(), but I also tried using PDFImageWriter, neither had success for b/w images.
> The sample PDF is not confidential; it does give a warning "getRGBImage returned NULL" but other PDFs that don't give the warning (but are confidential) also fail.
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira