Posted to dev@pdfbox.apache.org by "Roel Pieters (JIRA)" <ji...@apache.org> on 2011/05/11 15:35:47 UTC

[jira] [Issue Comment Edited] (PDFBOX-955) Can't extract b/w images from PDF

    [ https://issues.apache.org/jira/browse/PDFBOX-955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13031625#comment-13031625 ] 

Roel Pieters edited comment on PDFBOX-955 at 5/11/11 1:34 PM:
--------------------------------------------------------------

I still have the blank image problem, even with jai_imageio.jar. I attached photo.pdf and photo.jpg. Do I just need the jar on my classpath, or do I also need to install the library for my operating system?

      was (Author: roelpi):
    I have the blank image problem still. Also with jai_imageio.jar. I attached photo.pdf and photo.jpg.
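
One way to narrow down the classpath question is to ask ImageIO which reader plugins it has actually registered. The following is only a minimal sketch using the standard javax.imageio API; the class name ImageIoPluginCheck and the helper hasReader are made up for illustration, and the assumption that the relevant plugin answers to the format name "tiff" is mine, not something confirmed in this issue.

```java
import javax.imageio.ImageIO;
import javax.imageio.ImageReader;
import java.util.Arrays;
import java.util.Iterator;
import java.util.TreeSet;

// Hypothetical diagnostic class, not part of PDFBox.
public class ImageIoPluginCheck {

    /** Returns true if at least one ImageIO reader is registered for the given format name. */
    static boolean hasReader(String formatName) {
        Iterator<ImageReader> readers = ImageIO.getImageReadersByFormatName(formatName);
        return readers.hasNext();
    }

    public static void main(String[] args) {
        // Re-scan the classpath for plugin jars before querying the registry.
        ImageIO.scanForPlugins();

        // Print every format name ImageIO can currently read, sorted for readability.
        System.out.println(new TreeSet<String>(Arrays.asList(ImageIO.getReaderFormatNames())));

        // Assumption: "tiff" is the format name of interest for b/w images.
        System.out.println("tiff reader present: " + hasReader("tiff"));
    }
}
```

If the extra format names do not show up when the program is run with the same classpath as the extraction tool, the jar is probably not being picked up at all. Whether native libraries are also required is not answered by this check.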
  
> Can't extract b/w images from PDF
> ---------------------------------
>
>                 Key: PDFBOX-955
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-955
>             Project: PDFBox
>          Issue Type: Improvement
>    Affects Versions: 1.4.0
>         Environment: Windows XP prof, Java 1.6.0_22, Netbeans 6.9.1
>            Reporter: Tilman Hausherr
>            Priority: Minor
>              Labels: extract
>         Attachments: ExtractImages.java, d0000040-01.png, d0000040.pdf, photo.jpg, photo.pdf
>
>
> I wrote a test application using org.apache.pdfbox.ExtractImages to... extract images as PNG. (This is the start of something bigger, which involves compiling statistics on the content of over a million pages in PDF files.) However, every image I get is either all black or all white when I test on our own PDF files. I did get correct images from a file that contained color images. To extract, I tried page.convertToImage() followed by ImageIO.write(), and I also tried PDFImageWriter; neither worked for b/w images.
> The sample PDF is not confidential. It produces the warning "getRGBImage returned NULL", but other PDFs that do not produce the warning (and are confidential) also fail.
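
When testing across many files, the "all black or all white" symptom described above can be detected automatically rather than by eye. Below is a rough sketch that uses only java.awt.image from the JDK; the class name BlankImageCheck and the method isSingleColor are hypothetical helpers, not part of PDFBox, and the check assumes that an image with a single pixel value throughout counts as blank.

```java
import java.awt.image.BufferedImage;

// Hypothetical helper for flagging blank extraction results; not part of PDFBox.
public class BlankImageCheck {

    /** Returns true if every pixel has the same ARGB value as the top-left pixel. */
    static boolean isSingleColor(BufferedImage img) {
        int first = img.getRGB(0, 0);
        for (int y = 0; y < img.getHeight(); y++) {
            for (int x = 0; x < img.getWidth(); x++) {
                if (img.getRGB(x, y) != first) {
                    return false; // found a differing pixel, so the image has content
                }
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // A freshly created 1-bit image is entirely black, like the failing output.
        BufferedImage img = new BufferedImage(8, 8, BufferedImage.TYPE_BYTE_BINARY);
        System.out.println("blank before drawing: " + isSingleColor(img));

        // Set one white pixel; the image is no longer a single color.
        img.setRGB(3, 3, 0xFFFFFFFF);
        System.out.println("blank after drawing: " + isSingleColor(img));
    }
}
```

In a batch run, the same check could be applied to each extracted image before it is written out, so that files producing blank output are logged for closer inspection.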

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira