You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Nghi Nguyen (Created) (JIRA)" <ji...@apache.org> on 2012/02/29 03:21:57 UTC

[jira] [Created] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Using PDFToImage creates blank images on pages.
-----------------------------------------------

                 Key: PDFBOX-1239
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
             Project: PDFBox
          Issue Type: Bug
          Components: PDModel
    Affects Versions: 1.6.0
         Environment: Windows XP and java version 1.6.0_29
            Reporter: Nghi Nguyen
             Fix For: 1.7.0


I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
It will create the images of each page but not display the image correctly. Either it will show the image as white or black. Here is the link for a sample pdf http://www.dtic.mil/doctrine/jel/service_pubs/afdd2_1_1.pdf
Below I have used this command and some of the output.

java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf

Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: i
Writing: afdd2_1_11.png



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Posted by "Nghi Nguyen (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nghi Nguyen updated PDFBOX-1239:
--------------------------------

    Attachment: afdd2_1_110.png
                afdd2_1_11.png
                afdd2_1_1.pdf

PDF causing some problems.
                
> Using PDFToImage creates blank images on pages.
> -----------------------------------------------
>
>                 Key: PDFBOX-1239
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: Windows XP and java version 1.6.0_29
>            Reporter: Nghi Nguyen
>              Labels: PDFBox, PDFToImage
>             Fix For: 1.7.0
>
>         Attachments: afdd2_1_1.pdf, afdd2_1_11.png, afdd2_1_110.png
>
>
> I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
> It will create the images of each page but not display the image correctly. Either it will show the image as white or black. Here is the link for a sample pdf http://www.dtic.mil/doctrine/jel/service_pubs/afdd2_1_1.pdf
> Below I have used this command and some of the output.
> java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf
> Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: BDC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EMC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: i
> Writing: afdd2_1_11.png

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Posted by "Nghi Nguyen (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nghi Nguyen updated PDFBOX-1239:
--------------------------------

    Description: 
I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output.

java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf

Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: i
Writing: afdd2_1_11.png



  was:
I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
It will create the images of each page but not display the image correctly. Either it will show the image as white or black. Here is the link for a sample pdf http://www.dtic.mil/doctrine/jel/service_pubs/afdd2_1_1.pdf
Below I have used this command and some of the output.

java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf

Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: i
Writing: afdd2_1_11.png



    
> Using PDFToImage creates blank images on pages.
> -----------------------------------------------
>
>                 Key: PDFBOX-1239
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: Windows XP and java version 1.6.0_29
>            Reporter: Nghi Nguyen
>              Labels: PDFBox, PDFToImage
>             Fix For: 1.7.0
>
>         Attachments: afdd2_1_1.pdf, afdd2_1_11.png, afdd2_1_110.png
>
>
> I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
> It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output.
> java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf
> Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: BDC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EMC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: i
> Writing: afdd2_1_11.png

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Posted by "Valerio Santinelli (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PDFBOX-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13240288#comment-13240288 ] 

Valerio Santinelli commented on PDFBOX-1239:
--------------------------------------------

I'm experiencing the same problem with a PDF with embedded subset fonts.. I'm attaching it to this issue as well.

                
> Using PDFToImage creates blank images on pages.
> -----------------------------------------------
>
>                 Key: PDFBOX-1239
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: Windows XP and java version 1.6.0_29
>            Reporter: Nghi Nguyen
>              Labels: Image, PDFBox, PDFToImage, Rendering
>             Fix For: 1.7.0
>
>         Attachments: afdd2_1_1.pdf, afdd2_1_11.png, afdd2_1_110.png, iTabloPDFPlusUserGuide.pdf
>
>
> I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
> It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to PDFBOX-1095
> java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf
> Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: BDC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EMC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: i
> Writing: afdd2_1_11.png

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Posted by "Andreas Lehmkühler (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler resolved PDFBOX-1239.
----------------------------------------

       Resolution: Not A Problem
    Fix Version/s:     (was: 1.7.0)
         Assignee: Andreas Lehmkühler

You have to add an ImageIO decoder for JPEG2000 as the pdf uses JPXEncoded streams, see PDFBOX-554 for further details.
                
> Using PDFToImage creates blank images on pages.
> -----------------------------------------------
>
>                 Key: PDFBOX-1239
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: Windows XP and java version 1.6.0_29
>            Reporter: Nghi Nguyen
>            Assignee: Andreas Lehmkühler
>              Labels: Image, PDFBox, PDFToImage, Rendering
>         Attachments: afdd2_1_1.pdf, afdd2_1_11.png, afdd2_1_110.png, iTabloPDFPlusUserGuide.pdf
>
>
> I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
> It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to PDFBOX-1095
> java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf
> Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: BDC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EMC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: i
> Writing: afdd2_1_11.png

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] [Updated] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Posted by "Nghi Nguyen (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nghi Nguyen updated PDFBOX-1239:
--------------------------------

    Description: 
I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to PDFBOX-1095

java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf

Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: i
Writing: afdd2_1_11.png



  was:
I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to https://issues.apache.org/jira/browse/PDFBOX-1095 (PDFBOX-1095)

java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf

Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: i
Writing: afdd2_1_11.png



    
> Using PDFToImage creates blank images on pages.
> -----------------------------------------------
>
>                 Key: PDFBOX-1239
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: Windows XP and java version 1.6.0_29
>            Reporter: Nghi Nguyen
>              Labels: Image, PDFBox, PDFToImage, Rendering
>             Fix For: 1.7.0
>
>         Attachments: afdd2_1_1.pdf, afdd2_1_11.png, afdd2_1_110.png
>
>
> I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
> It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to PDFBOX-1095
> java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf
> Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: BDC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EMC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: i
> Writing: afdd2_1_11.png

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Posted by "Valerio Santinelli (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Valerio Santinelli updated PDFBOX-1239:
---------------------------------------

    Attachment: iTabloPDFPlusUserGuide.pdf

Page 2 of this document doesn't render the text correctly
                
> Using PDFToImage creates blank images on pages.
> -----------------------------------------------
>
>                 Key: PDFBOX-1239
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: Windows XP and java version 1.6.0_29
>            Reporter: Nghi Nguyen
>              Labels: Image, PDFBox, PDFToImage, Rendering
>             Fix For: 1.7.0
>
>         Attachments: afdd2_1_1.pdf, afdd2_1_11.png, afdd2_1_110.png, iTabloPDFPlusUserGuide.pdf
>
>
> I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
> It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to PDFBOX-1095
> java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf
> Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: BDC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EMC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: i
> Writing: afdd2_1_11.png

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (PDFBOX-1239) Using PDFToImage creates blank images on pages.

Posted by "Nghi Nguyen (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nghi Nguyen updated PDFBOX-1239:
--------------------------------

    Description: 
I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to https://issues.apache.org/jira/browse/PDFBOX-1095 (PDFBOX-1095)

java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf

Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: i
Writing: afdd2_1_11.png



  was:
I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output.

java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf

Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: BDC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: EMC
Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
INFO: unsupported/disabled operation: i
Writing: afdd2_1_11.png



    
> Using PDFToImage creates blank images on pages.
> -----------------------------------------------
>
>                 Key: PDFBOX-1239
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1239
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDModel
>    Affects Versions: 1.6.0
>         Environment: Windows XP and java version 1.6.0_29
>            Reporter: Nghi Nguyen
>              Labels: Image, PDFBox, PDFToImage, Rendering
>             Fix For: 1.7.0
>
>         Attachments: afdd2_1_1.pdf, afdd2_1_11.png, afdd2_1_110.png
>
>
> I have been using PDFToImage to create images from the PDF documents and everything has been working great until I ran into this file.
> It will create the images of each page but not display the image correctly. Either it will show the image as white or black. I attached the pdf and some of the images it created. Below I have used this command and some of the output. Any ideas? Is it because it is an embedded image relating to https://issues.apache.org/jira/browse/PDFBOX-1095 (PDFBOX-1095)
> java -jar pdfbox-app-1.6.0.jar PDFToImage -imageType png -startPage 1 -endPage 10 afdd2_1_1.pdf
> Feb 28, 2012 9:14:57 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: BDC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EMC
> Feb 28, 2012 9:14:58 PM org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: i
> Writing: afdd2_1_11.png

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira