Posted to dev@pdfbox.apache.org by "Matt Hicks (JIRA)" <ji...@apache.org> on 2009/11/18 16:20:39 UTC

[jira] Created: (PDFBOX-562) Some PDFs cannot be processed yet are valid PDFs

Some PDFs cannot be processed yet are valid PDFs
------------------------------------------------

                 Key: PDFBOX-562
                 URL: https://issues.apache.org/jira/browse/PDFBOX-562
             Project: PDFBox
          Issue Type: Bug
          Components: Parsing
    Affects Versions: 0.8.0-incubator, 0.7.3, 0.7.2, 0.7.1, 0.7.0, 1.0.0
         Environment: Windows 7, JDK 1.6, Eclipse
            Reporter: Matt Hicks
            Priority: Critical


The following exception is thrown when attempting to process certain PDFs.  This occurs in the latest code from SVN and causes PDFBox to be useful in any production processing system since not all PDFs are supported.  I have uploaded an example PDF that reproduces the problem here: http://captiveimagination.com/download/test02.pdf

This occurs upon invocation of PDDocument.load(...):

Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:733)
	at test.Test.main(Test.java:21)
Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSDictionary cannot be cast to org.apache.pdfbox.cos.COSStream
	at org.apache.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:456)
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:207)
	... 4 more
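
The "Caused by" line above reports a downcast from COSDictionary to COSStream that failed at runtime. The following is a minimal sketch of that failure mode, not PDFBox's actual code: Dict and Stream are hypothetical stand-ins for the two COS classes (with Stream extending Dict), and both method names are invented for illustration. It contrasts an unchecked cast with an instanceof guard.

```java
import java.util.ArrayList;
import java.util.List;

public class CastGuardSketch {
    // Hypothetical stand-ins for the dictionary and stream types; NOT the real PDFBox classes.
    static class Dict { }
    static class Stream extends Dict { }

    // Unchecked downcast: throws ClassCastException as soon as a plain Dict is encountered.
    static List<Stream> streamsUnchecked(List<Dict> objects) {
        List<Stream> result = new ArrayList<Stream>();
        for (Dict d : objects) {
            result.add((Stream) d);
        }
        return result;
    }

    // Guarded variant: only objects that really are streams are cast; others are skipped.
    static List<Stream> streamsGuarded(List<Dict> objects) {
        List<Stream> result = new ArrayList<Stream>();
        for (Dict d : objects) {
            if (d instanceof Stream) {
                result.add((Stream) d);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Dict> objects = new ArrayList<Dict>();
        objects.add(new Stream());
        objects.add(new Dict()); // a plain dictionary where a stream was expected

        System.out.println("guarded stream count: " + streamsGuarded(objects).size());
        try {
            streamsUnchecked(objects);
        } catch (ClassCastException e) {
            System.out.println("unchecked cast failed: " + e.getClass().getName());
        }
    }
}
```

Whether skipping non-stream objects would be the right fix for this issue is an open question; a guard like this could also hide a parsing error that produced the plain dictionary in the first place.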


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (PDFBOX-562) Some PDFs cannot be processed yet are valid PDFs

Posted by "Matt Hicks (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Hicks updated PDFBOX-562:
------------------------------

    Description: 
The following exception is thrown when attempting to process certain PDFs.  This occurs in the latest code from SVN and causes PDFBox to be useless in any production processing system since not all PDFs are supported.  I have uploaded an example PDF that reproduces the problem here: http://captiveimagination.com/download/test02.pdf

This occurs upon invocation of PDDocument.load(...):

Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:733)
	at test.Test.main(Test.java:21)
Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSDictionary cannot be cast to org.apache.pdfbox.cos.COSStream
	at org.apache.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:456)
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:207)
	... 4 more


  was:
The following exception is thrown when attempting to process certain PDFs.  This occurs in the latest code from SVN and causes PDFBox to be useful in any production processing system since not all PDFs are supported.  I have uploaded an example PDF that reproduces the problem here: http://captiveimagination.com/download/test02.pdf

This occurs upon invocation of PDDocument.load(...):

Exception in thread "main" org.apache.pdfbox.exceptions.WrappedIOException
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:841)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:808)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:733)
	at test.Test.main(Test.java:21)
Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSDictionary cannot be cast to org.apache.pdfbox.cos.COSStream
	at org.apache.pdfbox.cos.COSDocument.dereferenceObjectStreams(COSDocument.java:456)
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:207)
	... 4 more



Correction
