You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Ilija Pavlic (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2012/01/05 12:25:39 UTC

[jira] [Issue Comment Edited] (PDFBOX-1202) org.apache.pdfbox.filter.FlateFilter decode SEVERE: Stop reading corrupt stream

    [ https://issues.apache.org/jira/browse/PDFBOX-1202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13180286#comment-13180286 ] 

Ilija Pavlic edited comment on PDFBOX-1202 at 1/5/12 11:24 AM:
---------------------------------------------------------------

I have edited the issue to include information about the encryption. I have used the inserted snippet to decrypt the document. Either way, I was able to extract text from pages 1-396 without problems. Therefore I do not think the issue should be closed. Can you please clarify the difference between reading and extracting? Thank you!
                
      was (Author: ipavlic):
    I have edited the issue to include information about the encryption. I have used the inserted snippet to decrypt the document. Either way, I was able to extract text from pages 1-396 without problems. Therefore I do not think the issue should be closed.
                  
> org.apache.pdfbox.filter.FlateFilter decode SEVERE: Stop reading corrupt stream
> -------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1202
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1202
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.6.0
>            Reporter: Ilija Pavlic
>            Priority: Critical
>         Attachments: IATAUnitedStates.pdf
>
>
> Error "org.apache.pdfbox.filter.FlateFilter decode SEVERE: Stop reading corrupt stream" thrown when extracting text.
> The document was loaded with the following snippet:
> document = PDDocument.load("C:/Users/ilija.pavlic/Downloads/TestInput.pdf");
>     if (document.isEncrypted()) {
>         try {
> 	    document.decrypt("");
> 	} catch (InvalidPasswordException e) {
> 	    System.err.println("Error: Document is encrypted with a password.");
> 	    System.exit(1);
> 	}
> }

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira