Posted to dev@pdfbox.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/03/28 21:32:25 UTC

[jira] [Comment Edited] (PDFBOX-3292) Error reading stream, expected='endstream' actual='' in non-truncated files

    [ https://issues.apache.org/jira/browse/PDFBOX-3292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214733#comment-15214733 ] 

Tim Allison edited comment on PDFBOX-3292 at 3/28/16 7:31 PM:
--------------------------------------------------------------

links: [051987|http://162.242.228.174/docs/govdocs1/051/051987.pdf] and [869772.pdf|http://162.242.228.174/docs/govdocs1/869/869772.pdf]


was (Author: tallison@mitre.org):
links: [051987|http://162.242.228.174/docs/govdocs1/051/051987.pdf] and [869772.pdf|http://162.242.228.174/docs/govdocs1/869/869772.pdf.pdf]

> Error reading stream, expected='endstream' actual='' in non-truncated files
> ---------------------------------------------------------------------------
>
>                 Key: PDFBOX-3292
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3292
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.0
>            Reporter: Tim Allison
>            Priority: Minor
>
> When PDF files are truncated, one of the most common exceptions in PDFBox 2.0.0 is:
> {noformat}
> java.io.IOException: Error reading stream, expected='endstream' actual='' at offset 165888
> 	at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:999)
> 	at org.apache.pdfbox.pdfparser.COSParser.parseXrefObjStream(COSParser.java:326)
> 	at org.apache.pdfbox.pdfparser.COSParser.parseXref(COSParser.java:287)
> 	at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:192)
> 	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:249)
> 	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:847)
> {noformat}
> There are two files in govdocs1 that are NOT truncated and trigger this exception in 2.0.0, but were parsed by PDFBox 1.8.11 with the classic parser.
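
For triage, a rough way to separate truncated files from files that merely trigger the same exception is to look for the %%EOF marker near the end of the file. The sketch below is only a heuristic and is not part of PDFBox: the class name EofMarkerCheck, the method hasEofMarker, and the 1024-byte tail window are assumptions made for illustration, and a file that passes the check can still be damaged elsewhere.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper (not a PDFBox class): flags files with no %%EOF marker near the end.
class EofMarkerCheck {
    // Assumed window size; many readers tolerate %%EOF anywhere in the last ~1 KB.
    private static final int TAIL_WINDOW = 1024;
    private static final byte[] EOF_MARKER = "%%EOF".getBytes(StandardCharsets.US_ASCII);

    // Returns true if "%%EOF" starts somewhere in the last TAIL_WINDOW bytes of data.
    static boolean hasEofMarker(byte[] data) {
        int start = Math.max(0, data.length - TAIL_WINDOW);
        // Scan backwards from the last position where the marker could still fit.
        for (int i = data.length - EOF_MARKER.length; i >= start; i--) {
            if (matchesAt(data, i)) {
                return true;
            }
        }
        return false;
    }

    // Compares the marker byte by byte against data starting at pos.
    private static boolean matchesAt(byte[] data, int pos) {
        for (int j = 0; j < EOF_MARKER.length; j++) {
            if (data[pos + j] != EOF_MARKER[j]) {
                return false;
            }
        }
        return true;
    }

    // Usage: java EofMarkerCheck file1.pdf file2.pdf
    public static void main(String[] args) throws IOException {
        for (String name : args) {
            // Reads the whole file for simplicity; fine for a sketch, wasteful for large files.
            byte[] data = Files.readAllBytes(Paths.get(name));
            String verdict = hasEofMarker(data)
                    ? "has %%EOF near end"
                    : "no %%EOF near end, possibly truncated";
            System.out.println(name + ": " + verdict);
        }
    }
}
```

Files that report a marker near the end but still fail with the exception above would be the candidates relevant to this issue.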



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
