Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2014/11/26 08:34:12 UTC
[jira] [Comment Edited] (PDFBOX-2523) IOException: Error: Expected a long type at offset 1218571, instead got 'xref'
[ https://issues.apache.org/jira/browse/PDFBOX-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225859#comment-14225859 ]
Tilman Hausherr edited comment on PDFBOX-2523 at 11/26/14 7:33 AM:
-------------------------------------------------------------------
full file list from digitalcorpora:
543/543235.pdf
551/551437.pdf
966/966918.pdf
853/853115.pdf
503/503030.pdf
274/274368.pdf
337/337947.pdf
516/516008.pdf
568/568071.pdf
440/440998.pdf
was (Author: tilman):
full file list from digitalcorpora:
543/543235.pdf
551/551437.pdf
966/966918.pdf
853/853115.pdf
503/503030.pdf
274/274368.pdf
337/337947.pdf
516/516008.pdf
> IOException: Error: Expected a long type at offset 1218571, instead got 'xref'
> ------------------------------------------------------------------------------
>
> Key: PDFBOX-2523
> URL: https://issues.apache.org/jira/browse/PDFBOX-2523
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 1.8.8, 2.0.0
> Reporter: Tilman Hausherr
> Attachments: 853115.pdf
>
>
> I get this with the attached file when using the non-sequential parser:
> {code}
> Exception in thread "main" java.io.IOException: Error: Expected a long type at offset 1218571, instead got 'xref'
> at org.apache.pdfbox.pdfparser.BaseParser.readLong(BaseParser.java:1689)
> at org.apache.pdfbox.pdfparser.BaseParser.readObjectNumber(BaseParser.java:1617)
> at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseXrefObjStream(NonSequentialPDFParser.java:746)
> at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseXref(NonSequentialPDFParser.java:697)
> at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.initialParse(NonSequentialPDFParser.java:480)
> at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parse(NonSequentialPDFParser.java:1013)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:951)
> {code}