You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2014/11/26 08:34:12 UTC

[jira] [Comment Edited] (PDFBOX-2523) IOException: Error: Expected a long type at offset 1218571, instead got 'xref'

    [ https://issues.apache.org/jira/browse/PDFBOX-2523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14225859#comment-14225859 ] 

Tilman Hausherr edited comment on PDFBOX-2523 at 11/26/14 7:33 AM:
-------------------------------------------------------------------

full file list from digitalcorpora:
543/543235.pdf
551/551437.pdf
966/966918.pdf
853/853115.pdf
503/503030.pdf
274/274368.pdf
337/337947.pdf
516/516008.pdf
568/568071.pdf
440/440998.pdf





was (Author: tilman):
full file list from digitalcorpora:
543/543235.pdf
551/551437.pdf
966/966918.pdf
853/853115.pdf
503/503030.pdf
274/274368.pdf
337/337947.pdf
516/516008.pdf


> IOException: Error: Expected a long type at offset 1218571, instead got 'xref'
> ------------------------------------------------------------------------------
>
>                 Key: PDFBOX-2523
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2523
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.8, 2.0.0
>            Reporter: Tilman Hausherr
>         Attachments: 853115.pdf
>
>
> I get this with the attached file when using the sequential parser:
> {code}
> Exception in thread "main" java.io.IOException: Error: Expected a long type at offset 1218571, instead got 'xref'
>         at org.apache.pdfbox.pdfparser.BaseParser.readLong(BaseParser.java:1689)
>         at org.apache.pdfbox.pdfparser.BaseParser.readObjectNumber(BaseParser.java:1617)
>         at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseXrefObjStream(NonSequentialPDFParser.java:746)
>         at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseXref(NonSequentialPDFParser.java:697)
>         at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.initialParse(NonSequentialPDFParser.java:480)
>         at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parse(NonSequentialPDFParser.java:1013)
>         at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:951)
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)