You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2017/03/12 12:53:04 UTC

[jira] [Comment Edited] (PDFBOX-3714) PDF with blanks at the beginning can't be parsed

    [ https://issues.apache.org/jira/browse/PDFBOX-3714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15903571#comment-15903571 ] 

Andreas Lehmkühler edited comment on PDFBOX-3714 at 3/12/17 12:53 PM:
----------------------------------------------------------------------

I've narrowed it down to the brute force search for xref-tables/streams. I've to dig deeper for the cause

UPDATE:
For some reasons the object 11 0 exists twice in the pdf and one of them is corrupted and triggers the exception.


was (Author: lehmi):
I've narrowed it down to the brute force search for xref-tables/streams. I've to dig deeper for the cause

> PDF with blanks at the beginning can't be parsed
> ------------------------------------------------
>
>                 Key: PDFBOX-3714
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3714
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.4, 2.1.0
>            Reporter: Tilman Hausherr
>         Attachments: PDFBOX-3714-1-fixed.pdf, PDFBOX-3714-1.pdf, PDFBOX-3714-2-fixed.pdf, PDFBOX-3714-2.pdf
>
>
> The attached files don't parse.  The have some CRs and TABs at the beginning. The files parse properly if the blanks are removed. I thought we were resilient against this type of flaw...



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org