Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (Jira)" <ji...@apache.org> on 2023/01/07 09:34:00 UTC

[jira] [Commented] (PDFBOX-5178) Parsing differences between 2.0.23 and 2.0.24/3.0

    [ https://issues.apache.org/jira/browse/PDFBOX-5178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17655671#comment-17655671 ] 

Andreas Lehmkühler commented on PDFBOX-5178:
--------------------------------------------

I've added support for the index of the objects within a compressed object stream. However, it is only used for those streams where the object numbers aren't unique, as in the given file. This doesn't fix the issue, though: PDFBox accidentally refers to the correct object. The issue was introduced when I "optimized" the parser to stop reading malformed dictionaries to avoid endless loops. I'm still searching for the correct ticket. BTW, 2.0.24 is affected as well.
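
To illustrate why the index matters, here is a minimal sketch and not PDFBox's actual implementation. It assumes the usual object stream layout, where the header is a list of "object number, byte offset" pairs and a type 2 cross-reference stream entry names the object stream plus the index of the object within it. The class ObjStmIndexSketch, its methods, and the header values are hypothetical and made up for the example.

```java
import java.util.ArrayList;
import java.util.List;

public class ObjStmIndexSketch {

    /** One "objectNumber byteOffset" pair from an object stream header. */
    static final class HeaderEntry {
        final long objectNumber;
        final int offset;

        HeaderEntry(long objectNumber, int offset) {
            this.objectNumber = objectNumber;
            this.offset = offset;
        }
    }

    /** Parses the first n whitespace-separated pairs of an object stream header. */
    static List<HeaderEntry> parseHeader(String header, int n) {
        String[] tokens = header.trim().split("\\s+");
        if (tokens.length < 2 * n) {
            throw new IllegalArgumentException("header holds fewer than " + n + " pairs");
        }
        List<HeaderEntry> entries = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            long objectNumber = Long.parseLong(tokens[2 * i]);
            int offset = Integer.parseInt(tokens[2 * i + 1]);
            entries.add(new HeaderEntry(objectNumber, offset));
        }
        return entries;
    }

    /** Lookup by object number alone: returns the first match, so duplicates are ambiguous. */
    static int offsetByObjectNumber(List<HeaderEntry> entries, long objectNumber) {
        for (HeaderEntry entry : entries) {
            if (entry.objectNumber == objectNumber) {
                return entry.offset;
            }
        }
        return -1;
    }

    /** Lookup by index; returns -1 if the index is out of range or the object number differs. */
    static int offsetByIndex(List<HeaderEntry> entries, int index, long objectNumber) {
        if (index < 0 || index >= entries.size()) {
            return -1;
        }
        HeaderEntry entry = entries.get(index);
        return entry.objectNumber == objectNumber ? entry.offset : -1;
    }

    public static void main(String[] args) {
        // Hypothetical header in which object number 14 appears twice.
        List<HeaderEntry> entries = parseHeader("14 0 15 20 14 45", 3);
        System.out.println(offsetByObjectNumber(entries, 14)); // prints 0
        System.out.println(offsetByIndex(entries, 2, 14));     // prints 45
    }
}
```

With unique object numbers both lookups agree; they only diverge when a number is repeated, which is the case the index is meant to disambiguate.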

> Parsing differences between 2.0.23 and 2.0.24/3.0
> -------------------------------------------------
>
>                 Key: PDFBOX-5178
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5178
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.23, 3.0.0 PDFBox
>            Reporter: Tilman Hausherr
>            Assignee: Andreas Lehmkühler
>            Priority: Major
>         Attachments: poppler-704-0.pdf
>
>
> There are some weird differences in parsing the attached file: 2.0.23 shows "BigTIFF.tif" in the /Contents of the first annotation and a loop at Root/Pages/Kids/[0]/Annots/[0]/FS (always 14 0 R), while 3.0 doesn't have the loop but doesn't show "BigTIFF.tif" either. I'm not sure which one (if any) is wrong.
>  
> UPDATE
> 2.0.24 shows the same behaviour as 3.0


