You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/09/08 15:23:00 UTC

[jira] [Commented] (PDFBOX-5501) Jempbox is slow on xmp with large event histories

    [ https://issues.apache.org/jira/browse/PDFBOX-5501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17601881#comment-17601881 ] 

Tilman Hausherr commented on PDFBOX-5501:
-----------------------------------------

What happens if you use a 1.8.17 snapshot? It turns out we had an issue about this (PDFBOX-5165) but haven't released it.

> Jempbox is slow on xmp with large event histories
> -------------------------------------------------
>
>                 Key: PDFBOX-5501
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5501
>             Project: PDFBox
>          Issue Type: Wish
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: big.xmp.gz
>
>
> In looking at the timeouts in a recent run against 8 million PDFs, I found one file where the processing time was caused by extremely slow parsing of the media management schema.
> If I do enough subclassing and put a hard limit inside getEventSequenceList(), the processing time is fairly quick.
> I realize that Jempbox is not going to be supported going forward and understand if this is a "do not fix".



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org