You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/09/08 15:23:00 UTC
[jira] [Commented] (PDFBOX-5501) Jempbox is slow on xmp with large event histories
[ https://issues.apache.org/jira/browse/PDFBOX-5501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17601881#comment-17601881 ]
Tilman Hausherr commented on PDFBOX-5501:
-----------------------------------------
What happens if you use a 1.8.17 snapshot? It turns out we had an issue about this (PDFBOX-5165) but haven't released it.
> Jempbox is slow on xmp with large event histories
> -------------------------------------------------
>
> Key: PDFBOX-5501
> URL: https://issues.apache.org/jira/browse/PDFBOX-5501
> Project: PDFBox
> Issue Type: Wish
> Reporter: Tim Allison
> Priority: Minor
> Attachments: big.xmp.gz
>
>
> In looking at the timeouts in a recent run against 8 million PDFs, I found one file where the processing time was caused by extremely slow parsing of the media management schema.
> If I do enough subclassing and put a hard limit inside getEventSequenceList(), the processing time is fairly quick.
> I realize that Jempbox is not going to be supported going forward and understand if this is a "do not fix".
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org