Posted to dev@pdfbox.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2022/09/08 14:52:00 UTC

[jira] [Created] (PDFBOX-5501) Jempbox is slow on xmp with large event histories

Tim Allison created PDFBOX-5501:
-----------------------------------

             Summary: Jempbox is slow on xmp with large event histories
                 Key: PDFBOX-5501
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5501
             Project: PDFBox
          Issue Type: Wish
            Reporter: Tim Allison
         Attachments: big.xmp.gz

While reviewing the timeouts from a recent run against 8 million PDFs, I found one file whose long processing time was caused by extremely slow parsing of the XMP media management schema.

If I subclass enough of the Jempbox classes to put a hard limit inside getEventSequenceList(), the processing time becomes fairly quick.
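
For illustration only, here is a minimal sketch of what such a hard limit could look like. It does not use any Jempbox classes and is not the actual subclassing workaround. It works on a plain DOM rdf:Seq element, and the names CappedEventHistory, firstEvents, parseSeq and maxEvents are hypothetical. The real getEventSequenceList() may well be structured differently.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;

// Hypothetical helper, not part of Jempbox: reads at most maxEvents
// rdf:li entries from an rdf:Seq and ignores the rest.
public class CappedEventHistory {
    public static final String RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#";

    // Walks the direct children of the sequence via sibling links and
    // stops as soon as maxEvents rdf:li elements have been collected.
    public static List<Element> firstEvents(Element seq, int maxEvents) {
        List<Element> events = new ArrayList<>();
        for (Node n = seq.getFirstChild();
             n != null && events.size() < maxEvents;
             n = n.getNextSibling()) {
            if (n instanceof Element
                    && RDF_NS.equals(n.getNamespaceURI())
                    && "li".equals(n.getLocalName())) {
                events.add((Element) n);
            }
        }
        return events;
    }

    // Parses an XML string with a namespace-aware parser and returns its root element.
    public static Element parseSeq(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            factory.setNamespaceAware(true);
            Document doc = factory.newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            return doc.getDocumentElement();
        } catch (Exception e) {
            // Wrap checked parser/IO exceptions to keep the sketch short.
            throw new IllegalArgumentException("Could not parse XML", e);
        }
    }

    public static void main(String[] args) {
        // Build a synthetic sequence with many entries (a stand-in for a large event history).
        StringBuilder xml = new StringBuilder("<rdf:Seq xmlns:rdf=\"" + RDF_NS + "\">");
        for (int i = 0; i < 10000; i++) {
            xml.append("<rdf:li>event ").append(i).append("</rdf:li>");
        }
        xml.append("</rdf:Seq>");

        List<Element> events = firstEvents(parseSeq(xml.toString()), 100);
        System.out.println(events.size()); // prints 100
    }
}
```

The cap is a trade-off: events beyond the limit are silently dropped, so a caller would need to decide whether a truncated history is acceptable.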

I realize that Jempbox is not going to be supported going forward and understand if this is a "do not fix".



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
