You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2019/05/15 19:43:00 UTC

[jira] [Created] (PDFBOX-4550) Poor performance with corrupt ToUnicode stream

Tilman Hausherr created PDFBOX-4550:
---------------------------------------

             Summary: Poor performance with corrupt ToUnicode stream
                 Key: PDFBOX-4550
                 URL: https://issues.apache.org/jira/browse/PDFBOX-4550
             Project: PDFBox
          Issue Type: Bug
          Components: Rendering, Text extraction
    Affects Versions: 2.0.15, 3.0.0 PDFBox
            Reporter: Tilman Hausherr
            Assignee: Tilman Hausherr
             Fix For: 3.0.0 PDFBox


A confidential file with lots of corrupt streams has ToUnicode stream with corrupt contents in the beginbfrange segment where start and end have different lengths. This leads to poor performance. Such entries can be skipped.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org