Posted to dev@pdfbox.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/01/14 23:59:54 UTC

[jira] Resolved: (PDFBOX-603) PDFBox performance issue: Encoding.java getCharacter() method tweak

     [ https://issues.apache.org/jira/browse/PDFBOX-603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved PDFBOX-603.
----------------------------------

       Resolution: Fixed
    Fix Version/s: 1.0.0
         Assignee: Jukka Zitting

Thanks! I committed the changes in revision 899454.

> PDFBox performance issue:  Encoding.java  getCharacter() method tweak
> ---------------------------------------------------------------------
>
>                 Key: PDFBOX-603
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-603
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Text extraction
>    Affects Versions: 0.8.0-incubator
>         Environment: All
>            Reporter: Mel Martinez
>            Assignee: Jukka Zitting
>             Fix For: 1.0.0
>
>         Attachments: Encoding.java
>
>
> During parsing and text extraction, the Encoding.getCharacter(COSName) method is invoked repeatedly.
> It includes a string test that is performed up front but should only be needed rarely. The code should be restructured slightly so that the test runs later, i.e. it should succeed fast and fail slow.
> I'll post an attachment that rewrites the method slightly. The performance gain is fairly significant.
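
A minimal sketch of the "succeed fast, fail slow" restructuring described in the report. It is not the PDFBox code or the attached Encoding.java. It assumes plain String keys in place of COSName, and it uses a hypothetical check for a '.' suffix as the rarely needed string test. The class name GlyphLookup and both method names are illustrative only.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a glyph-name lookup; NOT the PDFBox Encoding class.
class GlyphLookup {
    private final Map<String, String> nameToCharacter = new HashMap<>();

    GlyphLookup() {
        // A few sample entries, for illustration only.
        nameToCharacter.put("A", "A");
        nameToCharacter.put("space", " ");
        nameToCharacter.put("fi", "\uFB01");
    }

    // Before: the string test runs on every call, even for names
    // that are already in the map.
    String getCharacterTestFirst(String name) {
        int dot = name.indexOf('.');
        String base = dot > 0 ? name.substring(0, dot) : name;
        return nameToCharacter.get(base);
    }

    // After: try the map first (succeed fast) and only fall back to the
    // string test on a miss (fail slow).
    String getCharacter(String name) {
        String character = nameToCharacter.get(name);
        if (character != null) {
            return character;
        }
        // Rare path (assumed): strip a suffix such as ".alt" and retry.
        int dot = name.indexOf('.');
        if (dot > 0) {
            character = nameToCharacter.get(name.substring(0, dot));
        }
        return character;
    }
}
```

For names already in the map, getCharacter returns after a single lookup and never scans the string. The extra work is paid only on a miss, which the report expects to be rare.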

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.