You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Mel Martinez (JIRA)" <ji...@apache.org> on 2010/01/14 22:37:54 UTC

[jira] Created: (PDFBOX-602) PDFBox performance issue: PDPage tweaks

PDFBox performance issue:  PDPage tweaks
----------------------------------------

                 Key: PDFBOX-602
                 URL: https://issues.apache.org/jira/browse/PDFBOX-602
             Project: PDFBox
          Issue Type: Improvement
          Components: PDModel
    Affects Versions: 0.8.0-incubator
         Environment: All
            Reporter: Mel Martinez


During parsing / text extraction the PDPage class' getMediaBox() and getParent() methods are accessed many times.  They are currently slow because the code recalculates the results each time.

These should instead be persisted in instance fields so that they are only calculated once unless the underlying object state has changed.

I will attach a version of the file that includes the fix.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (PDFBOX-602) PDFBox performance issue: PDPage tweaks

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved PDFBOX-602.
----------------------------------

       Resolution: Fixed
    Fix Version/s: 1.0.0
         Assignee: Jukka Zitting

Good idea! I committed the changes in revision 899459.

> PDFBox performance issue:  PDPage tweaks
> ----------------------------------------
>
>                 Key: PDFBOX-602
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-602
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: PDModel
>    Affects Versions: 0.8.0-incubator
>         Environment: All
>            Reporter: Mel Martinez
>            Assignee: Jukka Zitting
>             Fix For: 1.0.0
>
>         Attachments: PDPage.java
>
>
> During parsing / text extraction the PDPage class' getMediaBox() and getParent() methods are accessed many times.  They are currently slow because the code recalculates the results each time.
> These should instead be persisted in instance fields so that they are only calculated once unless the underlying object state has changed.
> I will attach a version of the file that includes the fix.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (PDFBOX-602) PDFBox performance issue: PDPage tweaks

Posted by "Mel Martinez (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mel Martinez updated PDFBOX-602:
--------------------------------

    Attachment: PDPage.java

Tweaks the PDPage.java class in a few spots to improve performance.

> PDFBox performance issue:  PDPage tweaks
> ----------------------------------------
>
>                 Key: PDFBOX-602
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-602
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: PDModel
>    Affects Versions: 0.8.0-incubator
>         Environment: All
>            Reporter: Mel Martinez
>         Attachments: PDPage.java
>
>
> During parsing / text extraction the PDPage class' getMediaBox() and getParent() methods are accessed many times.  They are currently slow because the code recalculates the results each time.
> These should instead be persisted in instance fields so that they are only calculated once unless the underlying object state has changed.
> I will attach a version of the file that includes the fix.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.