You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2011/02/28 19:00:38 UTC

[jira] Created: (PDFBOX-967) Create Type1C font metrics only when necessary

Create Type1C font metrics only when necessary
----------------------------------------------

                 Key: PDFBOX-967
                 URL: https://issues.apache.org/jira/browse/PDFBOX-967
             Project: PDFBox
          Issue Type: Improvement
          Components: Parsing
    Affects Versions: 1.5.0
            Reporter: Andreas Lehmkühler
            Assignee: Andreas Lehmkühler
            Priority: Minor


The font metrics of a Type1C font are always created during initialization, even if they aren't needed e.g. when extracting the text

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] Resolved: (PDFBOX-967) Create Type1C font metrics only when necessary

Posted by "Andreas Lehmkühler (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler resolved PDFBOX-967.
---------------------------------------

       Resolution: Fixed
    Fix Version/s: 1.6.0

I added a patch in revision 1075455. The idea goes back to the suggestion of Lars Torunski (see PDFBOX-959)

> Create Type1C font metrics only when necessary
> ----------------------------------------------
>
>                 Key: PDFBOX-967
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-967
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 1.5.0
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>            Priority: Minor
>             Fix For: 1.6.0
>
>         Attachments: prepareFontMetric.jpg
>
>
> The font metrics of a Type1C font are always created during initialization, even if they aren't needed e.g. when extracting the text

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] Updated: (PDFBOX-967) Create Type1C font metrics only when necessary

Posted by "Andreas Lehmkühler (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler updated PDFBOX-967:
--------------------------------------

    Attachment: prepareFontMetric.jpg

Moved attachment from PDFBOX-959 to this issue

> Create Type1C font metrics only when necessary
> ----------------------------------------------
>
>                 Key: PDFBOX-967
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-967
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 1.5.0
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>            Priority: Minor
>         Attachments: prepareFontMetric.jpg
>
>
> The font metrics of a Type1C font are always created during initialization, even if they aren't needed e.g. when extracting the text

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] [Commented] (PDFBOX-967) Create Type1C font metrics only when necessary

Posted by "Lars Torunski (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PDFBOX-967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072357#comment-13072357 ] 

Lars Torunski commented on PDFBOX-967:
--------------------------------------

Nice work in 1.6.0! Test with two big documents.

The lucene document creation with pdfbox-1.5.0 took 37354s and with pdfbox-1.6.0 took 8890s --> 4 times faster for me.

> Create Type1C font metrics only when necessary
> ----------------------------------------------
>
>                 Key: PDFBOX-967
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-967
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 1.5.0
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>            Priority: Minor
>             Fix For: 1.6.0
>
>         Attachments: PDFType1Font.jpg, prepareFontMetric.jpg
>
>
> The font metrics of a Type1C font are always created during initialization, even if they aren't needed e.g. when extracting the text

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] [Updated] (PDFBOX-967) Create Type1C font metrics only when necessary

Posted by "Lars Torunski (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PDFBOX-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lars Torunski updated PDFBOX-967:
---------------------------------

    Attachment: PDFType1Font.jpg

PDFType1Font.jpg: 65% of the pdf indexing is spent on PDFType1Font object creations in 1.5.0. Hopefully 1.6.0 is coming soon.

> Create Type1C font metrics only when necessary
> ----------------------------------------------
>
>                 Key: PDFBOX-967
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-967
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 1.5.0
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>            Priority: Minor
>             Fix For: 1.6.0
>
>         Attachments: PDFType1Font.jpg, prepareFontMetric.jpg
>
>
> The font metrics of a Type1C font are always created during initialization, even if they aren't needed e.g. when extracting the text

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira