You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (Commented) (JIRA)" <ji...@apache.org> on 2012/01/20 16:59:40 UTC

[jira] [Commented] (TIKA-507) Parser for font files

    [ https://issues.apache.org/jira/browse/TIKA-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13189861#comment-13189861 ] 

Nick Burch commented on TIKA-507:
---------------------------------

Thanks for this patch, sorry it has taken so long to get to!

Looking at the supplied .afm files, it looks like they're copyright. As such, I've tweaked the tests to use the sample .afm we already have, which is a specially generated one for Tika (so no copyright problems)

Parser added (with a few tweaks) in r1233973.

Now I guess the next thing is to get some suitably licensed .pfm/.pfa/.pfb files, then look at a parser for those!
                
> Parser for font files
> ---------------------
>
>                 Key: TIKA-507
>                 URL: https://issues.apache.org/jira/browse/TIKA-507
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>         Attachments: AdobeFontMetricParser.zip, TIKA-507.Arreola.110724.patch.txt
>
>
> The FontBox library used by PDFBox supports various kinds of font information files. These files don't typically contain much useful textual data, but they do have interesting metadata that should be made available also through Tika.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira