You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hong-Thai Nguyen (JIRA)" <ji...@apache.org> on 2014/02/17 13:17:21 UTC

[jira] [Resolved] (TIKA-1223) Extract thumbnail of OOXML Office files

     [ https://issues.apache.org/jira/browse/TIKA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hong-Thai Nguyen resolved TIKA-1223.
------------------------------------

    Resolution: Fixed

r1568954

> Extract thumbnail of OOXML Office files
> ---------------------------------------
>
>                 Key: TIKA-1223
>                 URL: https://issues.apache.org/jira/browse/TIKA-1223
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.4
>            Reporter: Hong-Thai Nguyen
>            Assignee: Hong-Thai Nguyen
>            Priority: Minor
>             Fix For: 1.6
>
>         Attachments: TIKA-1223.patch
>
>
> From Microsoft Office 2007 file formats, thumbnail could be included in package. We can extract this embedded thumbnail for OOXML files.
> As discussed in mailing list, we should extract thumbnail as a attachment, not as metadata (TIKA-90).
> {noformat}
> embeddedRelationId format is thumbnail_{i}.{extension}.
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)