You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Fabrizio Fortino (Jira)" <ji...@apache.org> on 2021/06/11 08:06:00 UTC

[jira] [Resolved] (OAK-9456) Exclude all possible PSD mime types in default tika config

     [ https://issues.apache.org/jira/browse/OAK-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Fabrizio Fortino resolved OAK-9456.
-----------------------------------
    Fix Version/s: 1.42.0
       Resolution: Fixed

Fixed with revision 1890688.

> Exclude all possible PSD mime types in default tika config
> ----------------------------------------------------------
>
>                 Key: OAK-9456
>                 URL: https://issues.apache.org/jira/browse/OAK-9456
>             Project: Jackrabbit Oak
>          Issue Type: Bug
>          Components: indexing
>    Affects Versions: 1.40.0
>            Reporter: Fabrizio Fortino
>            Assignee: Fabrizio Fortino
>            Priority: Critical
>             Fix For: 1.42.0
>
>
> The default tika config [0] excludes some mime types that are known to be resource-intensive. One of these mime types is *image/x-psd* which should exclude Adobe PSD file formats. As explained in this forum post [1], this file format can have several mime-type values. We need to include all the variations to avoid indexing slow-downs in these cases.
>  
> [0] [https://github.com/apache/jackrabbit-oak/blob/trunk/oak-lucene/src/main/resources/org/apache/jackrabbit/oak/plugins/index/lucene/tika-config.xml]
> [1] https://www.forumming.com/question/18296/what-is-the-correct-mime-type-for-psd-photoshop



--
This message was sent by Atlassian Jira
(v8.3.4#803005)