You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Chetan Mehrotra (JIRA)" <ji...@apache.org> on 2017/08/28 04:10:00 UTC

[jira] [Comment Edited] (OAK-6587) Provide a way to "force" Tika to treat binaries with a different mime type than the jcr:mimeType property

    [ https://issues.apache.org/jira/browse/OAK-6587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16143345#comment-16143345 ] 

Chetan Mehrotra edited comment on OAK-6587 at 8/28/17 4:09 AM:
---------------------------------------------------------------

bq. Should I submit a separate PR to backport this to the 1.6 branch?

No PR required but need to confirm on oak-dev as mostly backport is done for bug fixes and not for improvements [1]. Do you need it soon or can we do it after some time i.e. I wait mostly for one release on trunk before backporting improvements

[1] https://jackrabbit.apache.org/oak/docs/participating.html


was (Author: chetanm):
bq. Should I submit a separate PR to backport this to the 1.6 branch?

No PR required but need to confirm on oak-dev as mostly backport is done for bug fixes and not for improvements [1]

[1] https://jackrabbit.apache.org/oak/docs/participating.html

> Provide a way to "force" Tika to treat binaries with a different mime type than the jcr:mimeType property
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: OAK-6587
>                 URL: https://issues.apache.org/jira/browse/OAK-6587
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: lucene
>            Reporter: Justin Edelson
>            Assignee: Chetan Mehrotra
>             Fix For: 1.8, 1.7.7
>
>
> There are occasions when an existing Tika parser can be used to parse a binary, but Tika doesn't "think" it can because the mime type isn't in the Parser's list of supported types. There appears to be no way to configure this in Tika. Editing the config.xml file only allows for types which are _theoretically_ parseable to be mapped to different parsers; it doesn't change the set of supported types.
> To deal with this, I'd like to add a new configuration node structure named {{mimeTypes}} under the {{tika}} node of lucene indexes. Using this structure, a mapped type can be defined and this type will be used for interaction with Tika.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)