You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@jackrabbit.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/09/29 14:04:16 UTC

[jira] Created: (JCR-2334) Tika-based type detection in jcr-server

Tika-based type detection in jcr-server
---------------------------------------

                 Key: JCR-2334
                 URL: https://issues.apache.org/jira/browse/JCR-2334
             Project: Jackrabbit Content Repository
          Issue Type: Improvement
          Components: jackrabbit-jcr-server
            Reporter: Jukka Zitting
            Assignee: Jukka Zitting


As discussed on dev@, I'd like to make the jackrabbit-jcr-server component use Apache Tika for automatic media type detection.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-2334) Tika-based type detection in jcr-server

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting updated JCR-2334:
-------------------------------

    Attachment: JCR-2334.patch

Proposed patch.

> Tika-based type detection in jcr-server
> ---------------------------------------
>
>                 Key: JCR-2334
>                 URL: https://issues.apache.org/jira/browse/JCR-2334
>             Project: Jackrabbit Content Repository
>          Issue Type: Improvement
>          Components: jackrabbit-jcr-server
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>         Attachments: JCR-2334.patch
>
>
> As discussed on dev@, I'd like to make the jackrabbit-jcr-server component use Apache Tika for automatic media type detection.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (JCR-2334) Tika-based type detection in jcr-server

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved JCR-2334.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 2.0-beta6

Resolving as Fixed based on the changes in revision 830670. 

> Tika-based type detection in jcr-server
> ---------------------------------------
>
>                 Key: JCR-2334
>                 URL: https://issues.apache.org/jira/browse/JCR-2334
>             Project: Jackrabbit Content Repository
>          Issue Type: Improvement
>          Components: jackrabbit-jcr-server
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>             Fix For: 2.0-beta6
>
>         Attachments: JCR-2334.patch, JCR-2334.patch
>
>
> As discussed on dev@, I'd like to make the jackrabbit-jcr-server component use Apache Tika for automatic media type detection.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (JCR-2334) Tika-based type detection in jcr-server

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/JCR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting updated JCR-2334:
-------------------------------

    Attachment: JCR-2334.patch

Minor update to the patch: Fix the javadocs about the default content node type (nt:unstructured instead of nt:resource). Thanks, Angela!

> Tika-based type detection in jcr-server
> ---------------------------------------
>
>                 Key: JCR-2334
>                 URL: https://issues.apache.org/jira/browse/JCR-2334
>             Project: Jackrabbit Content Repository
>          Issue Type: Improvement
>          Components: jackrabbit-jcr-server
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>         Attachments: JCR-2334.patch, JCR-2334.patch
>
>
> As discussed on dev@, I'd like to make the jackrabbit-jcr-server component use Apache Tika for automatic media type detection.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-2334) Tika-based type detection in jcr-server

Posted by "Felix Meschberger (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761133#action_12761133 ] 

Felix Meschberger commented on JCR-2334:
----------------------------------------

I'm not sure, whether replacing the MimeResolver interface with directly using Tika is a good idea. In fact in the Apache Sling WebDAV bundle, I made use of the ability to inject a custom MimeResolver to be able to use Sling's mime type resolver.

So, I would actually favor a solution which would provide a Tika-based Mime Resolver.

> Tika-based type detection in jcr-server
> ---------------------------------------
>
>                 Key: JCR-2334
>                 URL: https://issues.apache.org/jira/browse/JCR-2334
>             Project: Jackrabbit Content Repository
>          Issue Type: Improvement
>          Components: jackrabbit-jcr-server
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>         Attachments: JCR-2334.patch, JCR-2334.patch
>
>
> As discussed on dev@, I'd like to make the jackrabbit-jcr-server component use Apache Tika for automatic media type detection.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (JCR-2334) Tika-based type detection in jcr-server

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/JCR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12770992#action_12770992 ] 

Jukka Zitting commented on JCR-2334:
------------------------------------

Committed the patch in revision 830670.

Felix:
> in the Apache Sling WebDAV bundle, I made use of the ability to inject a custom MimeResolver to be able to use Sling's mime type resolver. 

You can still provide custom type detection functionality by injecting an implementation of the org.apache.tika.detect.Detector interface.

> I would actually favor a solution which would provide a Tika-based Mime Resolver. 

The Detector interface is more expressive than the MimeResolver class we've used so far, so a Tika-based MimeResolver would not be as powerful a solution. For example the MimeResolver class did not support things like content sniffing or Content-Type hints.

> Tika-based type detection in jcr-server
> ---------------------------------------
>
>                 Key: JCR-2334
>                 URL: https://issues.apache.org/jira/browse/JCR-2334
>             Project: Jackrabbit Content Repository
>          Issue Type: Improvement
>          Components: jackrabbit-jcr-server
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>         Attachments: JCR-2334.patch, JCR-2334.patch
>
>
> As discussed on dev@, I'd like to make the jackrabbit-jcr-server component use Apache Tika for automatic media type detection.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.