Posted to dev@jackrabbit.apache.org by Jukka Zitting <ju...@gmail.com> on 2009/09/28 11:04:54 UTC

Using Tika to detect MIME types for WebDAV

Hi,

Our WebDAV server currently uses the simple
o.a.j.server.io.MimeResolver class to infer the MIME media types of
documents automatically. This class can only detect types based on
file extensions and incoming type hints, and the default type table
we include is not very extensive.

We could improve this situation by using the Detector [1] interface
and the related default configuration from Apache Tika. This way we
could massively extend the number of supported types and also get
content-based type detection (magic bytes) almost for free.
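
To make the difference between extension-based and content-based
detection concrete, here is a minimal, self-contained sketch. It does
not use Tika's Detector API or the existing MimeResolver: the
MimeSketch class, its method names, and the tiny type table are all
hypothetical and only illustrate the idea of checking magic bytes
first and falling back to the file extension.

```java
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration only; this is NOT Tika's Detector API.
class MimeSketch {

    // Tiny extension table, similar in spirit to a default type table.
    private static final Map<String, String> BY_EXTENSION = Map.of(
            "txt", "text/plain",
            "pdf", "application/pdf",
            "png", "image/png");

    // Well-known leading byte sequences ("magic bytes") of two formats.
    private static final byte[] PDF_MAGIC =
            "%PDF-".getBytes(StandardCharsets.US_ASCII);
    private static final byte[] PNG_MAGIC =
            {(byte) 0x89, 'P', 'N', 'G', 0x0D, 0x0A, 0x1A, 0x0A};

    // Returns true if data begins with the given magic byte sequence.
    static boolean startsWith(byte[] data, byte[] magic) {
        if (data == null || data.length < magic.length) {
            return false;
        }
        for (int i = 0; i < magic.length; i++) {
            if (data[i] != magic[i]) {
                return false;
            }
        }
        return true;
    }

    // Content first, then file extension, then a generic default.
    static String detect(byte[] prefix, String name) {
        if (startsWith(prefix, PDF_MAGIC)) {
            return "application/pdf";
        }
        if (startsWith(prefix, PNG_MAGIC)) {
            return "image/png";
        }
        if (name != null) {
            int dot = name.lastIndexOf('.');
            if (dot >= 0 && dot < name.length() - 1) {
                String ext = name.substring(dot + 1).toLowerCase(Locale.ROOT);
                String type = BY_EXTENSION.get(ext);
                if (type != null) {
                    return type;
                }
            }
        }
        return "application/octet-stream";
    }

    public static void main(String[] args) {
        byte[] pdf = "%PDF-1.4".getBytes(StandardCharsets.US_ASCII);
        // The content wins over a misleading extension.
        System.out.println(detect(pdf, "report.txt"));
        // With no recognizable content, the extension decides.
        System.out.println(detect(new byte[0], "notes.txt"));
    }
}
```

A purely extension-based resolver would label the first document in
main() as text/plain; looking at the leading bytes avoids that kind of
mislabeling when the name is wrong or missing.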

The Detector interface and all the related classes and configuration
entries are included in the relatively small tika-core component that
has no external dependencies, so the impact on jackrabbit-jcr-server
dependencies would be reasonable.

[1] https://svn.apache.org/repos/asf/lucene/tika/trunk/tika-core/src/main/java/org/apache/tika/detect/Detector.java

BR,

Jukka Zitting

Re: Using Tika to detect MIME types for WebDAV

Posted by Alexander Klimetschek <ak...@day.com>.
On Mon, Sep 28, 2009 at 11:04, Jukka Zitting <ju...@gmail.com> wrote:
> We could improve this situation by using the Detector [1] interface
> and the related default configuration from Apache Tika.

+1

Regards,
Alex

-- 
Alexander Klimetschek
alexander.klimetschek@day.com