You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Manisha Kampasi (JIRA)" <ji...@apache.org> on 2016/03/01 08:23:18 UTC

[jira] [Created] (TIKA-1882) Updating the tika-mimetypes.xml for new mime magic patterns

Manisha Kampasi created TIKA-1882:
-------------------------------------

             Summary: Updating the tika-mimetypes.xml for new mime magic patterns
                 Key: TIKA-1882
                 URL: https://issues.apache.org/jira/browse/TIKA-1882
             Project: Tika
          Issue Type: Improvement
          Components: mime
    Affects Versions: 1.11
            Reporter: Manisha Kampasi
            Priority: Minor


The following mime magic can be added to better detect the below mime-types:

1. vnd.ms-cab-compressed (.cab files) - pattern "MCSF" in the first 4 bytes
2. application/vnd.xara (.xar files) - pattern "xar!" in the first 4 bytes
3. application/x-mobipocket-ebook (.mobi files) - pattern "BOOKMOBI" starting at byte position 60
4. video/quicktime (.mov files) - patterns "free" and "wide" seen starting at byte position 4

The changes can be seen here:
https://github.com/mkampasi/tika/commit/f7433daf434a44937ba3ae8b15813a768f95e334



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)