You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/03/01 10:49:18 UTC

[jira] [Commented] (TIKA-1882) Updating the tika-mimetypes.xml for new mime magic patterns

    [ https://issues.apache.org/jira/browse/TIKA-1882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15173509#comment-15173509 ] 

ASF GitHub Bot commented on TIKA-1882:
--------------------------------------

GitHub user mkampasi opened a pull request:

    https://github.com/apache/tika/pull/82

    Fix for TIKA-1882

    The following mime magic has been added to tika-mimetypes.xml to better detect the below mime-types:
    
    1. **application/vnd.ms-cab-compressed (.cab files)** - pattern "MCSF" in the first 4 bytes
    2.  **application/vnd.xara (.xar files)** - pattern "xar!" in the first 4 bytes
    3. **application/x-mobipocket-ebook (.mobi files)** - pattern "BOOKMOBI" starting at byte position 60
    4. **video/quicktime (.mov files)** - patterns "free" and "wide" seen starting at byte position 4

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mkampasi/tika master

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/tika/pull/82.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #82
    
----
commit f7433daf434a44937ba3ae8b15813a768f95e334
Author: Manisha Kampasi <ma...@gmail.com>
Date:   2016-03-01T07:02:55Z

    Update tika-mimetypes.xml
    
    Updated mime-magic for 4 mime types (tika-mimetypes.xml):
    1. vnd.ms-cab-compressed (.cab files) - pattern "MCSF" in the first 4 bytes
    2. application/vnd.xara (.xar files) - pattern "xar!" in the first 4 bytes
    3. application/x-mobipocket-ebook (.mobi files) - pattern "BOOKMOBI" starting at byte position 60
    4. video/quicktime (.mov files) - patterns "free" and "wide" seen starting at byte position 4

----


> Updating the tika-mimetypes.xml for new mime magic patterns
> -----------------------------------------------------------
>
>                 Key: TIKA-1882
>                 URL: https://issues.apache.org/jira/browse/TIKA-1882
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.11
>            Reporter: Manisha Kampasi
>            Priority: Minor
>              Labels: patch
>
> The following mime magic can be added to better detect the below mime-types:
> 1. vnd.ms-cab-compressed (.cab files) - pattern "MCSF" in the first 4 bytes
> 2. application/vnd.xara (.xar files) - pattern "xar!" in the first 4 bytes
> 3. application/x-mobipocket-ebook (.mobi files) - pattern "BOOKMOBI" starting at byte position 60
> 4. video/quicktime (.mov files) - patterns "free" and "wide" seen starting at byte position 4
> The changes can be seen here:
> https://github.com/mkampasi/tika/commit/f7433daf434a44937ba3ae8b15813a768f95e334



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)