You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2013/04/22 22:35:15 UTC

[jira] [Created] (TIKA-1110) Incorrectly declared SUPPORTED_TYPES in ChmParser.

Andrzej Bialecki  created TIKA-1110:
---------------------------------------

             Summary: Incorrectly declared SUPPORTED_TYPES in ChmParser.
                 Key: TIKA-1110
                 URL: https://issues.apache.org/jira/browse/TIKA-1110
             Project: Tika
          Issue Type: Bug
    Affects Versions: 1.3, 1.4
            Reporter: Andrzej Bialecki 
             Fix For: 1.4


[This link|http://www.iana.org/assignments/media-types/application/vnd.ms-htmlhelp] assigns the official mime type for these files to "application/vnd.ms-htmlhelp". In the wild there are also two other types used:

* application/chm
* application/x-chm

tika-mimetypes.xml uses the correct official mime type, but ChmParser declares that it supports only "application/chm". For this reason content that uses the official mime type (e.g. coming via Detector or parsed using AutoDetectParser, or simply declared in metadata) fails to parse due to unknown mime type.

The fix seems simple - ChmParser should declare also all of the above types in its SUPPORTED_TYPES.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira