You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2013/04/22 22:35:15 UTC
[jira] [Created] (TIKA-1110) Incorrectly declared SUPPORTED_TYPES
in ChmParser.
Andrzej Bialecki created TIKA-1110:
---------------------------------------
Summary: Incorrectly declared SUPPORTED_TYPES in ChmParser.
Key: TIKA-1110
URL: https://issues.apache.org/jira/browse/TIKA-1110
Project: Tika
Issue Type: Bug
Affects Versions: 1.3, 1.4
Reporter: Andrzej Bialecki
Fix For: 1.4
[This link|http://www.iana.org/assignments/media-types/application/vnd.ms-htmlhelp] assigns the official mime type for these files to "application/vnd.ms-htmlhelp". In the wild there are also two other types used:
* application/chm
* application/x-chm
tika-mimetypes.xml uses the correct official mime type, but ChmParser declares that it supports only "application/chm". For this reason content that uses the official mime type (e.g. coming via Detector or parsed using AutoDetectParser, or simply declared in metadata) fails to parse due to unknown mime type.
The fix seems simple - ChmParser should declare also all of the above types in its SUPPORTED_TYPES.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira