You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2014/04/30 12:06:15 UTC

[jira] [Commented] (TIKA-1286) Adding MS Visio VSDX to mime-types detection

    [ https://issues.apache.org/jira/browse/TIKA-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13985341#comment-13985341 ] 

Nick Burch commented on TIKA-1286:
----------------------------------

Is it worth having an intermediate type, eg x-tika-visio-ooxml? Or are these visio subtypes too different to make that helpful?

> Adding MS Visio VSDX to mime-types detection
> --------------------------------------------
>
>                 Key: TIKA-1286
>                 URL: https://issues.apache.org/jira/browse/TIKA-1286
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.5
>         Environment: Any
>            Reporter: Pascal Essiembre
>            Priority: Minor
>              Labels: easyfix
>   Original Estimate: 1m
>  Remaining Estimate: 1m
>
> Visio files under the Open Office XML (ooxml) format are not recognized by the mim-type detector and always returns the family mime-type instead: {{application/x-tika-ooxml}}.
> It turns out most Microsoft OOXML file formats are defined in the tika-mimetypes.xml, but not not Visio.  I have created the list for someone to add:
> {code:xml}
>   <mime-type type="application/vnd.ms-visio.drawing.main+xml">
>     <_comment>Office Open XML Visio Drawing (macro-free)</_comment>
>     <glob pattern="*.vsdx"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.template.main+xml">
>     <_comment>Office Open XML Visio Template (macro-free)</_comment>
>     <glob pattern="*.vstx"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.stencil.main+xml">
>     <_comment>Office Open XML Visio Stencil (macro-free)</_comment>
>     <glob pattern="*.vssx"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.drawing.macroEnabled.main+xml">
>     <_comment>Office Open XML Visio Drawing (macro-enabled)</_comment>
>     <glob pattern="*.vsdm"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.template.macroEnabled.main+xml">
>     <_comment>Office Open XML Visio Template (macro-enabled)</_comment>
>     <glob pattern="*.vstm"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
>   <mime-type type="application/vnd.ms-visio.stencil.macroEnabled.main+xml">
>     <_comment>Office Open XML Visio Stencil (macro-enabled)</_comment>
>     <glob pattern="*.vssm"/>
>     <sub-class-of type="application/x-tika-ooxml"/>
>   </mime-type>
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)