You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Sergey Beryozkin (JIRA)" <ji...@apache.org> on 2014/06/17 21:47:08 UTC

[jira] [Comment Edited] (TIKA-411) Generate list of supported and detected types automatically

    [ https://issues.apache.org/jira/browse/TIKA-411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14034269#comment-14034269 ] 

Sergey Beryozkin edited comment on TIKA-411 at 6/17/14 7:46 PM:
----------------------------------------------------------------

I was thinking that if this information is exposed by the running Tika server used by a remote client then the parser names do not have to be exposed, the remote client wants to get a given file handled by the server. 
The information of parser classes and the types they recognize does seem to 'belong' to Tika site or the current documentation, etc

Thanks, Sergey 


was (Author: sergey_beryozkin):
I was thinking that if this information exposed by the running Tika server used by a remote client then the parser names do not have to be exposed, the remote client wants to get a given file handled by the server. 
The information of parser classes and the types they recognize does seem to 'belong' to Tika site or the current documentation, etc

Thanks, Sergey 

> Generate list of supported and detected types automatically
> -----------------------------------------------------------
>
>                 Key: TIKA-411
>                 URL: https://issues.apache.org/jira/browse/TIKA-411
>             Project: Tika
>          Issue Type: Improvement
>          Components: documentation
>            Reporter: Jukka Zitting
>            Priority: Minor
>         Attachments: TIKA-411.patch, TIKA-411.screenshot.png
>
>
> Currently we edit the list of supported types (http://lucene.apache.org/tika/0.7/formats.html) manually, which is bound to leave the list outdated and incomplete. It would be better if the list was automatically generated from the tika-mimetypes.xml file and the getSupportedTypes() response of the AutoDetectParser class.



--
This message was sent by Atlassian JIRA
(v6.2#6252)