You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Uwe Schindler (JIRA)" <ji...@apache.org> on 2016/04/22 11:49:13 UTC

[jira] [Comment Edited] (SOLR-8716) Upgrade to Apache Tika 1.12

    [ https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15253638#comment-15253638 ] 

Uwe Schindler edited comment on SOLR-8716 at 4/22/16 9:48 AM:
--------------------------------------------------------------

Hi,

I quickly reviewed the new dependencies: Some are fine (Access Databases), but some may not be relevant for Apache Solr, e.g. the Geo-Stuff. We also exclude purely scientific formats like Netcdf, not useable to "normal" endusers. If you really need those libraries, users can add the JARs on their own. We don't want to bloat the binary release with stuff useless to 99.9% of all users.

In general we are mostly interested in libraries that extract text from "documents", not stuff that just extracts a bit of metadata or other non-document stuff. So I have the feeling Apache SIS is not relevant for the extraction module. Users that want to index geospatial stuff have to use other features of Solr.

For the other new dependencies, we have to add the NOTICE.txt entries (inside Solr).


was (Author: thetaphi):
Hi,

I quickly reviewed the new dependencies: Some are fine (Access Databases), but some may not be relevant for Apache Solr, e.g. the Geo-Stuff. We also excluded Netcdf in the past.

In general we are mostly interested in libraries that extract text from "documents", not stuff that just extracts a bit of metadata or other non-document stuff. So I have the feeling Apache SIS is not relevant for the extraction module. Users that want to index geospatial stuff have to use other features of Solr.

For the other new dependencies, we have to add the NOTICE.txt entries (inside Solr).

> Upgrade to Apache Tika 1.12
> ---------------------------
>
>                 Key: SOLR-8716
>                 URL: https://issues.apache.org/jira/browse/SOLR-8716
>             Project: Solr
>          Issue Type: Improvement
>          Components: contrib - Solr Cell (Tika extraction)
>            Reporter: Lewis John McGibbney
>            Assignee: Uwe Schindler
>             Fix For: master
>
>         Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org