Posted to dev@tika.apache.org by "Jukka Zitting (Created) (JIRA)" <ji...@apache.org> on 2011/11/01 16:27:32 UTC

[jira] [Created] (TIKA-766) Trim down the NetCDF dependency

Trim down the NetCDF dependency
-------------------------------

                 Key: TIKA-766
                 URL: https://issues.apache.org/jira/browse/TIKA-766
             Project: Tika
          Issue Type: Improvement
          Components: packaging, parser
            Reporter: Jukka Zitting
            Priority: Minor


As noted in TIKA-763, the NetCDF dependency contains a few LGPL classes that we should get rid of, ideally without the workaround added for TIKA-763.

Additionally, at 4.2MB the NetCDF jar is pretty large and includes lots of stuff that isn't really related to parsing NetCDF and HDF files.

It would be nice if the NetCDF project could produce a separately packaged read-only parser library that only contains the stuff needed by Tika.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-766) Trim down the NetCDF dependency

Posted by "john caron (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13399697#comment-13399697 ] 

john caron commented on TIKA-766:
---------------------------------

I've attempted to get any LGPL code out of netcdf-java version 4.3.11, which will be released next week.

We've also done more work on mavenizing it; hopefully that will work for you.

In terms of size, I can cut it down some, but not much (maybe down to 3 MB?). If you only want netcdf-3, the jar could be very small. But if you want all those other file formats, it gets pretty big. Both GRIB and BUFR are now in optional jars.

John
                
