You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/10/01 12:51:00 UTC

[jira] [Resolved] (NUTCH-1805) Remove unnecessary transitive dependencies from Hadoop core

     [ https://issues.apache.org/jira/browse/NUTCH-1805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Nagel resolved NUTCH-1805.
------------------------------------
    Resolution: Resolved

We rely now only on a fixed set of Hadoop sub-dependencies ("hadoop-common", "hadoop-hdfs", "hadoop-mapreduce-*"). Explicit exclusions of transitive dependencies are configured.

> Remove unnecessary transitive dependencies from Hadoop core 
> ------------------------------------------------------------
>
>                 Key: NUTCH-1805
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1805
>             Project: Nutch
>          Issue Type: Improvement
>          Components: build
>            Reporter: Julien Nioche
>            Priority: Minor
>
> The Hadoop libs are not included in the job file as a Hadoop cluster must already be available in order to use it, however some of its transitive dependencies make it to the job file. We already prevent some but could extend that to :
> 			<exclude org="org.mortbay.jetty"/>
> 			<exclude org="com.sun.jersey"/>
> 			<exclude org="tomcat"/>
> Note that we need some of the Hadoop classes and dependencies in order to run Nutch in local mode.
> Alternatively we could have a separate Ivy profile only for Hadoop and store the dependencies in a separate location so that they do not get copied to the job jar, however this is probably an overkill if the dependencies above are not needed when running in local mode.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)