Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/11/05 19:34:32 UTC
[jira] Created: (NUTCH-763) Separate configuration files from resources to be included in the job file
Separate configuration files from resources to be included in the job file
--------------------------------------------------------------------------
Key: NUTCH-763
URL: https://issues.apache.org/jira/browse/NUTCH-763
Project: Nutch
Issue Type: Wish
Reporter: Julien Nioche
Priority: Minor
Fix For: 1.1
One of the things I found confusing when I was learning Nutch was that the conf/ directory contains two kinds of files at the same time:
- configuration files for Hadoop / Nutch, which are put in the jar files but not used there
- resource files (e.g. filtering rules), which MUST be up to date in the job file
I would separate the conf/ directory from, say, a resources/ directory, which would contain the rule files and other things to put in the job file. Unless I am mistaken, none of the configuration files need to be in the job file. I know it is a very minor point, but it would probably simplify things and make it easier for beginners to understand what has to be modified where.
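To make the proposed split concrete, here is a rough sketch of how the contents of conf/ might be partitioned into the two groups. It is only an illustration: the function name split_conf and the patterns in RESOURCE_PATTERNS are hypothetical, not taken from the Nutch build, and which files really belong in the job file would still need to be checked.

```python
from fnmatch import fnmatch

# Hypothetical patterns for rule-style resources that would move to resources/.
# These are assumptions for illustration, not a list taken from the Nutch build.
RESOURCE_PATTERNS = ("*-urlfilter.txt", "*-normalize.xml")


def split_conf(filenames, resource_patterns=RESOURCE_PATTERNS):
    """Partition file names into (config, resources) by glob pattern."""
    config, resources = [], []
    for name in filenames:
        if any(fnmatch(name, pattern) for pattern in resource_patterns):
            resources.append(name)  # would go into the job file
        else:
            config.append(name)     # would stay in conf/
    return config, resources


if __name__ == "__main__":
    config, resources = split_conf(
        ["nutch-site.xml", "regex-urlfilter.txt", "regex-normalize.xml"]
    )
    print("conf/:", config)
    print("resources/:", resources)
```

With a split like this, only the second group would have to be kept up to date in the job file.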
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (NUTCH-763) Separate configuration files from resources to be included in the job file
Posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/NUTCH-763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann updated NUTCH-763:
------------------------------------
Fix Version/s: (was: 1.1)
- pushing this out per http://bit.ly/c7tBv9