Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/11/05 19:34:32 UTC

[jira] Created: (NUTCH-763) Separate configuration files from resources to be included in the job file

Separate configuration files from resources to be included in the job file
--------------------------------------------------------------------------

                 Key: NUTCH-763
                 URL: https://issues.apache.org/jira/browse/NUTCH-763
             Project: Nutch
          Issue Type: Wish
            Reporter: Julien Nioche
            Priority: Minor
             Fix For: 1.1


One of the things I found confusing when I was learning Nutch was that the conf/ directory contains both:
- configuration files for Hadoop / Nutch, which are put in the jar files but not used there
- resource files (e.g. filtering rules), which MUST be up to date in the job file

I would separate the conf/ directory from, say, a resources/ directory, which would contain the rule files and other things to put in the job file. Unless I am mistaken, none of the configuration files needs to be in the job file. I know it is a very minor point, but it would probably simplify things and make it easier for beginners to understand what has to be modified where.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-763) Separate configuration files from resources to be included in the job file

Posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated NUTCH-763:
------------------------------------

    Fix Version/s:     (was: 1.1)

- pushing this out per http://bit.ly/c7tBv9

