You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Michel Tourn (JIRA)" <ji...@apache.org> on 2006/04/26 19:17:03 UTC
[jira] Commented: (HADOOP-167) reducing the number of Configuration
& JobConf objects created
[ http://issues.apache.org/jira/browse/HADOOP-167?page=comments#action_12376516 ]
Michel Tourn commented on HADOOP-167:
-------------------------------------
Avoiding the multiple config-loading messages is a good thing.
This could also be controlled with a verbosity / logging level setting.
Please don't remove the JobConf(Configuration) constructor.
This is the only mechanism available to programatically change your Configuration.
We rely on this in two ways:
1. to select from multiple XML config files that correspond to multiple Hadoop systems.
2. to make some properties (paths) user-dependant.
Ex for 1.
config_ = new Configuration();
config_.addFinalResource(getHadoopAliasConfFile());
jobConf_ = new JobConf(config_);
In fact, it is important for Hadoop to maintain this property:
ALL uses of a JobConf must be configurable at the outset by the caller by passing in a Configuration object.
Common examples of such top-level Hadoop entry points:
Job submission, MapRed in local mode, DFS client calls.
In general we should make sure that we don't FORCE
a long lifetime for a 'cached' JobConf object:
There are applications that need to use new JobConf-s along the way:
1. bec. they must first discover properties of the Hadoop cluster (list files, then submit job)
2. bec. they talk to multiple Hadoop systems (import / export files)
> reducing the number of Configuration & JobConf objects created
> --------------------------------------------------------------
>
> Key: HADOOP-167
> URL: http://issues.apache.org/jira/browse/HADOOP-167
> Project: Hadoop
> Type: Improvement
> Components: conf
> Versions: 0.1.1
> Reporter: Owen O'Malley
> Assignee: Owen O'Malley
> Fix For: 0.2
> Attachments: remove-confs.patch
>
> Currently, Configuration and JobConf objects are created many times during executing a job. In particular, the Task Tracker creates a lot of them. They both clutter up the logs and parse the xml config files over and over again.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira