Posted to dev@pig.apache.org by Bill Graham <bi...@gmail.com> on 2012/07/26 01:32:39 UTC

writing info to configs

Hi,

Pig internally generates a lot of runtime info that would be useful to have
written to the job config for later debugging and analysis. Should we develop a
standard for params that are written to the job conf file but not accepted as
job input?

For example, these params are accepted as input:


pig.exec.reducers.max
pig.exec.reducers.bytes.per.reducer


But these (not-yet-supported) params would not be; they would only be produced:

pig.info.reducers.requested.parallel
pig.info.reducers.estimated.parallel
pig.info.reducers.runtime.parallel


I'm proposing the 'pig.info' prefix for this. We could even return an error
message if someone tries to set one of these. See this comment for
more context:

https://issues.apache.org/jira/browse/PIG-2779?focusedCommentId=13422680&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13422680
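
As a rough sketch of the idea (plain Java, with a Map standing in for the
job conf; the class and method names are made up for illustration and are not
existing Pig code), the prefix check and the write step could look something
like this:

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Pig. A plain Map stands in for the job conf.
final class PigInfoParams {
    // Prefix proposed in this thread for output-only params.
    static final String INFO_PREFIX = "pig.info.";

    private PigInfoParams() {}

    // Fail fast if the user-supplied settings contain any output-only key.
    static void rejectUserSetInfoParams(Map<String, String> userConf) {
        for (String key : userConf.keySet()) {
            if (key.startsWith(INFO_PREFIX)) {
                throw new IllegalArgumentException(
                    key + " is output-only and cannot be set as job input");
            }
        }
    }

    // Record the reducer parallelism values in the job conf for later debugging.
    static void writeReducerInfo(Map<String, String> jobConf,
                                 int requested, int estimated, int runtime) {
        jobConf.put(INFO_PREFIX + "reducers.requested.parallel",
                    Integer.toString(requested));
        jobConf.put(INFO_PREFIX + "reducers.estimated.parallel",
                    Integer.toString(estimated));
        jobConf.put(INFO_PREFIX + "reducers.runtime.parallel",
                    Integer.toString(runtime));
    }

    public static void main(String[] args) {
        // Input params are accepted as usual (the value here is arbitrary).
        Map<String, String> userConf = new HashMap<>();
        userConf.put("pig.exec.reducers.max", "999");
        rejectUserSetInfoParams(userConf); // passes: no pig.info.* keys present

        // Pig would then add the info params before the job is submitted.
        Map<String, String> jobConf = new HashMap<>(userConf);
        writeReducerInfo(jobConf, -1, 12, 12); // arbitrary example values
        System.out.println(jobConf);
    }
}
```

Whether a user-set pig.info.* value should be a hard error or just a warning
is open for discussion; the sketch uses an exception only to keep it short.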

Thoughts?


thanks,
Bill