Posted to dev@pig.apache.org by "Richard Ding (JIRA)" <ji...@apache.org> on 2010/08/06 20:00:20 UTC
[jira] Commented: (PIG-103) Shared Job /tmp location should be configurable
[ https://issues.apache.org/jira/browse/PIG-103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12896102#action_12896102 ]
Richard Ding commented on PIG-103:
----------------------------------
The patch looks good. A couple of comments:
* In FileLocalizer, it would be better to call getProperty
{code}
String tdir= pigContext.getProperties().getProperty("pig.temp.loc", "/tmp");
{code}
from inside of the if-block so it only gets called when needed.
* In the unit test, it would be good to verify that the method
{code}
FileLocalizer.getTemporaryPath(PigContext pigContext)
{code}
returns the correct temp directory.
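To illustrate both points, here is a minimal, self-contained sketch. It is not the actual FileLocalizer code: the class TempDirSketch, its getTempDir() method, and the lookups counter are hypothetical stand-ins. Only the property name "pig.temp.loc" and the "/tmp" default come from the snippet above. It assumes the if-block guards a value that is computed once and then reused.

```java
import java.util.Properties;

// Hypothetical stand-in for the relevant part of FileLocalizer; not Pig's actual code.
class TempDirSketch {
    private final Properties props;
    private String tempDir; // stays null until the value is first needed
    int lookups = 0;        // counts property reads, for illustration only

    TempDirSketch(Properties props) {
        this.props = props;
    }

    String getTempDir() {
        if (tempDir == null) {
            // The property is read inside the if-block, so only when needed.
            tempDir = props.getProperty("pig.temp.loc", "/tmp");
            lookups++;
        }
        return tempDir;
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty("pig.temp.loc", "/shared/tmp");
        TempDirSketch sketch = new TempDirSketch(props);
        System.out.println(sketch.getTempDir());
    }
}
```

A unit test along these lines could set "pig.temp.loc" to a known value, check that the returned path matches it, and also check that "/tmp" is used when the property is absent.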
> Shared Job /tmp location should be configurable
> -----------------------------------------------
>
> Key: PIG-103
> URL: https://issues.apache.org/jira/browse/PIG-103
> Project: Pig
> Issue Type: Improvement
> Components: impl
> Environment: Partially shared file:// filesystem (eg NFS)
> Reporter: Craig Macdonald
> Assignee: niraj rai
> Fix For: 0.8.0
>
> Attachments: conf_tmp_dir.patch
>
>
> Hello,
> I'm investigating running pig in an environment where various parts of the file:// filesystem are available on all nodes. I can tell hadoop to use a file:// file system location for its default, by setting fs.default.name=file://path/to/shared/folder
> However, this creates issues for Pig, as Pig writes its job information in a folder that it assumes is a shared FS (eg DFS). However, in this scenario /tmp is not shared on each machine.
> So /tmp should either be configurable, or Hadoop should tell you the actual full location set in fs.default.name?
> Straightforward solution is to make "/tmp/" a property in src/org/apache/pig/impl/io/FileLocalizer.java init(PigContext)
> Any suggestions of property names?