You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Navis (JIRA)" <ji...@apache.org> on 2013/06/03 02:56:20 UTC

[jira] [Commented] (HIVE-4620) MR temp directory conflicts in case of parallel execution mode

    [ https://issues.apache.org/jira/browse/HIVE-4620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13672744#comment-13672744 ] 

Navis commented on HIVE-4620:
-----------------------------

It would be good to make a phabricator or review-board entry. 

in TaskRunner.run()
{noformat}
taskRunnerID.set(taskCounter.incrementAndGet());
{noformat}
Is it necessary? If it is, is that should be called in runSequential() rather than run()?
                
> MR temp directory conflicts in case of parallel execution mode
> --------------------------------------------------------------
>
>                 Key: HIVE-4620
>                 URL: https://issues.apache.org/jira/browse/HIVE-4620
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.11.0
>            Reporter: Prasad Mujumdar
>            Assignee: Prasad Mujumdar
>             Fix For: 0.12.0
>
>         Attachments: HIVE-4620-1.patch, HIVE-4620-2.patch
>
>
> In parallel query execution mode, all the parallel running task ends up sharing the same temp/scratch directory. This could lead to file conflicts and temp files getting deleted before the job completion.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira