You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Joydeep Sen Sarma (JIRA)" <ji...@apache.org> on 2009/02/09 05:44:59 UTC
[jira] Updated: (HIVE-131) insert overwrite directory leaves behind
uncommitted/tmp files from failed tasks
[ https://issues.apache.org/jira/browse/HIVE-131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joydeep Sen Sarma updated HIVE-131:
-----------------------------------
Attachment: HIVE-131.patch.1
> insert overwrite directory leaves behind uncommitted/tmp files from failed tasks
> --------------------------------------------------------------------------------
>
> Key: HIVE-131
> URL: https://issues.apache.org/jira/browse/HIVE-131
> Project: Hadoop Hive
> Issue Type: Bug
> Components: Query Processor
> Reporter: Joydeep Sen Sarma
> Assignee: Joydeep Sen Sarma
> Priority: Critical
> Attachments: HIVE-131.patch.1
>
>
> _tmp files are getting left behind on insert overwrite directory:
> /user/jssarma/ctst1/40422_m_000195_0.deflate <r 3> 13285 2008-12-07 01:47 rw-r--r-- jssarma supergroup
> /user/jssarma/ctst1/40422_m_000196_0.deflate <r 3> 3055 2008-12-07 01:46 rw-r--r-- jssarma supergroup
> /user/jssarma/ctst1/_tmp.40422_m_000033_0 <r 3> 0 2008-12-07 01:53 rw-r--r-- jssarma supergroup
> /user/jssarma/ctst1/_tmp.40422_m_000037_1 <r 3> 0 2008-12-07 01:53 rw-r--r-- jssarma supergroup
> this happened with speculative execution. the code looks good (in fact in this case many speculative tasks were launched - and only a couple caused problems). Almost seems like these files did not appear in the namespace until after the map-reduce job finished and the movetask did a listing of the output dir ..
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.