Posted to dev@hive.apache.org by "Ning Zhang (JIRA)" <ji...@apache.org> on 2010/07/13 23:42:49 UTC

[jira] Commented: (HIVE-1403) Reporting progress to JT during closing files in FileSinkOperator

    [ https://issues.apache.org/jira/browse/HIVE-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12888028#action_12888028 ] 

Ning Zhang commented on HIVE-1403:
----------------------------------

Siying, the new patch looks good in general, but can you open a different JIRA and upload your patch to that?

> Reporting progress to JT during closing files in FileSinkOperator
> -----------------------------------------------------------------
>
>                 Key: HIVE-1403
>                 URL: https://issues.apache.org/jira/browse/HIVE-1403
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>             Fix For: 0.6.0
>
>         Attachments: HIVE-1403.1.patch, HIVE-1403.patch
>
>
> If many files need to be closed in FileSinkOperator (e.g., if DynamicPartition/FileSpray is turned on), each task can generate many files, all of which are closed in FileSinkOperator.closeOp(). If the NN is overloaded, each file close can take more than 1 sec. This sometimes makes the JT think the task is dead, because closing all the files takes too long and no progress is reported in the meantime. We need to report progress periodically during file closing.
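
The idea in the description above can be sketched roughly as follows. This is not the attached patch, only a minimal illustration under stated assumptions: ProgressDuringClose, ProgressReporter, closeAll, intervalMillis and the injected clock are all hypothetical names, and ProgressReporter merely stands in for whatever progress callback the task has (in Hadoop, something like Reporter.progress()).

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.List;
import java.util.function.LongSupplier;

// Hypothetical sketch; not the code from HIVE-1403.patch.
final class ProgressDuringClose {

    /** Hypothetical stand-in for the task's progress callback. */
    interface ProgressReporter {
        void progress();
    }

    /**
     * Closes every file in order and calls reporter.progress() whenever at
     * least intervalMillis have passed since the previous report.
     * The clock is injected (assumption) so the timing logic can be tested;
     * a real caller could pass System::currentTimeMillis.
     * Returns the number of progress reports sent.
     */
    static int closeAll(List<? extends Closeable> files,
                        ProgressReporter reporter,
                        long intervalMillis,
                        LongSupplier clock) throws IOException {
        int reports = 0;
        long lastReport = clock.getAsLong();
        for (Closeable file : files) {
            file.close(); // may be slow if the NN is overloaded
            long now = clock.getAsLong();
            if (now - lastReport >= intervalMillis) {
                reporter.progress(); // tell the JT the task is still alive
                reports++;
                lastReport = now;
            }
        }
        return reports;
    }
}
```

Reporting on a time interval rather than after every close keeps the number of progress calls small when closes are fast, while still guaranteeing a report at least once per slow close.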
