You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Xuefu Zhang (JIRA)" <ji...@apache.org> on 2014/04/15 18:22:20 UTC

[jira] [Commented] (HIVE-6913) Hive unable to find the hashtable file during complex multi-staged map join

    [ https://issues.apache.org/jira/browse/HIVE-6913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969685#comment-13969685 ] 

Xuefu Zhang commented on HIVE-6913:
-----------------------------------

Nice catch, Brock!

The code line you referred to seems off the description that you gave. In addition, s it hard to construct a test case?

> Hive unable to find the hashtable file during complex multi-staged map join
> ---------------------------------------------------------------------------
>
>                 Key: HIVE-6913
>                 URL: https://issues.apache.org/jira/browse/HIVE-6913
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Brock Noland
>            Assignee: Brock Noland
>         Attachments: HIVE-6913.patch
>
>
> If a query has multiple mapjoins and one of the tables to be mapjoined is empty, the query can result in a "no such file or directory" when looking for the hashtable.
> This is because when we generate a dummy hash table, we do not close the TableScan (TS) operator for that table. Additionally, HashTableSinkOperator (HTSO) outputs it's hash tables in the closeOp method. However, when close is called on HTSO we check to ensure that all parents are closed: https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/exec/Operator.java#L333
> which is not true on this case, because the TS operator for the empty table was never closed.



--
This message was sent by Atlassian JIRA
(v6.2#6252)