You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "He Yongqiang (JIRA)" <ji...@apache.org> on 2010/01/08 21:19:54 UTC

[jira] Updated: (HIVE-964) handle skewed keys for a join in a separate job

     [ https://issues.apache.org/jira/browse/HIVE-964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

He Yongqiang updated HIVE-964:
------------------------------

    Attachment: hive-964-2010-01-08.patch

CHANGES:
1) let row container use hadoop fileformat
2) let row container directly upload file to dfs
3) added logic to handle reducer failure (we should not see results of a failure reducer) .
4) write one file per skew key and combinefileinputformat.

> handle skewed keys for a join in a separate job
> -----------------------------------------------
>
>                 Key: HIVE-964
>                 URL: https://issues.apache.org/jira/browse/HIVE-964
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: He Yongqiang
>         Attachments: hive-964-2009-12-17.txt, hive-964-2009-12-28-2.patch, hive-964-2009-12-29-4.patch, hive-964-2010-01-08.patch
>
>
> The skewed keys can be written to a temporary table or file, and a followup conditional task can be used to perform the join on those keys.
> As a first step, JDBM can be used for those keys

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.