You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@pig.apache.org by "Xianda Ke (JIRA)" <ji...@apache.org> on 2016/06/14 02:17:58 UTC

[jira] [Commented] (PIG-4810) Implement Merge join for spark engine

    [ https://issues.apache.org/jira/browse/PIG-4810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15328798#comment-15328798 ] 

Xianda Ke commented on PIG-4810:
--------------------------------

Hi [~kellyzly], Thanks for your comments. 
1. setReplication() make sense. Thanks.
2. MergeJoin require sorted data as input. MergeJoin optimization will fail UT. That why ORDER query is added.
3. I will fix indent issue.

I will update the patch soon.

> Implement Merge join for spark engine
> -------------------------------------
>
>                 Key: PIG-4810
>                 URL: https://issues.apache.org/jira/browse/PIG-4810
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>            Reporter: liyunzhang_intel
>            Assignee: Xianda Ke
>             Fix For: spark-branch
>
>         Attachments: PIG-4810-2.patch, PIG-4810-3.patch, PIG-4810-4.patch, PIG-4810-5.patch, PIG-4810.patch
>
>
> In current code base(a9151ac), we use regular join to implement merge join in spark mode.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)