Posted to issues@tajo.apache.org by "Hyunsik Choi (JIRA)" <ji...@apache.org> on 2014/08/13 10:45:12 UTC

[jira] [Resolved] (TAJO-972) Broadcast join with left outer join returns duplicated rows.

     [ https://issues.apache.org/jira/browse/TAJO-972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyunsik Choi resolved TAJO-972.
-------------------------------

       Resolution: Fixed
    Fix Version/s: 0.9.0

Already committed to the master branch.

> Broadcast join with left outer join returns duplicated rows.
> ------------------------------------------------------------
>
>                 Key: TAJO-972
>                 URL: https://issues.apache.org/jira/browse/TAJO-972
>             Project: Tajo
>          Issue Type: Bug
>            Reporter: Hyoungjun Kim
>            Assignee: Hyoungjun Kim
>            Priority: Minor
>             Fix For: 0.9.0
>
>
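> If a LEFT OUTER JOIN has a broadcast table and the broadcast table is on the left side, every task runs the join operation with all rows of the broadcast table. As a result, for a given broadcast row, some tasks find a match and other tasks do not, and each task that does not find a match still emits that row padded with null.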
> For example:
> {noformat}
> default>select * from small
> id
> -----------------
> 1
> 2
> 3
> default>select * from large
> 1
> 4    <-- Block1 in HDFS
> 5
> ...
> 2    <-- Block2 in HDFS
> 6
> default> select a.id, b.id from small a left outer join large b on a.id = b.id
> a.id    b.id
> ---------------------------
> 1  1
> 2  null
> 3  null
> 1  null
> 2  2
> 3  null
> {noformat}
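
The duplication in the example above can be reproduced with a small toy model. This is only a sketch in Python, not Tajo code: the function names (left_outer_join, per_task_join, global_join) are hypothetical, each HDFS block is assumed to be handled by exactly one task, and the rows elided by "..." in the example are simply left out. The reference result is computed by joining against the whole right table at once, which is not necessarily how the committed fix works.

```python
# Toy model of the behaviour described in the issue; this is not Tajo code.
# Assumption: one task per HDFS block, and the rows elided by "..." are omitted.
small = [1, 2, 3]                    # broadcast table, left side of the join
large_blocks = [[1, 4, 5], [2, 6]]   # Block1 and Block2 of the large table


def left_outer_join(left, right):
    """Left outer join on id equality, returning (left_id, right_id) pairs."""
    rows = []
    for left_id in left:
        matches = [right_id for right_id in right if right_id == left_id]
        if matches:
            rows.extend((left_id, right_id) for right_id in matches)
        else:
            rows.append((left_id, None))  # null-pad an unmatched left row
    return rows


def per_task_join(left, blocks):
    """Each task joins the whole broadcast table against only its own block."""
    rows = []
    for block in blocks:
        rows.extend(left_outer_join(left, block))
    return rows


def global_join(left, blocks):
    """Reference result: join against all rows of the right table at once."""
    all_right = [right_id for block in blocks for right_id in block]
    return left_outer_join(left, all_right)


buggy = per_task_join(small, large_blocks)
expected = global_join(small, large_blocks)
print(buggy)     # six rows, same as the output reported in the issue
print(expected)  # three rows, one per row of the left table
```

In this model the per-task result contains every left row once per task, because a left row that has no match inside one block is null-padded by that task even when it matches in another block.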



--
This message was sent by Atlassian JIRA
(v6.2#6252)