You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/12/18 06:14:46 UTC

[jira] [Commented] (FLINK-2971) Add outer joins to the Table API

    [ https://issues.apache.org/jira/browse/FLINK-2971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15063475#comment-15063475 ] 

ASF GitHub Bot commented on FLINK-2971:
---------------------------------------

GitHub user ChengXiangLi opened a pull request:

    https://github.com/apache/flink/pull/1469

    [FLINK-2971] support outer join for hash join on build side.

    1. There are 4 reserved bytes left in bucket header of `MutableHashTable`, as there are only 9 elements in each bucket, This PR could use 2 bytes to build a BitSet which is used to mark whether elements in that bucket has been probed during probe phase. After probe phase, return the elements which has not been probed at the end.
    2. As build side outer join is supported, we could support more flexible strategy for left outer join, right outer join and full outer join, new supported join types includes:
      * left outer join with `REPARTITION_HASH_FIRST`. 
      * right outer join with `REPARTITION_HASH_SECOND`
      * full outer join with `REPARTITION_HASH_FIRST` or `REPARTITION_HASH_SECOND`.
    3. But there is still some limitations about broadcast hash join, the following join types are still not supported for obviously reason:
      * left outer join with `BROADCAST_HASH_FIRST`.
      * right outer join with `BROADCAST_HASH_SECOND`.
      * full outer join with `BROADCAST_HASH_FIRST` and `BROADCAST_HASH_SECOND`.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ChengXiangLi/flink hashFullOuterJoin

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/1469.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1469
    
----
commit a3445a0666dd9c349bc11fca2e2554d500175280
Author: chengxiang li <ch...@intel.com>
Date:   2015-12-17T03:34:41Z

    [FLINK-2871] support outer join for hash on build side.

commit 92961bcd26e2dafb70006ea673abb07a67b77c9b
Author: chengxiang li <ch...@intel.com>
Date:   2015-12-18T04:52:55Z

    fix format

----


> Add outer joins to the Table API
> --------------------------------
>
>                 Key: FLINK-2971
>                 URL: https://issues.apache.org/jira/browse/FLINK-2971
>             Project: Flink
>          Issue Type: New Feature
>          Components: Table API
>            Reporter: Timo Walther
>
> Since Flink now supports outer joins, the Table API can also support left, right and full outer joins.
> Given that null values are properly supported by RowSerializer etc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)