You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/07/24 16:24:00 UTC
[jira] [Commented] (FLINK-8478) Add support for time-bounded joins
in DataStream API
[ https://issues.apache.org/jira/browse/FLINK-8478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16554479#comment-16554479 ]
ASF GitHub Bot commented on FLINK-8478:
---------------------------------------
GitHub user florianschmidt1994 opened a pull request:
https://github.com/apache/flink/pull/6407
[FLINK-8478][docs] Add documentation page for different datastream joins
## What is the purpose of the change
Add a documentation page under Application Development / Streaming / Joining that describes
1. The different types of window joins in the DataStream API
2. The newly introduced interval join in the DataStream API
## Brief change log
- Added a new docs page
- Added images to describe common scenarios
## Verifying this change
Built the documentation with `./build_docs.sh` and it looks like expected
## Does this pull request potentially affect one of the following parts:
- Dependencies (does it add or upgrade a dependency): no
- The public API, i.e., is any changed class annotated with `@Public(Evolving)`: no
- The serializers: no
- The runtime per-record code paths (performance sensitive): no
- Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: no
- The S3 file system connector: no
## Documentation
- Does this pull request introduce a new feature? no
- If yes, how is the feature documented? not applicable
For those that just want to have a "quick look" I attached a screenshot
![](https://i.imgur.com/c78WuD7.jpg)
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/florianschmidt1994/flink flink-8478-add-docs-for-joins
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/6407.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #6407
----
commit d2800ff33af179dd32876020d27994b9dc6579aa
Author: Florian Schmidt <fl...@...>
Date: 2018-07-24T16:14:50Z
[FLINK-8478] Add documentation page for different datastream joins
----
> Add support for time-bounded joins in DataStream API
> ----------------------------------------------------
>
> Key: FLINK-8478
> URL: https://issues.apache.org/jira/browse/FLINK-8478
> Project: Flink
> Issue Type: New Feature
> Components: DataStream API, Java API, Scala API
> Reporter: Florian Schmidt
> Assignee: Florian Schmidt
> Priority: Major
> Labels: pull-request-available
>
> The goal is to implement a DataStream API to join together two streams based on a key and user defined time boundaries.
> This feature should mirror what is already possible in Flink SQL:
> {code:java}
> SELECT s1.value, s2.value FROM Table s1 JOIN Table s2 ON o.key = s.key AND o.rowtime BETWEEN s.rowtime AND s.rowtime + INTERVAL '1' HOUR;
> {code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)