You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@samza.apache.org by "Yi Pan (Data Infrastructure) (JIRA)" <ji...@apache.org> on 2015/04/16 01:50:59 UTC

[jira] [Created] (SAMZA-653) Stream-to-stream join operator w/ time-based window

Yi Pan (Data Infrastructure) created SAMZA-653:
--------------------------------------------------

             Summary: Stream-to-stream join operator w/ time-based window
                 Key: SAMZA-653
                 URL: https://issues.apache.org/jira/browse/SAMZA-653
             Project: Samza
          Issue Type: Sub-task
          Components: sql
            Reporter: Yi Pan (Data Infrastructure)


The task is to implement a join operator for the query below:

{code}
SELECT a.time as time, a.id as id, a.a, b.b FROM A as a JOIN B as b ON  a.id = b.id AND a.time > b.time - 3 MIN AND a.time < b.time + 5 MIN
{code}

The join operator needs to implement the following two functions:

# Given a incoming message from window operator from stream A, retrieve all relevant messages for the join from stream B's window operator
# Given two sets of messages from stream A and B, perform the join logic to create the output tuples.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)