Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/06/04 16:29:46 UTC

[GitHub] [beam] damccorm opened a new issue, #20299: BeamSQL Pattern Recognization Functionality

damccorm opened a new issue, #20299:
URL: https://github.com/apache/beam/issues/20299

   The goal of this Jira is to support the following syntax in BeamSQL:
   
   ```
   
   SELECT T.aid, T.bid, T.cid
   FROM MyTable
       MATCH_RECOGNIZE (
         PARTITION BY userid
         ORDER BY proctime
         MEASURES
           A.id AS aid,
           B.id AS bid,
           C.id AS cid
         PATTERN (A B C)
         DEFINE
           A AS name = 'a',
           B AS name = 'b',
           C AS name = 'c'
       ) AS T
   
   ```
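
To make the intended semantics concrete, here is a minimal Python sketch of what the query above would return. It is not Beam or Calcite code. The helper `match_abc` and the sample rows are hypothetical, and the sketch assumes the SQL:2016 defaults (ONE ROW PER MATCH, AFTER MATCH SKIP PAST LAST ROW), with A, B and C matching three consecutive rows within a partition.

```python
from collections import defaultdict


def match_abc(rows):
    """Return one (aid, bid, cid) tuple per match of PATTERN (A B C).

    Hypothetical helper: rows are dicts with the keys userid, proctime,
    id and name, mirroring the columns used in the query.
    """
    # PARTITION BY userid
    partitions = defaultdict(list)
    for row in rows:
        partitions[row["userid"]].append(row)

    matches = []
    # Partitions are visited in sorted key order only to keep the output deterministic.
    for userid in sorted(partitions):
        # ORDER BY proctime
        ordered = sorted(partitions[userid], key=lambda r: r["proctime"])
        i = 0
        while i + 2 < len(ordered):
            a, b, c = ordered[i:i + 3]
            # DEFINE A AS name = 'a', B AS name = 'b', C AS name = 'c'
            if a["name"] == "a" and b["name"] == "b" and c["name"] == "c":
                # MEASURES A.id AS aid, B.id AS bid, C.id AS cid
                matches.append((a["id"], b["id"], c["id"]))
                i += 3  # assumed default: AFTER MATCH SKIP PAST LAST ROW
            else:
                i += 1
    return matches


rows = [
    {"userid": 1, "proctime": 1, "id": 10, "name": "a"},
    {"userid": 2, "proctime": 2, "id": 20, "name": "a"},
    {"userid": 1, "proctime": 3, "id": 11, "name": "b"},
    {"userid": 1, "proctime": 4, "id": 12, "name": "c"},
    {"userid": 2, "proctime": 5, "id": 21, "name": "c"},
]
print(match_abc(rows))
```

In this sample only user 1 produces a match, because user 2 never sees a row named 'b' between its 'a' and 'c' rows. Each partition is matched independently of the others, which is what would let a distributed implementation process the partitions in parallel.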
   
   
   MATCH_RECOGNIZE is part of the SQL:2016 standard, and Calcite already supports it. A good reference for MATCH_RECOGNIZE is [1].
   
   This will require touching core components of BeamSQL:
   1. SQL parser to support the syntax above.
   2. SQL core to implement a physical relational operator.
   3. Distributed algorithms to implement a list of functions in a distributed manner.
   
   Other references:
   Calcite match_recognize syntax [2]
   
   [1]: https://ci.apache.org/projects/flink/flink-docs-stable/dev/table/streaming/match_recognize.html
   [2]: https://calcite.apache.org/docs/reference.html#syntax-1
   
   Imported from Jira [BEAM-9543](https://issues.apache.org/jira/browse/BEAM-9543). Original Jira may contain additional context.
   Reported by: amaliujia.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org