You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/03/10 17:20:00 UTC

[jira] [Work logged] (BEAM-12097) OVER clause converted from ZetaSQL to Logical SQL plan

     [ https://issues.apache.org/jira/browse/BEAM-12097?focusedWorklogId=739624&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-739624 ]

ASF GitHub Bot logged work on BEAM-12097:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Mar/22 17:19
            Start Date: 10/Mar/22 17:19
    Worklog Time Spent: 10m 
      Work Description: roger-mike commented on pull request #16911:
URL: https://github.com/apache/beam/pull/16911#issuecomment-1064306954


   Hi @ibzib @apilloud. I have some questions about the equivalencies between the ZetaSQL AST and CalciteSQL AST.  For this query: 
   ```sql
   select sum(Key) over (PARTITION BY Value), count(Key) over (ORDER BY Key) From KeyValue
   ```
   We get the following AST:
   ![Screen Shot 2022-03-01 at 16 26 43](https://user-images.githubusercontent.com/61989986/157715464-a1c26e24-23ef-459b-a81f-858313deccf5.png)
   
   I'm still trying to work out the conversion of the inner classes needed for ```LogicalWindow```. I think the ```ResolvedAnalyticFunctionGroup``` could be mapped to a ```Window.Group``` but still not sure how to convert ```ResolvedWindowPartitioning```, ```ResolvedWindowOrdering```, ```ResolvedAnalyticFunctionCall```.
   
   How do you find these equivalencies? Could you give me some suggestions on how to convert this AST? Thank you.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 739624)
    Time Spent: 40m  (was: 0.5h)

> OVER clause converted from ZetaSQL to Logical SQL plan
> ------------------------------------------------------
>
>                 Key: BEAM-12097
>                 URL: https://issues.apache.org/jira/browse/BEAM-12097
>             Project: Beam
>          Issue Type: New Feature
>          Components: dsl-sql-zetasql
>            Reporter: Andrew Pilloud
>            Assignee: Mike Hernandez
>            Priority: P3
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> The OVER clause isn't supported by our ZetaSQL to Calcite translator. It can be trivially enabled in the parser with the example below, but there is some work required to convert the parsed ZetaSQL proto to Calcite logical operators (mostly in AggregateScanConverter).
> This is the "over clause" TODO here: https://github.com/apache/beam/blob/243128a8fc52798e1b58b0cf1a271d95ee7aa241/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/translation/AggregateScanConverter.java#L147
> {code:java}
> a/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/SqlAnalyzer.java
> +++ b/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/SqlAnalyzer.java
> @@ -144,6 +144,7 @@ public class SqlAnalyzer {
>          .setEnabledLanguageFeatures(
>              new HashSet<>(
>                  Arrays.asList(
> +                    LanguageFeature.FEATURE_ANALYTIC_FUNCTIONS,
>                      LanguageFeature.FEATURE_NUMERIC_TYPE,
>                      LanguageFeature.FEATURE_DISALLOW_GROUP_BY_FLOAT,
>                      LanguageFeature.FEATURE_V_1_2_CIVIL_TIME,
> diff --git a/sdks/java/extensions/sql/zetasql/src/test/java/org/apache/beam/sdk/extensions/sql/zetasql/ZetaSqlDialectSpecTest.java b/sdks/java/extensions/sql/zetasql/src/test/java/org/apache/beam/sdk/extensions/sql/zetasql/ZetaSqlDialectSpecTest.java
> index 33889f34884..fd107ac5721 100644
> --- a/sdks/java/extensions/sql/zetasql/src/test/java/org/apache/beam/sdk/extensions/sql/zetasql/ZetaSqlDialectSpecTest.java
> +++ b/sdks/java/extensions/sql/zetasql/src/test/java/org/apache/beam/sdk/extensions/sql/zetasql/ZetaSqlDialectSpecTest.java
> @@ -3461,6 +3461,16 @@ public class ZetaSqlDialectSpecTest extends ZetaSqlTestBase {
>      zetaSQLQueryPlanner.convertToBeamRel(sql);
>    }
>  
> +  @Test
> +  public void testAnalyticOver() {
> +    String sql = "select sum(Key) over () From KeyValue";
> +
> +    ZetaSQLQueryPlanner zetaSQLQueryPlanner = new ZetaSQLQueryPlanner(config);
> +    thrown.expect(UnsupportedOperationException.class);
> +    thrown.expectMessage("Does not support sub-queries");
> +    zetaSQLQueryPlanner.convertToBeamRel(sql);
> +  }
> +
>    @Test
>    public void testSubstr() {
>      String sql = "SELECT substr(@p0, @p1, @p2)"; {code}
> Current state the test fails:
> {code}
> java.lang.UnsupportedOperationException: Conversion of RESOLVED_ANALYTIC_SCAN is not supported                             
>         at org.apache.beam.sdk.extensions.sql.zetasql.translation.QueryStatementConverter.getConverterRule(QueryStatementConverter.java:108
> )                                                                                                                                          
>         at org.apache.beam.sdk.extensions.sql.zetasql.translation.QueryStatementConverter.convertNode(QueryStatementConverter.java:99)     
>         at java.base/java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:195)                                             
>         at java.base/java.util.Collections$2.tryAdvance(Collections.java:4756)                                                             
>         at java.base/java.util.Collections$2.forEachRemaining(Collections.java:4764)                                                       
>         at java.base/java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:484)                                                 
>         at java.base/java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:474)                                          
>         at java.base/java.util.stream.ReduceOps$ReduceOp.evaluateSequential(ReduceOps.java:913)                                            
>         at java.base/java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234)                                                 
>         at java.base/java.util.stream.ReferencePipeline.collect(ReferencePipeline.java:578)                                                
>         at org.apache.beam.sdk.extensions.sql.zetasql.translation.QueryStatementConverter.convertNode(QueryStatementConverter.java:101)    
>         at org.apache.beam.sdk.extensions.sql.zetasql.translation.QueryStatementConverter.convert(QueryStatementConverter.java:89)         
>         at org.apache.beam.sdk.extensions.sql.zetasql.translation.QueryStatementConverter.convertRootQuery(QueryStatementConverter.java:55)
>         at org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLPlannerImpl.rel(ZetaSQLPlannerImpl.java:98)                                   
>         at org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.convertToBeamRelInternal(ZetaSQLQueryPlanner.java:313)           
>         at org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.convertToBeamRel(ZetaSQLQueryPlanner.java:301)                   
>         at org.apache.beam.sdk.extensions.sql.zetasql.ZetaSQLQueryPlanner.convertToBeamRel(ZetaSQLQueryPlanner.java:285)                   
>         at org.apache.beam.sdk.extensions.sql.zetasql.ZetaSqlDialectSpecTest.testAnalyticOver(ZetaSqlDialectSpecTest.java:3471)   
> {code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)