You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Kirill Kozlov (Jira)" <ji...@apache.org> on 2019/09/19 00:54:00 UTC

[jira] [Commented] (BEAM-7609) SqlTransform#getSchema for "SELECT DISTINCT + JOIN" has invalid field names

    [ https://issues.apache.org/jira/browse/BEAM-7609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16932958#comment-16932958 ] 

Kirill Kozlov commented on BEAM-7609:
-------------------------------------

Disabling rule "AggregateProjectMergeRule" seems to "fix" this problem. Further investigations is required to locate the source of the issue.

> SqlTransform#getSchema for "SELECT DISTINCT + JOIN" has invalid field names
> ---------------------------------------------------------------------------
>
>                 Key: BEAM-7609
>                 URL: https://issues.apache.org/jira/browse/BEAM-7609
>             Project: Beam
>          Issue Type: Bug
>          Components: dsl-sql
>    Affects Versions: 2.13.0
>            Reporter: Gleb Kanterov
>            Assignee: Kirill Kozlov
>            Priority: Major
>
> Works in sqlline shell:
> {code}
> Welcome to Beam SQL 2.14.0-SNAPSHOT (based on sqlline version 1.4.0)
> 0: BeamSQL> CREATE EXTERNAL TABLE s1 (id BIGINT) TYPE 'test';
> No rows affected (0.507 seconds)
> 0: BeamSQL> CREATE EXTERNAL TABLE s2 (id BIGINT) TYPE 'test';
> No rows affected (0.004 seconds)
> 0: BeamSQL> SELECT DISTINCT s1.id as lhs, s2.id as rhs FROM s1 JOIN s2 USING (id);
> +---------------------+---------------------+
> |         lhs         |         rhs         |
> +---------------------+---------------------+
> +---------------------+---------------------+
> No rows selected (2.568 seconds)
> {code}
> But doesn't work in the test:
> {code}
>     Schema inputSchema = Schema.of(
>         Schema.Field.of("id", Schema.FieldType.INT32));
>     PCollection<Row> i1 = p.apply(Create.of(ImmutableList.<Row>of())
>         .withCoder(SchemaCoder.of(inputSchema)));
>     PCollection<Row> i2 = p.apply(Create.of(ImmutableList.<Row>of())
>         .withCoder(SchemaCoder.of(inputSchema)));
>     Schema outputSchema = PCollectionTuple
>         .of("i1", i1)
>         .and("i2", i2)
>         .apply(SqlTransform.query("SELECT DISTINCT s1.id as lhs, s2.id as rhs FROM i1 JOIN i2 USING (id)"))
>         .getSchema();
>     assertEquals(ImmutableList.of("lhs", "rhs"), outputSchema.getFieldNames());
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)