You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2020/10/13 11:21:41 UTC

[GitHub] [spark] ulysses-you opened a new pull request #30029: [SPARK-33131][SQL] Fix grouping sets with having clause can not resolve qualified col name

ulysses-you opened a new pull request #30029:
URL: https://github.com/apache/spark/pull/30029

### What changes were proposed in this pull request?

Correct the resolution of having clause.

### Why are the changes needed?

The method `ResolveAggregateFunctions.resolveFilterCondInAggregate` aims to do the two things
1. resolve the expression in having.
2. push the having extra agg expression to `Aggregate`

However we only care about 2. If having clause resolution is successful but not exists extra agg expression, we will ignore the resolution. Here is a example:
```
-- Works resolved by `ResolveReferences`
select c1 from values (1) as t1(c1) group by grouping sets(t1.c1) having c1 = 1

-- Works because of the extra expression c1
select c1 as c2 from values (1) as t1(c1) group by grouping sets(t1.c1) having t1.c1 = 1

-- Failed
select c1 from values (1) as t1(c1) group by grouping sets(t1.c1) having t1.c1 = 1
```

It wroks with `Aggregate` without grouping sets through `ResolveReferences`, but Grouping sets not works since the exprId has been changed.

### Does this PR introduce _any_ user-facing change?

Yes, bug fix.

### How was this patch tested?

add test.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org