Posted to issues@calcite.apache.org by "Benchao Li (Jira)" <ji...@apache.org> on 2023/05/20 12:49:00 UTC
[jira] [Commented] (CALCITE-4334) LITERAL_AGG, an aggregate function that returns a constant value
[ https://issues.apache.org/jira/browse/CALCITE-4334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17724529#comment-17724529 ]
Benchao Li commented on CALCITE-4334:
-------------------------------------
I understand the intention and benefit of introducing LITERAL_AGG. What I'm thinking about is the limitations it may introduce. Some points are below:
- Does the new LITERAL_AGG affect predicate deduction and expression simplification, e.g. constant folding?
- Is the new LITERAL_AGG properly handled in RelToSqlConverter? Since this is an internal aggregate function, we may need to prevent it from being translated to other dialects.
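To make the second point concrete, here is a minimal sketch of one way a SQL generator could avoid leaking the internal function name: it prints the literal itself in the SELECT list. This is a toy model, not Calcite's converter; the class LiteralAggSqlSketch, its nested AggCall, and the methods selectItem and toSql are hypothetical names invented for illustration.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model, NOT Calcite's API: renders a simple aggregate as SQL text. */
final class LiteralAggSqlSketch {

  /** Hypothetical stand-in for an aggregate call. */
  static final class AggCall {
    final String function; // e.g. "SUM" or "LITERAL_AGG"
    final String operand;  // column name for SUM, literal text for LITERAL_AGG

    AggCall(String function, String operand) {
      this.function = function;
      this.operand = operand;
    }
  }

  /** Renders one select item; the internal function is emitted as its bare literal. */
  static String selectItem(AggCall call) {
    if (call.function.equals("LITERAL_AGG")) {
      return call.operand; // never print the internal function name
    }
    return call.function + "(" + call.operand + ")";
  }

  static String toSql(String table, List<String> groupKeys, List<AggCall> calls) {
    // A bare "SELECT true FROM t" is not an aggregate query (one row per input row),
    // so this sketch refuses the case it cannot render faithfully.
    boolean hasRealAggregate = false;
    for (AggCall call : calls) {
      if (!call.function.equals("LITERAL_AGG")) {
        hasRealAggregate = true;
      }
    }
    if (groupKeys.isEmpty() && !hasRealAggregate) {
      throw new IllegalArgumentException("needs a different rendering");
    }
    List<String> items = new ArrayList<>(groupKeys);
    for (AggCall call : calls) {
      items.add(selectItem(call));
    }
    String sql = "SELECT " + String.join(", ", items) + " FROM " + table;
    if (!groupKeys.isEmpty()) {
      sql += " GROUP BY " + String.join(", ", groupKeys);
    }
    return sql;
  }

  public static void main(String[] args) {
    // Prints: SELECT deptno, SUM(sal), true FROM Emp GROUP BY deptno
    System.out.println(toSql("Emp", List.of("deptno"),
        List.of(new AggCall("SUM", "sal"), new AggCall("LITERAL_AGG", "true"))));
  }
}
```

The case the sketch rejects (empty group key with only LITERAL_AGG calls) is the kind of limitation I have in mind: the literal alone does not preserve aggregate semantics, so it would need special handling.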
> LITERAL_AGG, an aggregate function that returns a constant value
> ----------------------------------------------------------------
>
> Key: CALCITE-4334
> URL: https://issues.apache.org/jira/browse/CALCITE-4334
> Project: Calcite
> Issue Type: Bug
> Reporter: Julian Hyde
> Assignee: Julian Hyde
> Priority: Major
>
> It would be useful to have an aggregate function that returns a constant value, regardless of how many rows are in the group. We propose {{LITERAL_AGG}}, an internal aggregate function. As an aggregate function it has no arguments (meaning that it does not read any columns from the input), but a call will have a RexLiteral (constant expression).
> This aggregate function is internal, so there is no SQL syntax. But if I were to write
> {code}
> SELECT deptno, SUM(sal), true
> FROM Emp
> GROUP BY deptno
> {code}
> I should get the following plan:
> {code}
> Aggregate(group={deptno}, aggCalls=SUM($4), LITERAL_AGG(true))
>   Scan(Emp)
> {code}
> Today, the plan would require an extra {{Project}} to add {{RexLiteral(true)}}. Planner rules have to look for a {{Project}} of a literal on top of an {{Aggregate}}; by removing the {{Project}} we simplify planning.
> For example, when rewriting sub-queries (see {{SubQueryRemoveRule}}) we add "true as indicator" to the SELECT clause of sub-queries. It can be used to detect rows generated by an outer join. If it is an aggregate query, we would have to write "min(true) as indicator", which necessitates an extra {{Project}} below the {{Aggregate}} to provide the "true" value. The LITERAL_AGG aggregate function allows us to avoid the extra {{Project}}.
> Another example came up during CALCITE-4317. We would like to make {{RelBuilder.aggregate(groupKey())}} throw when given an empty group key and no aggregate calls. (Because it would create an {{Aggregate}} that has zero fields, and that is problematic elsewhere in Calcite.) But we would also like a pattern where an aggregate with an empty group key becomes a constant single-row relational expression. So, {{RelBuilder.aggregate(groupKey(), aggregateCall(LITERAL_AGG(true)))}} should generate {{VALUES TRUE}}.
> {{LITERAL_AGG}} uses {{interface SqlStaticAggFunction}}, an interface that can optionally be implemented (or wrapped, via {{SqlAggFunction.unwrap(Class)}}), to make the planner and code-generator aware of its properties.
> The implementation adds a {{List<RexNode> rexList}} field to {{AggregateCall}}. This field is empty in all conventional aggregate functions, and has one element in {{LITERAL_AGG}}.
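The semantics described in the quoted issue can be sketched with a small, self-contained model. It is not Calcite code: LiteralAggEvalSketch, its nested AggCall, and the methods evaluate and aggregate are hypothetical, and plain Java objects stand in for RexNode. The model assumes a call carries a literal list that is empty for SUM and holds one element for LITERAL_AGG, and that an empty group key yields exactly one output row.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Toy model, NOT Calcite's API: an aggregate call that carries a literal list. */
final class LiteralAggEvalSketch {

  static final class AggCall {
    final String function;      // "SUM" or "LITERAL_AGG"
    final int arg;              // input column index; -1 when no column is read
    final List<Object> rexList; // empty for SUM, one literal for LITERAL_AGG

    private AggCall(String function, int arg, List<Object> rexList) {
      this.function = function;
      this.arg = arg;
      this.rexList = rexList;
    }

    static AggCall sum(int arg) {
      return new AggCall("SUM", arg, List.of());
    }

    static AggCall literal(Object value) {
      return new AggCall("LITERAL_AGG", -1, List.of(value));
    }
  }

  /** Evaluates one call over the rows of a single group. */
  static Object evaluate(AggCall call, List<int[]> rows) {
    if (call.function.equals("LITERAL_AGG")) {
      return call.rexList.get(0); // constant, however many rows the group has
    }
    if (rows.isEmpty()) {
      return null; // SQL SUM over no rows is NULL
    }
    long total = 0;
    for (int[] row : rows) {
      total += row[call.arg];
    }
    return total;
  }

  /** Groups by one column, or treats all rows as one group if groupColumn is negative. */
  static List<List<Object>> aggregate(List<int[]> rows, int groupColumn, List<AggCall> calls) {
    Map<Integer, List<int[]>> groups = new LinkedHashMap<>();
    if (groupColumn < 0) {
      groups.put(null, rows); // empty group key: exactly one group, even with no rows
    } else {
      for (int[] row : rows) {
        groups.computeIfAbsent(row[groupColumn], k -> new ArrayList<>()).add(row);
      }
    }
    List<List<Object>> result = new ArrayList<>();
    for (Map.Entry<Integer, List<int[]>> group : groups.entrySet()) {
      List<Object> out = new ArrayList<>();
      if (groupColumn >= 0) {
        out.add(group.getKey());
      }
      for (AggCall call : calls) {
        out.add(evaluate(call, group.getValue()));
      }
      result.add(out);
    }
    return result;
  }
}
```

In this model, calling aggregate with a negative groupColumn and a single literal(true) call returns one row containing true even for empty input, which mirrors the {{VALUES TRUE}} case in the description.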
--
This message was sent by Atlassian Jira
(v8.20.10#820010)