Posted to dev@calcite.apache.org by "Jin Xing (Jira)" <ji...@apache.org> on 2020/01/30 13:51:00 UTC
[jira] [Created] (CALCITE-3760) Rewriting function without
considering determinism can break query semantics
Jin Xing created CALCITE-3760:
---------------------------------
Summary: Rewriting function without considering determinism can break query semantics
Key: CALCITE-3760
URL: https://issues.apache.org/jira/browse/CALCITE-3760
Project: Calcite
Issue Type: Bug
Components: core
Reporter: Jin Xing
Assignee: Jin Xing
Calcite rewrites some *SqlFunctions* during validation, but it does not consider whether the function is deterministic. For a non-deterministic operator, the rewrite can break query semantics. Additionally, there is no interface for users to specify the determinism of a UDF/UDAF.
Say I have a non-deterministic UDF and UDAF and run SQL like the following:
{code:java}
select coalesce(udf(col0), 100) from foo;
select nullif(udaf(col0), 1024) from foo;{code}
They will be rewritten as
{code:java}
select case when udf(col0) is not null then udf(col0) else 100 end
from foo;
select case when udaf(col0)=1024 then null else udaf(col0) end
from foo;{code}
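As we can see, the non-deterministic UDF and UDAF are each called more than once after the rewrite. The value tested in the WHEN clause may therefore differ from the value returned by the THEN or ELSE branch, so the condition is not guaranteed to hold for the returned result.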
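The effect can be reproduced outside Calcite with a minimal Java sketch. The class and method names here (CoalesceRewriteDemo, coalesceOnce, coalesceRewritten, scripted) are hypothetical and are not Calcite code; the "UDF" is simulated by a supplier that replays a fixed script of results, so the non-determinism is repeatable.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.function.Supplier;

class CoalesceRewriteDemo {
    /** COALESCE(udf(), fallback) with the argument evaluated exactly once. */
    static Integer coalesceOnce(Supplier<Integer> udf, int fallback) {
        Integer v = udf.get();
        if (v != null) {
            return v;
        }
        return fallback;
    }

    /** CASE WHEN udf() IS NOT NULL THEN udf() ELSE fallback END: two call sites. */
    static Integer coalesceRewritten(Supplier<Integer> udf, int fallback) {
        if (udf.get() != null) {
            return udf.get(); // second call; may differ from the value just tested
        }
        return fallback;
    }

    /** Stand-in for a non-deterministic UDF: returns the given results in order. */
    static Supplier<Integer> scripted(Integer... results) {
        Iterator<Integer> it = Arrays.asList(results).iterator();
        return it::next;
    }

    public static void main(String[] args) {
        // The simulated UDF returns 7 on its first call and NULL on its second.
        System.out.println(coalesceOnce(scripted(7, null), 100));      // prints 7
        System.out.println(coalesceRewritten(scripted(7, null), 100)); // prints null
    }
}
```

With a single evaluation, COALESCE can never return NULL when the fallback is non-null; the two-call form can.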
We need to provide an interface for users to specify the determinism of a UDF/UDAF, and to consider whether a SqlNode is deterministic when rewriting.
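A rough sketch of what such a guard could look like, using plain Java strings in place of SqlNode trees. Everything here is an assumption for illustration: the Udf interface, its isDeterministic() method, and rewriteCoalesce are hypothetical names, not Calcite's API and not necessarily the shape of the eventual fix.

```java
class DeterminismGuardDemo {
    /** Hypothetical UDF contract; users override isDeterministic() when needed. */
    interface Udf {
        String name();

        /** True if repeated calls with the same input return the same result. */
        default boolean isDeterministic() {
            return true;
        }
    }

    /** Expands COALESCE(udf(arg), fallback) into CASE only when that is safe. */
    static String rewriteCoalesce(Udf udf, String arg, String fallback) {
        String call = udf.name() + "(" + arg + ")";
        if (!udf.isDeterministic()) {
            // Keep a single call site so the function is evaluated once.
            return "coalesce(" + call + ", " + fallback + ")";
        }
        return "case when " + call + " is not null then " + call
            + " else " + fallback + " end";
    }

    public static void main(String[] args) {
        Udf pure = () -> "abs"; // deterministic by default
        Udf random = new Udf() {
            @Override public String name() { return "udf"; }
            @Override public boolean isDeterministic() { return false; }
        };
        System.out.println(rewriteCoalesce(pure, "col0", "100"));
        System.out.println(rewriteCoalesce(random, "col0", "100"));
    }
}
```

Defaulting to deterministic keeps existing functions on the current rewrite path, while a function declared non-deterministic is left as a single call.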