You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@calcite.apache.org by "Jin Xing (Jira)" <ji...@apache.org> on 2020/01/30 13:51:00 UTC

[jira] [Created] (CALCITE-3760) Rewriting function without considering determinism can break query semantics

Jin Xing created CALCITE-3760:
---------------------------------

             Summary: Rewriting function without considering determinism can break query semantics
                 Key: CALCITE-3760
                 URL: https://issues.apache.org/jira/browse/CALCITE-3760
             Project: Calcite
          Issue Type: Bug
          Components: core
            Reporter: Jin Xing
            Assignee: Jin Xing


Calcite rewrite some *SqlFunctions* during validation. But whether the function is deterministic is not considered. For a non-deterministic operator, the rewriting can break semantics. Additionally there's no interface for user to specify the determinism for a UDF/UDAF. 

Say I have non-deterministic UDF & UDAF and run sql like below
{code:java}
select coalesce(udf(col0), 100) from foo;
select nullif(udaf(col0), 1024) from foo;{code}
They will be rewritten as
{code:java}
select case when udf(col0) is not null then udf(col0) else 100 end
from foo;

select case when udaf(col0)=1024 then null udaf(col0)
from foo{code}
As we can see that non-deterministic UDF & UDAF are called multiple times after written. Thus the condition in WHEN clause might NOT be held all the time.

We need to provide an interface for user to specify the determinism in UDF/UDAF and consider whether a SqlNode is deterministic when rewriting.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)