Posted to issues-all@impala.apache.org by "Fucun Chu (Jira)" <ji...@apache.org> on 2021/05/26 15:53:00 UTC

[jira] [Created] (IMPALA-10718) Implement ds_tuple_arrayofdouble_sketch() and ds_tuple_arrayofdouble_estimate() functions

Fucun Chu created IMPALA-10718:
----------------------------------

             Summary: Implement ds_tuple_arrayofdouble_sketch() and ds_tuple_arrayofdouble_estimate() functions
                 Key: IMPALA-10718
                 URL: https://issues.apache.org/jira/browse/IMPALA-10718
             Project: IMPALA
          Issue Type: New Feature
          Components: Backend, Frontend
            Reporter: Fucun Chu
            Assignee: Fucun Chu


These should be built-in functions that use DataSketches functionality that was integrated by [IMPALA-10717|https://issues.apache.org/jira/browse/IMPALA-10717].

ds_tuple_arrayofdouble_sketch() should receive a primitive expression and a parameter, and return a sketch.
ds_tuple_arrayofdouble_estimate() should receive a sketch and return a primitive value: the cardinality estimate for the data that was provided to the sketch.

Usage:
{code:sql}
select ds_tuple_arrayofdouble_estimate(ds_tuple_arrayofdouble_sketch(col_name, parameter)) from table_name;
{code}
Returns a cardinality estimate for that particular column.
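
For readers unfamiliar with tuple sketches, the sketch-then-estimate flow can be illustrated with a toy model. This is only a rough sketch of the general idea (a theta-style sketch that keeps the k smallest key hashes and attaches an array of doubles to each retained key). It is not the DataSketches implementation and says nothing about how the Impala functions will be built. The class name TupleSketch and the arguments k and num_values are hypothetical, and no mapping to the "parameter" argument above is implied.

```python
import hashlib


class TupleSketch:
    """Toy tuple sketch: retains the k smallest key hashes, each with an
    array of doubles as its summary. Illustrative only."""

    def __init__(self, k=1024, num_values=1):
        self.k = k                    # maximum number of retained keys (hypothetical knob)
        self.num_values = num_values  # length of the double array per key
        self.entries = {}             # hash in [0, 1) -> list of summed doubles
        self.theta = 1.0              # only hashes below theta are retained

    @staticmethod
    def _hash(key):
        # Map the key to a pseudo-uniform float in [0, 1).
        digest = hashlib.sha256(repr(key).encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") / 2.0**64

    def update(self, key, values):
        if len(values) != self.num_values:
            raise ValueError("expected %d values" % self.num_values)
        h = self._hash(key)
        if h >= self.theta:
            return  # key falls outside the sampled range
        if h in self.entries:
            # Key already retained: add the new values to its summary.
            summary = self.entries[h]
            for i, v in enumerate(values):
                summary[i] += v
            return
        self.entries[h] = list(values)
        if len(self.entries) > self.k:
            # Too many keys: evict the largest hash and lower theta to it.
            largest = max(self.entries)
            del self.entries[largest]
            self.theta = largest

    def estimate(self):
        # Retained keys are a uniform sample with rate theta.
        return len(self.entries) / self.theta


sketch = TupleSketch(k=1024, num_values=2)
sketch.update("a", [1.0, 10.0])
sketch.update("b", [1.0, 20.0])
sketch.update("a", [2.0, 5.0])   # duplicate key: summary becomes [3.0, 15.0]
sketch.update("c", [1.0, 30.0])
print(sketch.estimate())         # prints 3.0 (exact while fewer than k keys are seen)
```

While fewer than k distinct keys have been seen, the estimate is exact. Beyond that, it is an approximation whose error shrinks as k grows.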

Hive change that introduced the same: https://issues.apache.org/jira/browse/HIVE-22940

https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/exec/DataSketchesFunctions.java



