Posted to dev@hive.apache.org by "Gopal V (JIRA)" <ji...@apache.org> on 2014/07/14 23:36:04 UTC
[jira] [Created] (HIVE-7402) add `approx_distinct` & composable nDV UDAFs
Gopal V created HIVE-7402:
-----------------------------
Summary: add `approx_distinct` & composable nDV UDAFs
Key: HIVE-7402
URL: https://issues.apache.org/jira/browse/HIVE-7402
Project: Hive
Issue Type: New Feature
Reporter: Gopal V
Build composable approximate-distinct UDAFs into Hive.
This is useful for approximate queries, particularly for collapsing per-partition partial nDV (number of distinct values) results into a single estimate whenever a partition is added.
{code}
hive> select approx_distinct(ss_item_sk), approx_distinct(ss_quantity) from tpcds_orc_10000.store_sales;
OK
403760 100
Time taken: 238.258 seconds, Fetched: 1 row(s)
{code}
Prototype Hive UDAFs/UDFs are at https://github.com/t3rmin4t0r/hive-hll-udf/
They use [~prasanth_j]'s fast HLL++ implementation for the horsepower.
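To illustrate what "composable" means here, the following is a minimal sketch of a HyperLogLog-style structure whose partial results can be merged. It is not the prototype's code and not the HLL++ implementation mentioned above: the class name `HLLSketch`, the helper `sketch_of`, the SHA-1 based hash, and the default precision `p=12` are all assumptions chosen for illustration, and the estimator is the plain HyperLogLog formula without HLL++ bias correction. The point it shows is that merging two partial sketches (register-wise max) gives exactly the sketch that a single pass over the combined data would have produced.

```python
import hashlib
import math


class HLLSketch:
    """Toy HyperLogLog sketch (hypothetical, for illustration only)."""

    def __init__(self, p=12):
        self.p = p                      # index bits; keep p >= 7 for the alpha constant below
        self.m = 1 << p                 # number of registers
        self.registers = [0] * self.m

    def add(self, value):
        # Derive a 64-bit hash from the first 8 bytes of SHA-1 (an assumption;
        # real implementations typically use a faster non-cryptographic hash).
        digest = hashlib.sha1(str(value).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        idx = h >> (64 - self.p)                    # top p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)       # remaining 64 - p bits
        # 1-based position of the leftmost 1-bit in `rest`
        # (64 - p + 1 when `rest` is zero).
        rank = (64 - self.p) - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def merge(self, other):
        # Composability: the merged sketch is the register-wise maximum.
        if self.p != other.p:
            raise ValueError("sketches must use the same precision")
        merged = HLLSketch(self.p)
        merged.registers = [max(a, b) for a, b in zip(self.registers, other.registers)]
        return merged

    def estimate(self):
        # Plain HyperLogLog estimate with the small-range (linear counting) correction.
        alpha = 0.7213 / (1 + 1.079 / self.m)
        raw = alpha * self.m * self.m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros > 0:
            return self.m * math.log(self.m / zeros)
        return raw


def sketch_of(values, p=12):
    """Build a sketch over an iterable of values."""
    s = HLLSketch(p)
    for v in values:
        s.add(v)
    return s


# Two overlapping "partitions" and a single pass over their union.
part_a = sketch_of(range(0, 60000))
part_b = sketch_of(range(40000, 100000))
whole = sketch_of(range(0, 100000))

merged = part_a.merge(part_b)
print(merged.registers == whole.registers)   # merged partials match the single-pass sketch
print(round(merged.estimate()))              # approximate count of the 100000 distinct values
```

Because the merge is a register-wise max, it is associative and commutative, so a sketch stored per partition can be folded into the existing result when a new partition arrives, in any order.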
--
This message was sent by Atlassian JIRA
(v6.2#6252)