Posted to issues@hivemall.apache.org by "Makoto Yui (JIRA)" <ji...@apache.org> on 2017/11/16 07:22:00 UTC

[jira] [Assigned] (HIVEMALL-18) Support approx_count UDAF using HyperLogLog

     [ https://issues.apache.org/jira/browse/HIVEMALL-18?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Makoto Yui reassigned HIVEMALL-18:
----------------------------------

    Assignee: Makoto Yui

> Support approx_count UDAF using HyperLogLog
> -------------------------------------------
>
>                 Key: HIVEMALL-18
>                 URL: https://issues.apache.org/jira/browse/HIVEMALL-18
>             Project: Hivemall
>          Issue Type: Sub-task
>            Reporter: Makoto Yui
>            Assignee: Makoto Yui
>            Priority: Minor
>
> https://github.com/addthis/stream-lib could be used as the underlying library.
> http://www.slideshare.net/bzamecnik/hyperloglog-in-hive-how-to-count-sheep-efficiently
> https://databricks.com/blog/2016/05/19/approximate-algorithms-in-apache-spark-hyperloglog-and-quantiles.html
> Several HLL implementations exist as Hive UDAFs:
> https://github.com/MLnick/hive-udf/wiki
> https://github.com/klout/brickhouse



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)