Posted to issues@hive.apache.org by "Pablo Junge (Jira)" <ji...@apache.org> on 2019/12/05 22:33:00 UTC

[jira] [Updated] (HIVE-22587) hive.stats.ndv.error parameter documentation issue in HiveConf.java

     [ https://issues.apache.org/jira/browse/HIVE-22587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pablo Junge updated HIVE-22587:
-------------------------------
    Attachment: HIVE-22587.patch
        Status: Patch Available  (was: Open)

Changed the definition of HIVE_STATS_NDV_ERROR to:

HIVE_STATS_NDV_ERROR("hive.stats.ndv.error", (float)20.0,
 "The standard error allowed for NDV estimates, expressed in percentage. This provides a tradeoff \n" +
 "between accuracy and compute cost. A lower value for the error indicates higher accuracy and a \n" +
 "higher compute cost. (NDV means the number of distinct values.) It only affects the FM-Sketch \n" +
 "(not the HLL algorithm which is the default), where it computes the number of necessary \n" +
 "bitvectors to achieve the accuracy.")
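
For readers unfamiliar with how an error target can translate into a bit-vector count, here is a rough sketch. It is not Hive's implementation. It assumes the textbook Flajolet-Martin (PCSA) approximation that the standard error with m bit vectors is about 0.78 / sqrt(m), and rounds the result up to a power of two. The class name, method name, and the upper cap are hypothetical and chosen only for illustration.

```java
// Illustrative only: NOT the code Hive uses to size FM-Sketch bit vectors.
class NdvErrorSketch {
    // Assumption: FM (PCSA) standard error is roughly 0.78 / sqrt(m).
    static final double FM_ERROR_CONSTANT = 0.78;
    // Hypothetical cap so the loop below always terminates for tiny error targets.
    static final int MAX_BIT_VECTORS = 1 << 20;

    // Returns the smallest power of two m with 0.78 / sqrt(m) <= percentError / 100.
    static int bitVectorsFor(double percentError) {
        if (percentError <= 0.0) {
            throw new IllegalArgumentException("error must be positive");
        }
        double fraction = percentError / 100.0;
        double needed = Math.pow(FM_ERROR_CONSTANT / fraction, 2);
        int m = 1;
        while (m < needed && m < MAX_BIT_VECTORS) {
            m <<= 1; // double the bit-vector count until the target is met
        }
        return m;
    }

    public static void main(String[] args) {
        for (double e : new double[] {20.0, 10.0, 5.0}) {
            System.out.println(e + "% -> " + bitVectorsFor(e) + " bit vectors");
        }
    }
}
```

Under this approximation, halving the allowed error roughly quadruples the number of bit vectors, which is the accuracy versus compute-cost tradeoff the description refers to.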

> hive.stats.ndv.error parameter documentation issue in HiveConf.java
> -------------------------------------------------------------------
>
>                 Key: HIVE-22587
>                 URL: https://issues.apache.org/jira/browse/HIVE-22587
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>            Reporter: Pablo Junge
>            Assignee: Pablo Junge
>            Priority: Minor
>         Attachments: HIVE-22587.patch
>
>
> The hive.stats.ndv.error parameter documentation should specify that it only affects the FM-Sketch algorithm.


