Posted to issues@hive.apache.org by "Pablo Junge (Jira)" <ji...@apache.org> on 2019/12/05 22:33:00 UTC
[jira] [Updated] (HIVE-22587) hive.stats.ndv.error parameter documentation issue in HiveConf.java
[ https://issues.apache.org/jira/browse/HIVE-22587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Pablo Junge updated HIVE-22587:
-------------------------------
Attachment: HIVE-22587.patch
Status: Patch Available (was: Open)
Changed definition of HIVE_STATS_NDV_ERROR
HIVE_STATS_NDV_ERROR("hive.stats.ndv.error", (float)20.0,
"The standard error allowed for NDV estimates, expressed as a percentage. This provides a tradeoff \n" +
"between accuracy and compute cost. A lower value for the error indicates higher accuracy and a \n" +
"higher compute cost. (NDV means the number of distinct values.) It only affects the FM-Sketch \n" +
"algorithm (not the HLL algorithm, which is the default), where it determines the number of \n" +
"bitvectors needed to achieve the accuracy.")
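
To illustrate why a lower error costs more with FM-Sketch, here is a minimal sketch in Java. It is not Hive's implementation. It assumes the textbook Flajolet-Martin (PCSA) approximation that the standard error with m bitvectors is roughly 0.78 / sqrt(m), and it rounds m up to a power of two. The class name, the method name, and the cap of 1024 bitvectors are hypothetical and chosen only for this example.

```java
public class NdvErrorSketch {
    // Assumption: standard error of an FM sketch with m bitvectors is about 0.78 / sqrt(m).
    private static final double FM_ERROR_CONSTANT_PERCENT = 78.0;
    // Hypothetical upper bound on the number of bitvectors, used only in this example.
    private static final int MAX_BIT_VECTORS = 1024;

    /**
     * Returns the smallest power-of-two number of bitvectors whose approximate
     * standard error (in percent) does not exceed percentError, capped at
     * MAX_BIT_VECTORS.
     */
    static int bitVectorsFor(double percentError) {
        if (!(percentError > 0.0)) {
            throw new IllegalArgumentException("percentError must be positive");
        }
        int m = 1;
        // Double m until the approximate error is within the requested bound.
        while (m < MAX_BIT_VECTORS
                && FM_ERROR_CONSTANT_PERCENT / Math.sqrt(m) > percentError) {
            m *= 2;
        }
        return m;
    }

    public static void main(String[] args) {
        // 20.0 is the default value of hive.stats.ndv.error shown above.
        for (double err : new double[] {20.0, 10.0, 5.0}) {
            System.out.println(err + "% -> " + bitVectorsFor(err) + " bitvectors");
        }
    }
}
```

Under this approximation, halving the allowed error roughly quadruples the number of bitvectors, which is the accuracy versus compute cost tradeoff the description refers to.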
> hive.stats.ndv.error parameter documentation issue in HiveConf.java
> -------------------------------------------------------------------
>
> Key: HIVE-22587
> URL: https://issues.apache.org/jira/browse/HIVE-22587
> Project: Hive
> Issue Type: Improvement
> Components: Hive
> Reporter: Pablo Junge
> Assignee: Pablo Junge
> Priority: Minor
> Attachments: HIVE-22587.patch
>
>
> The hive.stats.ndv.error parameter documentation should specify that it only affects the FM-Sketch algorithm.