You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Andrew Purtell (JIRA)" <ji...@apache.org> on 2014/05/12 02:37:14 UTC
[jira] [Commented] (HBASE-1811) Snapshot HFile and region
statistics at compaction time and make info available to clients
[ https://issues.apache.org/jira/browse/HBASE-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13994721#comment-13994721 ]
Andrew Purtell commented on HBASE-1811:
---------------------------------------
Is this a dup of HBASE-7958 (well, maybe I had the idea first, but it is execution that counts) or could it be a subtask?
Ping [~jesse_yates] [~lhofhansl] [~stack]
> Snapshot HFile and region statistics at compaction time and make info available to clients
> ------------------------------------------------------------------------------------------
>
> Key: HBASE-1811
> URL: https://issues.apache.org/jira/browse/HBASE-1811
> Project: HBase
> Issue Type: Improvement
> Reporter: Andrew Purtell
> Priority: Minor
>
> Consider snapshotting HFile and region statistics at major and minor compaction time and making the info available to clients:
> * Key statistics
> ** cardinality
> ** length avg/min/max/stdev
> ** information content measure (entropy, etc.)
> ** histogram
> etc.
> * Value statistics
> ** length avg/min/max/stdev
> ** information content measure (entropy, etc.)
> ** histogram
> etc.
> * Region statistics
> ** density estimation
> ** KV count
> ** total storage size (on disk)
> ** total storage size (uncompressed)
> etc.
--
This message was sent by Atlassian JIRA
(v6.2#6252)