You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Andrew Purtell (JIRA)" <ji...@apache.org> on 2009/09/03 01:19:32 UTC

[jira] Created: (HBASE-1811) Snapshot HFile and region statistics at compaction time and make info available to clients

Snapshot HFile and region statistics at compaction time and make info available to clients
------------------------------------------------------------------------------------------

                 Key: HBASE-1811
                 URL: https://issues.apache.org/jira/browse/HBASE-1811
             Project: Hadoop HBase
          Issue Type: Improvement
            Reporter: Andrew Purtell
            Priority: Minor


Consider snapshotting HFile and region statistics at major and minor compaction time and making the info available to clients:

* Key statistics
 ** cardinality
 ** length avg/min/max/stdev
 ** information content measure (entropy, etc.)
 ** histogram
etc.

* Value statistics
 ** length avg/min/max/stdev
 ** information content measure (entropy, etc.)
 ** histogram
etc.

* Region statistics
 ** density estimation
 ** KV count
 ** total storage size (on disk)
 ** total storage size (uncompressed)
etc. 


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.