Posted to commits@cassandra.apache.org by "Jonathan Ellis (JIRA)" <ji...@apache.org> on 2011/02/24 00:22:39 UTC

[jira] Created: (CASSANDRA-2232) Clean up and document EstimatedHistogram

Clean up and document EstimatedHistogram
----------------------------------------

                 Key: CASSANDRA-2232
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2232
             Project: Cassandra
          Issue Type: Improvement
            Reporter: Jonathan Ellis
            Assignee: Jonathan Ellis
            Priority: Minor
             Fix For: 0.7.3


EstimatedHistogram treats adding value n as adding a value infinitesimally greater than n.  This barely made sense for the original goal of latency tracking but is clearly broken for inherently integral data like sstables-per-read.
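
To illustrate the boundary problem, here is a minimal sketch that is not the actual EstimatedHistogram code. The bucket offsets, class name, and method names are all hypothetical. It contrasts a lookup where an exact match stays in the bucket whose upper bound is n with a lookup that behaves as if n were slightly greater than n.

```java
import java.util.Arrays;

final class BucketSketch {
    // Hypothetical bucket upper bounds, chosen only for illustration.
    static final long[] OFFSETS = {1, 2, 3, 4, 5};

    // Inclusive upper bound: bucket i holds values v with OFFSETS[i-1] < v <= OFFSETS[i].
    static int inclusiveIndex(long n) {
        int i = Arrays.binarySearch(OFFSETS, n);
        // A negative result encodes the insertion point as -(point) - 1.
        return i >= 0 ? i : -i - 1;
    }

    // Exclusive upper bound: an exact match on an offset is pushed into the
    // next bucket, as if n were infinitesimally greater than n.
    static int exclusiveIndex(long n) {
        int i = Arrays.binarySearch(OFFSETS, n);
        return i >= 0 ? i + 1 : -i - 1;
    }

    public static void main(String[] args) {
        for (long n = 0; n <= 5; n++) {
            System.out.println(n + ": inclusive=" + inclusiveIndex(n)
                    + " exclusive=" + exclusiveIndex(n));
        }
    }
}
```

With these assumed offsets, a read that touched exactly 2 sstables is counted at index 1 by the inclusive lookup but at index 2 (the bucket bounded by 3) by the exclusive one, which is the misreporting described above for integral data.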

Also, median() is broken, and even a correct median() would be the wrong statistic for mean row size reporting, which is its only caller.
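
As a rough illustration of why a median cannot stand in for a mean, the sketch below computes both over a small, made-up set of row sizes. The class, methods, and data are hypothetical and unrelated to the real EstimatedHistogram API; both methods assume a non-empty array.

```java
import java.util.Arrays;

final class MeanVsMedian {
    // Arithmetic mean; assumes values is non-empty.
    static double mean(long[] values) {
        long sum = 0;
        for (long v : values) {
            sum += v;
        }
        return (double) sum / values.length;
    }

    // Median of a sorted copy (upper median for even lengths); assumes values is non-empty.
    static long median(long[] values) {
        long[] sorted = values.clone();
        Arrays.sort(sorted);
        return sorted[sorted.length / 2];
    }

    public static void main(String[] args) {
        // Made-up row sizes: four small rows and one large one.
        long[] rowSizes = {1, 1, 1, 1, 96};
        System.out.println("mean=" + mean(rowSizes) + " median=" + median(rowSizes));
    }
}
```

For this skewed sample the mean is 20.0 while the median is 1, so reporting the median under the label "mean row size" would understate the figure by a wide margin.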

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Updated: (CASSANDRA-2232) Clean up and document EstimatedHistogram

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis updated CASSANDRA-2232:
--------------------------------------

    Remaining Estimate: 2h
     Original Estimate: 2h


[jira] Updated: (CASSANDRA-2232) Clean up and document EstimatedHistogram

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis updated CASSANDRA-2232:
--------------------------------------

    Attachment: 2232.txt

Fixes the problems described and adds documentation.


[jira] Commented: (CASSANDRA-2232) Clean up and document EstimatedHistogram

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-2232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12999400#comment-12999400 ] 

Hudson commented on CASSANDRA-2232:
-----------------------------------

Integrated in Cassandra-0.7 #321 (See [https://hudson.apache.org/hudson/job/Cassandra-0.7/321/])
    
