You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@druid.apache.org by GitBox <gi...@apache.org> on 2019/01/18 20:08:19 UTC

[GitHub] leerho commented on issue #6853: horrible histogram post-aggregation

leerho commented on issue #6853: horrible histogram post-aggregation 
URL: https://github.com/apache/incubator-druid/issues/6853#issuecomment-455671914
 
 
   @quenlang 
   
   There are very good reasons why it is not a good idea for a quantiles (or histogram) sketch to "interpolate" between actual values in the data.  Fundamentally, in order for the sketch to be independent of the input data distribution, it cannot make any assumptions about what that distribution might be.  And, in order to "interpolate", one must make assumptions about the input data distribution.  For example, interpolating using linear, power-law, Gaussian, or discontinuous step distributions would produce wildly different results. Therefore it is best to not to make any assumptions about the input data distribution and only return values actually sampled from the real data. 

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@druid.apache.org
For additional commands, e-mail: commits-help@druid.apache.org