You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@druid.apache.org by GitBox <gi...@apache.org> on 2019/01/18 20:08:19 UTC
[GitHub] leerho commented on issue #6853: horrible histogram
post-aggregation
leerho commented on issue #6853: horrible histogram post-aggregation
URL: https://github.com/apache/incubator-druid/issues/6853#issuecomment-455671914
@quenlang
There are very good reasons why it is not a good idea for a quantiles (or histogram) sketch to "interpolate" between actual values in the data. Fundamentally, in order for the sketch to be independent of the input data distribution, it cannot make any assumptions about what that distribution might be. And, in order to "interpolate", one must make assumptions about the input data distribution. For example, interpolating using linear, power-law, Gaussian, or discontinuous step distributions would produce wildly different results. Therefore it is best to not to make any assumptions about the input data distribution and only return values actually sampled from the real data.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
With regards,
Apache Git Services
---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@druid.apache.org
For additional commands, e-mail: commits-help@druid.apache.org