You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@metron.apache.org by simonellistonball <gi...@git.apache.org> on 2017/12/13 21:18:08 UTC

[GitHub] metron issue #867: METRON-1350: Add reservoir sampling functions to Stellar

Github user simonellistonball commented on the issue:

    https://github.com/apache/metron/pull/867
  
    Should the size limit on the sample really be a cut off? In a likely usage scenario a users would sample over a window in a profile. Limiting the size is likely to skew to time at the beginning of the window rather than being genuinely uniform. Would a random replacement strategy make more sense when over the limit? This could be a lot heavier in terms of performance, but may be more mathematically sound.


---