You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@datasketches.apache.org by le...@apache.org on 2020/10/22 17:29:36 UTC

[incubator-datasketches-website] branch master updated: Corrections.

This is an automated email from the ASF dual-hosted git repository.

leerho pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-datasketches-website.git


The following commit(s) were added to refs/heads/master by this push:
     new 7799f52  Corrections.
7799f52 is described below

commit 7799f529b806d52d60903be5738abc650219f66d
Author: Lee Rhodes <le...@users.noreply.github.com>
AuthorDate: Thu Oct 22 10:29:10 2020 -0700

    Corrections.
---
 docs/REQ/ReqAccuracyAdversarial.md | 9 ++-------
 1 file changed, 2 insertions(+), 7 deletions(-)

diff --git a/docs/REQ/ReqAccuracyAdversarial.md b/docs/REQ/ReqAccuracyAdversarial.md
index 2d48158..193fca7 100644
--- a/docs/REQ/ReqAccuracyAdversarial.md
+++ b/docs/REQ/ReqAccuracyAdversarial.md
@@ -21,13 +21,9 @@ layout: doc_page
 -->
 # ReqSketch Accuracy with Adversarial Streams
 
-This set of tests characterize the accuracy (or more precisely the rank error) of the ReqSketch using specifically selected adversarial streams.  The goal of this suite of tests is to understand how the rank error of the sketch behaves across all ranks with these specific stream patterns.  All of these tests are run with the same configuration except for the choice of the adversarial stream pattern.
+This set of tests characterize the accuracy (or more precisely the rank error) of the ReqSketch using specifically selected adversarial stream patterns.  The goal of this suite of tests is to understand how the rank error of the sketch behaves across all ranks with these specific stream patterns.  All of these tests are run with the same sketch configuration except for the choice of the adversarial stream pattern.
 
-The design of these tests is quite different from the tests for the *Random Shuffled Streams*.  Here, each test has one pattern and running multiple trials on the same pattern will not produce a nice distribution of error that we can easily analyze. We would like to capture the ranks where the pattern creates the largest error. These aberrant ranks could occur anywhere in the stream.  Instead of choosing 100 plot points where the error is exclusively measured, we want to measure the erro [...]
-
-In this case we collect the statistics of all the errors in 100 contiguous intervals of the stream. For a stream length of 2^20, each interval consists of about ten thousand values.  The errors from these 10K values are fed into a standard quantile sketch as before, and we extract 3 statical quantile points, -3SD, median and +3SD, and plot those 3 values at each of the 100 plot points. 
-
-As you can see, some of these patterns challenge our current a priori calculation of the error bounds, which means we may need to adjust them somewhat. If we do, these plots will be regenerated. 
+The design of these tests is different from the tests for the *Random Shuffled Streams* in one key aspect, we do not shuffle the input stream for each trial.  Here, each test has one pattern and runs multiple trials with exactly the same pattern. The plots then reveal the results of the random process of the sketch itself.  
 
 For those that are interested in the actual code that run these tests can examine the following links.
  
@@ -62,7 +58,6 @@ For those that are interested in the actual code that run these tests can examin
 
 # Results
 
-
 ## Plot 1 Adversarial Pattern: Sorted
 
 <img class="doc-img-qtr" src="{{site.docs_img_dir}}/req/SortedPattern.png" alt="/req/SortedPattern.png" />


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@datasketches.apache.org
For additional commands, e-mail: commits-help@datasketches.apache.org