You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@lucene.apache.org by jb...@apache.org on 2019/10/08 13:07:42 UTC

[lucene-solr] branch SOLR-13105-visual updated: SOLR-13105: Inprove correlation docs 5

This is an automated email from the ASF dual-hosted git repository.

jbernste pushed a commit to branch SOLR-13105-visual
in repository https://gitbox.apache.org/repos/asf/lucene-solr.git


The following commit(s) were added to refs/heads/SOLR-13105-visual by this push:
     new 3ab7104  SOLR-13105: Inprove correlation docs 5
3ab7104 is described below

commit 3ab7104fd76929094fa38308a0c4a2187b8353b4
Author: Joel Bernstein <jb...@apache.org>
AuthorDate: Tue Oct 8 09:07:35 2019 -0400

    SOLR-13105: Inprove correlation docs 5
---
 solr/solr-ref-guide/src/statistics.adoc | 14 +++++++-------
 1 file changed, 7 insertions(+), 7 deletions(-)

diff --git a/solr/solr-ref-guide/src/statistics.adoc b/solr/solr-ref-guide/src/statistics.adoc
index e1fc3b0..2df24f8 100644
--- a/solr/solr-ref-guide/src/statistics.adoc
+++ b/solr/solr-ref-guide/src/statistics.adoc
@@ -388,17 +388,17 @@ The `corr` function builds a correlation matrix
 if a matrix is passed as a parameter. The correlation matrix is computed by correlating the *columns*
 of the matrix.
 
-The example below demonstrates power of correlation matrices.
+The example below demonstrates the power of correlation matrices combined with 2 dimensional faceting.
 
 In this example the `facet2D` function is used to generate a two dimensional facet aggregation
-over the *complaint_type_s* field and the *zip_s* field in the *nyc311* complaints collection.
-In this example the top 20 complaint types and the top 20 zip codes for each complaint type is
-calculated. This returns a stream of tuples each containing a *complaint_type_s*, *zip_s* and
-along with the count for the pair.
+over the fields *complaint_type_s* field and *zip_s* from the *nyc311* complaints database.
+The *top 20* complaint types and the *top 25* zip codes for each complaint type are aggregated.
+The result is a stream of tuples each containing fields *complaint_type_s*, *zip_s* and
+the count for the pair.
 
 The `pivot` function is then used to pivot the fields into a *matrix* with the *zip_s*
-field as the *rows* and the *complaint_type* fields as the *columns*. The `count(*)` field populates
-the values inside the cells of the matrix.
+field as the *rows* and the *complaint_type_s* field as the *columns*. The `count(*)` field populates
+the values in the cells of the matrix.
 
 The `corr` function is then used correlate the *rows* of the matrix. This produces a correlation matrix
 shows how complaint types are correlated based on the zip codes they appear in. Another way to look at this