Posted to reviews@spark.apache.org by sethah <gi...@git.apache.org> on 2018/01/19 18:57:04 UTC

[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass summary example...

GitHub user sethah opened a pull request:

    https://github.com/apache/spark/pull/20332

    [SPARK-23138][ML][DOC] Multiclass summary example and user guide

    ## What changes were proposed in this pull request?
    
    The user guide and examples are updated to cover the multiclass logistic regression summary, which was added in [SPARK-17139](https://issues.apache.org/jira/browse/SPARK-17139).
    
    Rather than create a separate summary example, I added the summary code to the existing multiclass example; a standalone example for the summary did not seem necessary.
    
    ## How was this patch tested?
    
    Docs and examples only. Ran all examples locally using spark-submit.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/sethah/spark multiclass_summary_example

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20332.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20332
    
----
commit 9299fc83d2edab956bd13b2e1c985f64dcd2643e
Author: sethah <sh...@...>
Date:   2018-01-19T17:52:10Z

    adding examples for python, scala, and java

commit bf076ed09abb3bb474e0925b3b9c4dbc6e90771a
Author: sethah <sh...@...>
Date:   2018-01-19T18:43:01Z

    use binaryTrainingSummary

commit d0aa9f19550deb620e515ec33004be365c5439be
Author: sethah <sh...@...>
Date:   2018-01-19T18:46:16Z

    import cleanup

commit cb6c811e98d9739a7c1608880b2d0037cdeb5990
Author: sethah <sh...@...>
Date:   2018-01-19T18:51:28Z

    clarify user guide

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by sethah <gi...@git.apache.org>.
Github user sethah commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164151687
  
    --- Diff: examples/src/main/scala/org/apache/spark/examples/ml/MulticlassLogisticRegressionWithElasticNetExample.scala ---
    @@ -49,6 +49,48 @@ object MulticlassLogisticRegressionWithElasticNetExample {
         // Print the coefficients and intercept for multinomial logistic regression
         println(s"Coefficients: \n${lrModel.coefficientMatrix}")
         println(s"Intercepts: \n${lrModel.interceptVector}")
    +
    +    val trainingSummary = lrModel.summary
    +
    +    val objectiveHistory = trainingSummary.objectiveHistory
    --- End diff --
    
    Done


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/48/
    Test PASSed.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by sethah <gi...@git.apache.org>.
Github user sethah commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164476639
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -125,7 +123,8 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionSummary)
     provides a summary for a
     [`LogisticRegressionModel`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionModel).
    -Currently, only binary classification is supported. Support for multiclass model summaries will be added in the future.
    +In the case of binary classification, certain additional metrics are
    --- End diff --
    
    There isn't a `binarySummary` method for python


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    **[Test build #86397 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86397/testReport)** for PR 20332 at commit [`58d973e`](https://github.com/apache/spark/commit/58d973e204bd62128567fd3dfb2e5a335ac46bf1).


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by sethah <gi...@git.apache.org>.
Github user sethah commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164151869
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -97,10 +97,6 @@ only available on the driver.
     [`LogisticRegressionTrainingSummary`](api/scala/index.html#org.apache.spark.ml.classification.LogisticRegressionTrainingSummary)
     provides a summary for a
     [`LogisticRegressionModel`](api/scala/index.html#org.apache.spark.ml.classification.LogisticRegressionModel).
    -Currently, only binary classification is supported and the
    --- End diff --
    
    Done.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/20332


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Merged build finished. Test PASSed.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164654897
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -111,10 +110,9 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/LogisticRegressionTrainingSummary.html)
     provides a summary for a
     [`LogisticRegressionModel`](api/java/org/apache/spark/ml/classification/LogisticRegressionModel.html).
    -Currently, only binary classification is supported and the
    -summary must be explicitly cast to
    -[`BinaryLogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/BinaryLogisticRegressionTrainingSummary.html). 
    -Support for multiclass model summaries will be added in the future.
    +In the case of binary classification, certain additional metrics are
    --- End diff --
    
    I'm ambivalent - I think it is fairly clear through the phrasing "additional metrics are available...", and in the API doc link provided.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by WeichenXu123 <gi...@git.apache.org>.
Github user WeichenXu123 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164531329
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -111,10 +110,9 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/LogisticRegressionTrainingSummary.html)
     provides a summary for a
     [`LogisticRegressionModel`](api/java/org/apache/spark/ml/classification/LogisticRegressionModel.html).
    -Currently, only binary classification is supported and the
    -summary must be explicitly cast to
    -[`BinaryLogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/BinaryLogisticRegressionTrainingSummary.html). 
    -Support for multiclass model summaries will be added in the future.
    +In the case of binary classification, certain additional metrics are
    --- End diff --
    
    Oh no. Just add a sentence to make it more clear, like:
    "In the case of binary classification, `BinaryLogisticRegressionTrainingSummary` inherits all metrics in `LogisticRegressionSummary`, and certain additional metrics are added ..."
    Just a minor suggestion -:)
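
The inheritance relationship described in this suggestion can be pictured with a small, Spark-free sketch. The class names `SummarySketch` and `BinarySummarySketch` and the metric `positive_recall` are hypothetical stand-ins chosen for illustration; they are not Spark's classes or its API. The point is only that the binary variant keeps every general metric and adds binary-only ones.

```python
class SummarySketch:
    """Hypothetical stand-in for a general (multiclass) summary."""

    def __init__(self, pairs):
        # pairs is a list of (prediction, label) tuples
        self.pairs = pairs

    @property
    def accuracy(self):
        # Fraction of rows where the prediction equals the label
        correct = sum(1 for p, l in self.pairs if p == l)
        return correct / len(self.pairs)


class BinarySummarySketch(SummarySketch):
    """Hypothetical binary summary: inherits accuracy, adds one extra metric."""

    @property
    def positive_recall(self):
        # Recall of label 1; only meaningful when labels are 0/1
        positives = [p for p, l in self.pairs if l == 1]
        return sum(1 for p in positives if p == 1) / len(positives)


summary = BinarySummarySketch([(1, 1), (0, 1), (0, 0), (1, 0)])
print(summary.accuracy)         # inherited from the general summary
print(summary.positive_recall)  # available only on the binary summary
```

The sketch assumes non-empty input with at least one positive label; a real implementation would need to handle those edge cases.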


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/47/
    Test PASSed.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass summary example and us...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    **[Test build #86396 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86396/testReport)** for PR 20332 at commit [`cb6c811`](https://github.com/apache/spark/commit/cb6c811e98d9739a7c1608880b2d0037cdeb5990).


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164479596
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -125,7 +123,8 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionSummary)
     provides a summary for a
     [`LogisticRegressionModel`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionModel).
    -Currently, only binary classification is supported. Support for multiclass model summaries will be added in the future.
    +In the case of binary classification, certain additional metrics are
    --- End diff --
    
    Ah right! Missed that.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r162873036
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -97,10 +97,6 @@ only available on the driver.
     [`LogisticRegressionTrainingSummary`](api/scala/index.html#org.apache.spark.ml.classification.LogisticRegressionTrainingSummary)
     provides a summary for a
     [`LogisticRegressionModel`](api/scala/index.html#org.apache.spark.ml.classification.LogisticRegressionModel).
    -Currently, only binary classification is supported and the
    --- End diff --
    
    Should we add a note reflecting the difference between the summary and binary summary? Perhaps indicating the usage of `binarySummary` or `asBinary` method?
    
    I know it's done in the example but perhaps a short line about that.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Merged build finished. Test PASSed.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r162873388
  
    --- Diff: examples/src/main/scala/org/apache/spark/examples/ml/MulticlassLogisticRegressionWithElasticNetExample.scala ---
    @@ -49,6 +49,48 @@ object MulticlassLogisticRegressionWithElasticNetExample {
         // Print the coefficients and intercept for multinomial logistic regression
         println(s"Coefficients: \n${lrModel.coefficientMatrix}")
         println(s"Intercepts: \n${lrModel.interceptVector}")
    +
    +    val trainingSummary = lrModel.summary
    +
    +    val objectiveHistory = trainingSummary.objectiveHistory
    --- End diff --
    
    ditto here for the comment to be consistent with Java / Python versions


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    **[Test build #86713 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86713/testReport)** for PR 20332 at commit [`ac7a4ae`](https://github.com/apache/spark/commit/ac7a4aeb1c2f76e25f611c167ab8726069589a3e).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Merged build finished. Test PASSed.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by sethah <gi...@git.apache.org>.
Github user sethah commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164151731
  
    --- Diff: examples/src/main/python/ml/multiclass_logistic_regression_with_elastic_net.py ---
    @@ -43,6 +43,43 @@
         # Print the coefficients and intercept for multinomial logistic regression
         print("Coefficients: \n" + str(lrModel.coefficientMatrix))
         print("Intercept: " + str(lrModel.interceptVector))
    +
    +    trainingSummary = lrModel.summary
    +
    +    # Obtain the objective per iteration
    +    objectiveHistory = trainingSummary.objectiveHistory
    +    print("objectiveHistory:")
    +    for objective in objectiveHistory:
    +        print(objective)
    +
    +    print("False positive rate by label:")
    --- End diff --
    
    Done


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    **[Test build #86396 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86396/testReport)** for PR 20332 at commit [`cb6c811`](https://github.com/apache/spark/commit/cb6c811e98d9739a7c1608880b2d0037cdeb5990).
     * This patch **fails Python style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    **[Test build #86713 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86713/testReport)** for PR 20332 at commit [`ac7a4ae`](https://github.com/apache/spark/commit/ac7a4aeb1c2f76e25f611c167ab8726069589a3e).


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r162872261
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -125,7 +117,6 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionSummary)
     provides a summary for a
     [`LogisticRegressionModel`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionModel).
    --- End diff --
    
    Shall we just add a short line to the `Example` section of MLoR:
    
    "The following example shows how to train a multiclass logistic regression model with elastic net regularization, as well as extract the multiclass training summary." 
    
    or something like that.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Merged build finished. Test PASSed.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164384660
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -111,10 +110,9 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/LogisticRegressionTrainingSummary.html)
     provides a summary for a
     [`LogisticRegressionModel`](api/java/org/apache/spark/ml/classification/LogisticRegressionModel.html).
    -Currently, only binary classification is supported and the
    -summary must be explicitly cast to
    -[`BinaryLogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/BinaryLogisticRegressionTrainingSummary.html). 
    -Support for multiclass model summaries will be added in the future.
    +In the case of binary classification, certain additional metrics are
    --- End diff --
    
    What do you mean exactly? Do you propose to list the metrics in the user guide?


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r162873193
  
    --- Diff: examples/src/main/python/ml/multiclass_logistic_regression_with_elastic_net.py ---
    @@ -43,6 +43,43 @@
         # Print the coefficients and intercept for multinomial logistic regression
         print("Coefficients: \n" + str(lrModel.coefficientMatrix))
         print("Intercept: " + str(lrModel.interceptVector))
    +
    +    trainingSummary = lrModel.summary
    +
    +    # Obtain the objective per iteration
    +    objectiveHistory = trainingSummary.objectiveHistory
    +    print("objectiveHistory:")
    +    for objective in objectiveHistory:
    +        print(objective)
    +
    +    print("False positive rate by label:")
    --- End diff --
    
    Do we want a comment here that is consistent with the Java version above? `// for multiclass, we can inspect metrics on a per-label basis`
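
For readers unfamiliar with the per-label metrics this comment refers to, the following is a minimal, Spark-free sketch of what a "false positive rate by label" means in a multiclass setting: each label is treated one-vs-rest. The function name `false_positive_rate_by_label` and the sample data are assumptions made for illustration; this is not how Spark computes the metric internally.

```python
def false_positive_rate_by_label(pairs):
    """Return {label: false positive rate} for (prediction, label) pairs.

    Each label is evaluated one-vs-rest: a false positive for a label is a
    row predicted as that label whose true label is something else.
    """
    labels = sorted({l for _, l in pairs} | {p for p, _ in pairs})
    rates = {}
    for target in labels:
        false_pos = sum(1 for p, l in pairs if p == target and l != target)
        negatives = sum(1 for _, l in pairs if l != target)
        # Guard against a dataset in which every row carries the target label
        rates[target] = false_pos / negatives if negatives else 0.0
    return rates


# Hypothetical (prediction, label) pairs for a three-class problem
pairs = [(0, 0), (1, 0), (1, 1), (2, 1), (2, 2), (0, 2)]
rates = false_positive_rate_by_label(pairs)

print("False positive rate by label:")
for label, rate in rates.items():
    print("label %d: %s" % (label, rate))
```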


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/291/
    Test PASSed.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass summary example and us...

Posted by sethah <gi...@git.apache.org>.
Github user sethah commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    @jkbradley @MLnick 


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by sethah <gi...@git.apache.org>.
Github user sethah commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Thanks a lot for your review, @MLnick!


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Merged build finished. Test PASSed.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86713/
    Test PASSed.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164387272
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -125,7 +123,8 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionSummary)
     provides a summary for a
     [`LogisticRegressionModel`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionModel).
    -Currently, only binary classification is supported. Support for multiclass model summaries will be added in the future.
    +In the case of binary classification, certain additional metrics are
    --- End diff --
    
    Missing " The binary summary can be accessed via the ..." sentence in this one


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by WeichenXu123 <gi...@git.apache.org>.
Github user WeichenXu123 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164237753
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -111,10 +110,9 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/LogisticRegressionTrainingSummary.html)
     provides a summary for a
     [`LogisticRegressionModel`](api/java/org/apache/spark/ml/classification/LogisticRegressionModel.html).
    -Currently, only binary classification is supported and the
    -summary must be explicitly cast to
    -[`BinaryLogisticRegressionTrainingSummary`](api/java/org/apache/spark/ml/classification/BinaryLogisticRegressionTrainingSummary.html). 
    -Support for multiclass model summaries will be added in the future.
    +In the case of binary classification, certain additional metrics are
    --- End diff --
    
    `BinaryLogisticRegressionTrainingSummary` now extends `LogisticRegressionSummary`, so it inherits all of the metrics in `LogisticRegressionSummary`. We should note that in the docs. :)


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Merged build finished. Test FAILed.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    **[Test build #86397 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86397/testReport)** for PR 20332 at commit [`58d973e`](https://github.com/apache/spark/commit/58d973e204bd62128567fd3dfb2e5a335ac46bf1).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by MLnick <gi...@git.apache.org>.
Github user MLnick commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Merged to master / branch-2.3. Thanks @sethah, and @WeichenXu123 for review.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86397/
    Test PASSed.


---



[GitHub] spark issue #20332: [SPARK-23138][ML][DOC] Multiclass logistic regression su...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20332
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86396/
    Test FAILed.


---



[GitHub] spark pull request #20332: [SPARK-23138][ML][DOC] Multiclass logistic regres...

Posted by sethah <gi...@git.apache.org>.
Github user sethah commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20332#discussion_r164151796
  
    --- Diff: docs/ml-classification-regression.md ---
    @@ -125,7 +117,6 @@ Continuing the earlier example:
     [`LogisticRegressionTrainingSummary`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionSummary)
     provides a summary for a
     [`LogisticRegressionModel`](api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegressionModel).
    --- End diff --
    
    Done.


---
