You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2018/04/18 00:45:00 UTC

[jira] [Commented] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data

    [ https://issues.apache.org/jira/browse/SPARK-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16441713#comment-16441713 ] 

Joseph K. Bradley commented on SPARK-18693:
-------------------------------------------

[~imatiach] Would you mind creating JIRA subtasks so that we have 1 PR per JIRA?  That helps with tracking.  Thanks!

> BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data
> -----------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-18693
>                 URL: https://issues.apache.org/jira/browse/SPARK-18693
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 2.0.2
>            Reporter: Devesh Parekh
>            Priority: Major
>
> The LogisticRegression and LinearRegression models support training with a weight column, but the corresponding evaluators do not support computing metrics using those weights. This breaks model selection using CrossValidator.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org