You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Joseph K. Bradley (JIRA)" <ji...@apache.org> on 2018/04/18 00:45:00 UTC
[jira] [Commented] (SPARK-18693) BinaryClassificationEvaluator,
RegressionEvaluator, and MulticlassClassificationEvaluator should use
sample weight data
[ https://issues.apache.org/jira/browse/SPARK-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16441713#comment-16441713 ]
Joseph K. Bradley commented on SPARK-18693:
-------------------------------------------
[~imatiach] Would you mind creating JIRA subtasks so that we have 1 PR per JIRA? That helps with tracking. Thanks!
> BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data
> -----------------------------------------------------------------------------------------------------------------------
>
> Key: SPARK-18693
> URL: https://issues.apache.org/jira/browse/SPARK-18693
> Project: Spark
> Issue Type: Improvement
> Components: ML
> Affects Versions: 2.0.2
> Reporter: Devesh Parekh
> Priority: Major
>
> The LogisticRegression and LinearRegression models support training with a weight column, but the corresponding evaluators do not support computing metrics using those weights. This breaks model selection using CrossValidator.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org