You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2017/11/02 05:17:00 UTC

[jira] [Commented] (SPARK-22422) Add Adjusted R2 to RegressionMetrics

    [ https://issues.apache.org/jira/browse/SPARK-22422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16235195#comment-16235195 ] 

Apache Spark commented on SPARK-22422:
--------------------------------------

User 'tengpeng' has created a pull request for this issue:
https://github.com/apache/spark/pull/19638

> Add Adjusted R2 to RegressionMetrics
> ------------------------------------
>
>                 Key: SPARK-22422
>                 URL: https://issues.apache.org/jira/browse/SPARK-22422
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 2.2.0
>            Reporter: Teng Peng
>            Priority: Minor
>
> In practice, no one looks at R2 alone. The reason is R2 itself is misleading. If we add more parameters, R2 will not decrease but only increase (or stay the same). This leads to overfitting.
> I added adjusted R2 as the metric which was implemented in all major statistical analysis tools.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org