You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2017/11/15 16:14:00 UTC

[jira] [Resolved] (SPARK-22422) Add Adjusted R2 to RegressionMetrics

     [ https://issues.apache.org/jira/browse/SPARK-22422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen resolved SPARK-22422.
-------------------------------
       Resolution: Fixed
    Fix Version/s: 2.3.0

Issue resolved by pull request 19638
[https://github.com/apache/spark/pull/19638]

> Add Adjusted R2 to RegressionMetrics
> ------------------------------------
>
>                 Key: SPARK-22422
>                 URL: https://issues.apache.org/jira/browse/SPARK-22422
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 2.2.0
>            Reporter: Teng Peng
>            Priority: Minor
>             Fix For: 2.3.0
>
>
> In practice, no one looks at R2 alone. The reason is R2 itself is misleading. If we add more parameters, R2 will not decrease but only increase (or stay the same). This leads to overfitting.
> I added adjusted R2 as the metric which was implemented in all major statistical analysis tools.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org