Posted to issues@spark.apache.org by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2015/08/11 21:22:45 UTC

[jira] [Created] (SPARK-9834) Normal equation solver and summary statistics for ordinary least squares

Xiangrui Meng created SPARK-9834:
------------------------------------

             Summary: Normal equation solver and summary statistics for ordinary least squares
                 Key: SPARK-9834
                 URL: https://issues.apache.org/jira/browse/SPARK-9834
             Project: Spark
          Issue Type: New Feature
          Components: ML
            Reporter: Xiangrui Meng
            Assignee: Xiangrui Meng


Add a normal equation solver for ordinary least squares problems with a small number of features. The approach requires one pass over the data to collect AtA and Atb, and then solves the problem on the driver. It works well when the problem is not very ill-conditioned and does not have many columns. It also provides R-like summary statistics.
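
For illustration only, here is a minimal single-machine sketch of the idea in plain Python. It is not the proposed Spark implementation. The function names (accumulate, cholesky, cholesky_solve, fit_ols) are hypothetical, the intercept is assumed to be supplied as a leading column of ones, and the use of a Cholesky factorization for the driver-side solve is an assumption. The sketch also accumulates btb (the sum of squared labels) in the same pass, which the description does not mention, so that the residual sum of squares and coefficient standard errors can be computed without a second pass.

```python
import math

def accumulate(rows, d):
    """Single pass over (features, label) pairs: returns AtA, Atb, btb, n."""
    ata = [[0.0] * d for _ in range(d)]
    atb = [0.0] * d
    btb, n = 0.0, 0
    for x, y in rows:
        for i in range(d):
            atb[i] += x[i] * y
            for j in range(d):
                ata[i][j] += x[i] * x[j]
        btb += y * y  # assumption: extra accumulator used only for the statistics
        n += 1
    return ata, atb, btb, n

def cholesky(m):
    """Lower-triangular L with L * L^T = m; fails if m is not positive definite."""
    d = len(m)
    l = [[0.0] * d for _ in range(d)]
    for i in range(d):
        for j in range(i + 1):
            s = m[i][j] - sum(l[i][k] * l[j][k] for k in range(j))
            if i == j:
                if s <= 0.0:
                    raise ValueError("AtA is not positive definite")
                l[i][j] = math.sqrt(s)
            else:
                l[i][j] = s / l[j][j]
    return l

def cholesky_solve(l, rhs):
    """Solve (L * L^T) x = rhs by forward and then backward substitution."""
    d = len(l)
    z = [0.0] * d
    for i in range(d):
        z[i] = (rhs[i] - sum(l[i][k] * z[k] for k in range(i))) / l[i][i]
    x = [0.0] * d
    for i in reversed(range(d)):
        x[i] = (z[i] - sum(l[k][i] * x[k] for k in range(i + 1, d))) / l[i][i]
    return x

def fit_ols(rows, d):
    """Return (coefficients, standard errors); requires more rows than columns."""
    ata, atb, btb, n = accumulate(rows, d)
    l = cholesky(ata)
    coef = cholesky_solve(l, atb)
    # At the least-squares solution, RSS = btb - coef . Atb.
    rss = max(btb - sum(c * v for c, v in zip(coef, atb)), 0.0)
    sigma2 = rss / (n - d)
    std_err = []
    for i in range(d):
        unit = [1.0 if k == i else 0.0 for k in range(d)]
        # i-th diagonal entry of (AtA)^-1, obtained by solving against a unit vector
        std_err.append(math.sqrt(sigma2 * cholesky_solve(l, unit)[i]))
    return coef, std_err

# Toy data: first column is the intercept term.
rows = [([1.0, 0.0], 1.0), ([1.0, 1.0], 3.0), ([1.0, 2.0], 5.0), ([1.0, 3.0], 8.0)]
coef, std_err = fit_ols(rows, 2)
print(coef, std_err)
```

On the toy data this gives coefficients of approximately 0.8 and 2.3 with standard errors of approximately 0.324 and 0.173. In a distributed setting, only the d-by-d and length-d accumulators would need to be merged across partitions, which is why the approach suits problems with few columns.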

We can hide this implementation under LinearRegression. It is triggered when there are no more than, e.g., 4096 features.
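
As a rough sketch of how such a dispatch might look, the following uses hypothetical names (NORMAL_EQUATION_MAX_FEATURES, choose_solver, gram_matrix_bytes) that are not part of any Spark API. The 4096 threshold is only the example value given above, and the "iterative" label for the fallback path is an assumption. The helper also shows why a cap is needed: a dense AtA stored as 8-byte doubles grows quadratically with the number of features.

```python
# Example threshold taken from the description ("e.g., 4096"); not a confirmed default.
NORMAL_EQUATION_MAX_FEATURES = 4096

def choose_solver(num_features, max_features=NORMAL_EQUATION_MAX_FEATURES):
    """Pick the normal equation path only when the feature count is small enough."""
    return "normal" if num_features <= max_features else "iterative"

def gram_matrix_bytes(num_features):
    """Size of a dense num_features x num_features matrix of 8-byte doubles."""
    return num_features * num_features * 8

print(choose_solver(100), choose_solver(10000))
print(gram_matrix_bytes(4096))  # 134217728 bytes, i.e. 128 MiB
```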



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org