Posted to issues@spark.apache.org by "ruxi zhang (JIRA)" <ji...@apache.org> on 2018/10/19 17:38:00 UTC

[jira] [Created] (SPARK-25781) relative importance of linear regression

ruxi zhang created SPARK-25781:
----------------------------------

             Summary: relative importance of linear regression
                 Key: SPARK-25781
                 URL: https://issues.apache.org/jira/browse/SPARK-25781
             Project: Spark
          Issue Type: New Feature
          Components: MLlib
    Affects Versions: 2.3.2
            Reporter: ruxi zhang


There is an R package, relaimpo, that generates relative importance for linear regression features.  The method uses Shapley value regression, which takes a long time to run on big datasets.  It is quite useful for many use cases, such as attribution models in marketing.  It would be great if it were implemented in Spark with parallel computing, which would produce results in a much shorter time.
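
To make the request concrete, here is a minimal single-machine sketch of the Shapley value decomposition of R^2, which, as far as I understand, is what relaimpo reports as its "lmg" metric. It is not relaimpo's or Spark's code. The function names (solve, covariance, r_squared, shapley_importance) and the toy data are made up for illustration. The sketch assumes the predictors are not perfectly collinear, and it enumerates all 2^p feature subsets, so it is only practical for a small number of features.

```python
from itertools import combinations
from math import factorial


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / m[i][i]
    return x


def covariance(u, v):
    """Population covariance of two equal-length sequences."""
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v)) / len(u)


def r_squared(cov_xx, cov_xy, var_y, subset):
    """R^2 of an OLS fit (with intercept) of y on the features in `subset`."""
    if not subset:
        return 0.0
    a = [[cov_xx[i][j] for j in subset] for i in subset]
    b = [cov_xy[i] for i in subset]
    beta = solve(a, b)
    return sum(bi * ci for bi, ci in zip(beta, b)) / var_y


def shapley_importance(columns, y):
    """Return (per-feature shares of R^2, R^2 of the full model)."""
    p = len(columns)
    # Only these covariances depend on the rows of the dataset.
    cov_xx = [[covariance(columns[i], columns[j]) for j in range(p)]
              for i in range(p)]
    cov_xy = [covariance(c, y) for c in columns]
    var_y = covariance(y, y)

    # R^2 for every subset of features (2^p entries).
    r2 = {}
    for k in range(p + 1):
        for s in combinations(range(p), k):
            r2[s] = r_squared(cov_xx, cov_xy, var_y, s)

    shares = []
    for j in range(p):
        others = [i for i in range(p) if i != j]
        total = 0.0
        for k in range(p):
            # Shapley weight for a subset of size k that excludes feature j.
            weight = factorial(k) * factorial(p - k - 1) / factorial(p)
            for s in combinations(others, k):
                with_j = tuple(sorted(s + (j,)))
                total += weight * (r2[with_j] - r2[s])
        shares.append(total)
    return shares, r2[tuple(range(p))]


# Toy data, invented for this example.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2 = [2.0, 1.0, 4.0, 3.0, 6.0, 5.0]
x3 = [1.0, 0.0, 0.0, 1.0, 1.0, 0.0]
y = [3.1, 3.9, 7.2, 7.8, 11.1, 10.9]

shares, full_r2 = shapley_importance([x1, x2, x3], y)
print("shares of R^2:", shares)
print("full-model R^2:", full_r2)
```

The shares add up to the R^2 of the full model, which is what makes them usable as an attribution. In this sketch the rows of the data are touched only when the covariances are computed; the subset enumeration works on a small p-by-p matrix. A distributed version could therefore, under that assumption, compute the covariances in parallel and spread the 2^p subset fits across workers.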



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org