You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Yanbo Liang (JIRA)" <ji...@apache.org> on 2015/08/10 12:33:45 UTC
[jira] [Commented] (SPARK-8518) Log-linear models for survival
analysis
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14679918#comment-14679918 ]
Yanbo Liang commented on SPARK-8518:
------------------------------------
[~mengxr] [~meihuawu]
I have update the design document, please review and comment it.
For the questions I can also clarify here:
1. Which algorithm is the most popular one?
Accelerated Failure Time (AFT) model
2. What is the size of the model?
It contains a weight vector (with intercept) and a scale parameter.
3. How do the algorithms fit into Spark? Are they easy to be parallelized?
I have got the loss function and gradient function, so we can parallelize it using SGD or L-BFGS.
4. What is the complexity?
It has the same complexity with LinearRegression.
> Log-linear models for survival analysis
> ---------------------------------------
>
> Key: SPARK-8518
> URL: https://issues.apache.org/jira/browse/SPARK-8518
> Project: Spark
> Issue Type: New Feature
> Components: ML
> Reporter: Xiangrui Meng
> Assignee: Yanbo Liang
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> We want to add basic log-linear models for survival analysis. The implementation should match the result from R's survival package (http://cran.r-project.org/web/packages/survival/index.html).
> Design doc from [~yanboliang]: https://docs.google.com/document/d/1fLtB0sqg2HlfqdrJlNHPhpfXO0Zb2_avZrxiVoPEs0E/pub
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org