You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Reza Zadeh (JIRA)" <ji...@apache.org> on 2015/01/31 05:42:34 UTC

[jira] [Commented] (SPARK-4981) Add a streaming singular value decomposition

    [ https://issues.apache.org/jira/browse/SPARK-4981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14299666#comment-14299666 ] 

Reza Zadeh commented on SPARK-4981:
-----------------------------------

To be model parallel, we can simply warm-start the current ALS implementation in org.apache.spark.mllib.recommendation

The work involved would be to expose a warm-start option in ALS, and then redo training with say 2 iterations instead of 10, with each batch of RDDs.

The stream would be over batches of Ratings.

This should be the simplest option.


> Add a streaming singular value decomposition
> --------------------------------------------
>
>                 Key: SPARK-4981
>                 URL: https://issues.apache.org/jira/browse/SPARK-4981
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib, Streaming
>            Reporter: Jeremy Freeman
>
> This is for tracking WIP on a streaming singular value decomposition implementation. This will likely be more complex than the existing streaming algorithms (k-means, regression), but should be possible using the family of sequential update rule outlined in this paper:
> "Fast low-rank modifications of the thin singular value decomposition"
> by Matthew Brand
> http://www.stat.osu.edu/~dmsl/thinSVDtracking.pdf



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org