You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Reza Zadeh (JIRA)" <ji...@apache.org> on 2014/12/29 03:05:13 UTC

[jira] [Commented] (SPARK-4981) Add a streaming singular value decomposition

    [ https://issues.apache.org/jira/browse/SPARK-4981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14259789#comment-14259789 ] 

Reza Zadeh commented on SPARK-4981:
-----------------------------------

We could do matrix completion (least squares objective, reqularized, note that this is not SVD) in a streaming fashion using Stochastic Gradient Descent.

See the update equations in Algorithm 1:
http://stanford.edu/~rezab/papers/factorbird.pdf

The stream is over individual entries (as opposed a whole row/column).

We should probably do streaming matrix completion before streaming SVD.

> Add a streaming singular value decomposition
> --------------------------------------------
>
>                 Key: SPARK-4981
>                 URL: https://issues.apache.org/jira/browse/SPARK-4981
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib, Streaming
>            Reporter: Jeremy Freeman
>
> This is for tracking WIP on a streaming singular value decomposition implementation. This will likely be more complex than the existing streaming algorithms (k-means, regression), but should be possible using the family of sequential update rule outlined in this paper:
> "Fast low-rank modifications of the thin singular value decomposition"
> by Matthew Brand
> http://www.stat.osu.edu/~dmsl/thinSVDtracking.pdf



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org