You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by mengxr <gi...@git.apache.org> on 2014/08/20 20:41:45 UTC

[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

GitHub user mengxr opened a pull request:

    https://github.com/apache/spark/pull/2064

    [SPARK-2843][MLLIB] add a section about regularization parameter in ALS

    @atalwalkar @srowen

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mengxr/spark als-doc

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/2064.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2064
    
----
commit 1532caa90fb95fb8f1c4a4e84d3c9aacf44aa29b
Author: Xiangrui Meng <me...@databricks.com>
Date:   2014-08-20T18:39:21Z

    add a section about regularization parameter in ALS

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

Posted by mengxr <gi...@git.apache.org>.
Github user mengxr commented on the pull request:

    https://github.com/apache/spark/pull/2064#issuecomment-52824922
  
    @srowen Thanks for providing the reference! I added it to the doc.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/2064#issuecomment-52823039
  
    I think this particular technique is called "ALS-WR", at least, that's how I know of it from this paper: http://www.hpl.hp.com/personal/Robert_Schreiber/papers/2008%20AAIM%20Netflix/netflix_aaim08(submitted).pdf  You could add a mention of this.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2064#issuecomment-52833451
  
      [QA tests have finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18976/consoleFull) for   PR 2064 at commit [`98abdd7`](https://github.com/apache/spark/commit/98abdd70d963cb8cdd4ab4870ca49c96ec39abaa).
     * This patch **passes** unit tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/2064


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

Posted by atalwalkar <gi...@git.apache.org>.
Github user atalwalkar commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2064#discussion_r16500714
  
    --- Diff: docs/mllib-collaborative-filtering.md ---
    @@ -43,6 +43,17 @@ level of confidence in observed user preferences, rather than explicit ratings g
     model then tries to find latent factors that can be used to predict the expected preference of a
     user for an item.
     
    +### Scaling of the regularization parameter
    +
    +Since v1.1, we scale the regularization parameter `lambda` in solving each least squares problem by
    +the number of ratings the user generated in updating user factors,
    +or the number of ratings the product received in updating product factors.
    +This approach is named "ALS-WR" and introduced in the paper
    --- End diff --
    
    This technique was used by many groups in the Netflix Prize (see section 5 of this paper for some more references: http://www.cs.toronto.edu/~rsalakhu/papers/weighted_tc.pdf).  Perhaps this sentence should be changed to "This approach is discussed in the following paper..."


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2064#issuecomment-52825040
  
      [QA tests have started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18976/consoleFull) for   PR 2064 at commit [`98abdd7`](https://github.com/apache/spark/commit/98abdd70d963cb8cdd4ab4870ca49c96ec39abaa).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-2843][MLLIB] add a section about regula...

Posted by mengxr <gi...@git.apache.org>.
Github user mengxr commented on the pull request:

    https://github.com/apache/spark/pull/2064#issuecomment-52865576
  
    I merged this into master and branch-1.1. Thanks @srowen and @atalwalkar for reviewing!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org