Posted to issues@spark.apache.org by "chris snow (JIRA)" <ji...@apache.org> on 2017/03/23 14:17:41 UTC

[jira] [Created] (SPARK-20072) Clarify ALS-WR documentation

chris snow created SPARK-20072:
----------------------------------

             Summary: Clarify ALS-WR documentation
                 Key: SPARK-20072
                 URL: https://issues.apache.org/jira/browse/SPARK-20072
             Project: Spark
          Issue Type: Improvement
          Components: Documentation
    Affects Versions: 2.1.0
            Reporter: chris snow
            Priority: Trivial


https://www.mail-archive.com/user@spark.apache.org/msg62590.html

The documentation for collaborative filtering is as follows:

===
Scaling of the regularization parameter

Since v1.1, we scale the regularization parameter lambda in solving
each least squares problem by the number of ratings the user generated
in updating user factors, or the number of ratings the product
received in updating product factors.
===

I find this description confusing, probably because I lack a detailed
understanding of ALS. The wording suggests that the number of ratings
changes ("generated", "received") while the least squares problem is
being solved.

This is how I think I should be interpreting the description:

===
Since v1.1, we scale the regularization parameter lambda when solving
each least squares problem.  When updating the user factors, we scale
the regularization parameter by the total number of ratings from the
user.  Similarly, when updating the product factors, we scale the
regularization parameter by the total number of ratings for the
product.
===
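
To make that reading concrete, here is a minimal sketch of a single
user-factor update under this interpretation. It is not Spark's
implementation; the names update_user_factor, solve, ratings,
product_factors and lam are hypothetical and chosen only for
illustration. It assumes the explicit-feedback normal equations
(sum of y*y^T + lam * n_u * I) x = sum of r*y, where n_u is the fixed
count of ratings the user has in the training data.

```python
def solve(a, b):
    """Solve the small dense system a x = b (Gaussian elimination, partial pivoting)."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def update_user_factor(ratings, product_factors, lam):
    """Recompute one user's factor vector with product factors held fixed.

    ratings: dict product_id -> rating, for ONE user (hypothetical layout).
    product_factors: dict product_id -> list of floats (current factors).
    lam: the unscaled regularization parameter lambda.
    """
    rank = len(next(iter(product_factors.values())))
    n_u = len(ratings)   # total ratings from this user; constant during the solve
    scaled = lam * n_u   # assumption: lambda is multiplied by that count
    a = [[scaled if i == j else 0.0 for j in range(rank)] for i in range(rank)]
    b = [0.0] * rank
    for pid, r in ratings.items():
        y = product_factors[pid]
        for i in range(rank):
            b[i] += r * y[i]
            for j in range(rank):
                a[i][j] += y[i] * y[j]
    return solve(a, b)
```

Under the same assumption, the product-factor update would be the
mirror image: hold the user factors fixed and scale lam by the total
number of ratings the product has. In both cases the count is a
property of the training data and does not change while the least
squares problem is being solved.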



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
