You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Sebastian Schelter (JIRA)" <ji...@apache.org> on 2013/07/21 20:16:48 UTC

[jira] [Created] (MAHOUT-1289) Move downsampling code into RowSimilarityJob

Sebastian Schelter created MAHOUT-1289:
------------------------------------------

             Summary: Move downsampling code into RowSimilarityJob
                 Key: MAHOUT-1289
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1289
             Project: Mahout
          Issue Type: Improvement
          Components: Math
            Reporter: Sebastian Schelter
            Assignee: Sebastian Schelter
             Fix For: 0.9


When computing similarities with RowSimilarityJob, downsampling highly frequent things is crucial for performance. At the moment, this is done by the data preparation code for collaborative filtering.

We should move the downsampling directly into RowSimilarityJob as we've seen a lot of cases where users want to directly use it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira