You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Sebastian Schelter (JIRA)" <ji...@apache.org> on 2011/03/18 20:23:29 UTC
[jira] Created: (MAHOUT-628) Add an option to prune away users with
less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
----------------------------------------------------------------------------------------------------------------------
Key: MAHOUT-628
URL: https://issues.apache.org/jira/browse/MAHOUT-628
Project: Mahout
Issue Type: New Feature
Components: Collaborative Filtering
Affects Versions: 0.5
Reporter: Sebastian Schelter
Assignee: Sebastian Schelter
Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] Commented: (MAHOUT-628) Add an option to prune away users
with less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Sebastian Schelter (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008692#comment-13008692 ]
Sebastian Schelter commented on MAHOUT-628:
-------------------------------------------
Could be done like this in the non-distributed code, the patch only covers the hadoop related code.
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-628) Add an option to prune away users
with less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Sebastian Schelter (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Schelter updated MAHOUT-628:
--------------------------------------
Status: Patch Available (was: In Progress)
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAHOUT-628) Add an option to prune away users
with less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13009900#comment-13009900 ]
Hudson commented on MAHOUT-628:
-------------------------------
Integrated in Mahout-Quality #686 (See [https://hudson.apache.org/hudson/job/Mahout-Quality/686/])
MAHOUT-628 Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Fix For: 0.5
>
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] Commented: (MAHOUT-628) Add an option to prune away users
with less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Lance Norskog (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008683#comment-13008683 ]
Lance Norskog commented on MAHOUT-628:
--------------------------------------
Could this be done with a DataModel wrapper instead?
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] Commented: (MAHOUT-628) Add an option to prune away users
with less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13008911#comment-13008911 ]
Sean Owen commented on MAHOUT-628:
----------------------------------
Looks reasonable enough to proceed with
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] Updated: (MAHOUT-628) Add an option to prune away users with
less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Sebastian Schelter (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Schelter updated MAHOUT-628:
--------------------------------------
Attachment: MAHOUT-628.patch
Patch attached. I'll commit this in the next days if there are no objections.
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-628) Add an option to prune away users
with less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Sebastian Schelter (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Schelter updated MAHOUT-628:
--------------------------------------
Resolution: Fixed
Fix Version/s: 0.5
Status: Resolved (was: Patch Available)
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Fix For: 0.5
>
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Work started] (MAHOUT-628) Add an option to prune away
users with less than a given number of preferences to ItemSimilarityJob and
RecommenderJob
Posted by "Sebastian Schelter (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on MAHOUT-628 started by Sebastian Schelter.
> Add an option to prune away users with less than a given number of preferences to ItemSimilarityJob and RecommenderJob
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-628
> URL: https://issues.apache.org/jira/browse/MAHOUT-628
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Affects Versions: 0.5
> Reporter: Sebastian Schelter
> Assignee: Sebastian Schelter
> Fix For: 0.5
>
> Attachments: MAHOUT-628.patch
>
>
> Some real-world datasets (especially those created from implicit feedback) might include users with only a tiny number of preferences (like one-time-visitors only viewing a single item) that a users of ItemSimilarityJob or RecommenderJob might want to prune away. I added a new parameter "minPrefsPerUser" that makes those jobs throw out users with less than a given number of preferences. It is per default set to 1 so that the input data stays untouched.
> It's just a small patch to make those jobs more usable in real-world scenarios.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira