Posted to dev@bigtop.apache.org by "bhashit parikh (JIRA)" <ji...@apache.org> on 2014/07/03 05:43:25 UTC
[jira] [Comment Edited] (BIGTOP-1272) BigPetStore: Productionize the Mahout recommender
[ https://issues.apache.org/jira/browse/BIGTOP-1272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14051025#comment-14051025 ]
bhashit parikh edited comment on BIGTOP-1272 at 7/3/14 3:41 AM:
----------------------------------------------------------------
Submitted the patch with the code. Haven't updated arch.dot yet since I want to test out the whole flow using the {{hadoop jar}} commands with the {{mahout}} jobs once before updating it.
To run the recommender, we first need to run the pig clean job using
{noformat}
gradle clean integrationTest -PITProfile=pig
{noformat}
and then the mahout jobs:
{noformat}
gradle integrationTest -PITProfile=mahout
{noformat}
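The two steps above could also be chained in a small wrapper script. This is only a sketch and not part of the patch: the function name {{run_recommender_flow}} and the {{GRADLE}} override variable are made up for illustration, and it assumes that gradle is on the PATH and that the script is run from the BigPetStore project directory.

```shell
#!/bin/sh
# Sketch only: run the two integration-test profiles from the comment in order.
# Assumption: executed from the BigPetStore project directory.
set -e  # stop at the first failing step, so the mahout jobs never run after a failed pig job

# GRADLE is a hypothetical override hook (not something the build defines).
# It defaults to plain "gradle" and is left unquoted below so it may carry extra arguments.
GRADLE="${GRADLE:-gradle}"

run_recommender_flow() {
    # Step 1: the pig clean job, with "clean" as in the comment.
    $GRADLE clean integrationTest -PITProfile=pig
    # Step 2: the mahout jobs, without "clean", matching the edited comment.
    $GRADLE integrationTest -PITProfile=mahout
}

# Only run the flow when asked to, so the file can also be sourced.
if [ "${1:-}" = "run" ]; then
    run_recommender_flow
fi
```

Saved under a hypothetical name such as {{run-recommender.sh}}, it would be started with {{sh run-recommender.sh run}}. The second command leaves out {{clean}}, presumably so that the output of the pig job is still there when the mahout jobs start; that reason is a guess and is not stated in the patch.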
was (Author: bhashit):
Submitted the patch with the code. Haven't updated arch.dot yet since I want to test out the whole flow using the {{hadoop jar}} commands with the {{mahout}} jobs once before updating it.
To run the recommender, we first need to run the pig clean job using
{noformat}
gradle clean integrationTest -PITProfile=pig
{noformat}
and then the mahout jobs:
{noformat}
gradle clean integrationTest -PITProfile=mahout
{noformat}
> BigPetStore: Productionize the Mahout recommender
> -------------------------------------------------
>
> Key: BIGTOP-1272
> URL: https://issues.apache.org/jira/browse/BIGTOP-1272
> Project: Bigtop
> Issue Type: New Feature
> Components: Blueprints
> Affects Versions: backlog
> Reporter: jay vyas
> Attachments: BIGTOP-1272.patch, arch.jpeg
>
>
> BIGTOP-1271 adds patterns to the data that guarantee that a meaningful type of product recommendation can be given for at least *some* customers. We know that there will be many customers who bought only 1 product, and also customers who bought 2 or more products, even in a dataset of size 10, because of the Gaussian distribution of purchases that is also in the dataset generator.
> The current Mahout recommender code is statically valid: it runs to completion in local unit tests if a Hadoop 1x tarball is present, but otherwise it hasn't been tested at scale. So, let's get it working. This JIRA will also comprise:
> - deciding whether to use Mahout 2x for unit tests (the default on the Mahout Maven repo is the 1x impl), and whether or not Bigtop should host a Mahout 2x jar. After all, Bigtop builds a Mahout 2x jar as part of its packaging process, and BigPetStore might thus need a Mahout 2x jar in order to test against the right set of Bigtop releases.
--
This message was sent by Atlassian JIRA
(v6.2#6252)