You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@bigtop.apache.org by "jay vyas (JIRA)" <ji...@apache.org> on 2014/04/16 12:14:14 UTC
[jira] [Created] (BIGTOP-1270) BigPetStore: Productionize the Hive
portion
jay vyas created BIGTOP-1270:
--------------------------------
Summary: BigPetStore: Productionize the Hive portion
Key: BIGTOP-1270
URL: https://issues.apache.org/jira/browse/BIGTOP-1270
Project: Bigtop
Issue Type: New Feature
Components: Blueprints
Reporter: jay vyas
The hive portion of the BigPetStore blueprint app builds a "view" over the cleaned data that Mahout can then use to do product recommendations.
The hive code in bigpetstore only runs locally - lets add the necessary configuration hooks and/or (if we have to) externalize the hive script itself from java so that its easy to run directly on a cluster.
And lets actually run it on some kind of a cluster at scale. The contract for the hive portion is an output file with three numbers like this:
{noformat}
100 30021 1
100 212341 1
...
{noformat}
Signifying that customer=100 likes both of the products "30021" and "212341".
--
This message was sent by Atlassian JIRA
(v6.2#6252)