Posted to dev@bigtop.apache.org by "jay vyas (JIRA)" <ji...@apache.org> on 2013/10/28 20:40:31 UTC

[jira] [Updated] (BIGTOP-1128) FIX and modularize mahout sample data sets

     [ https://issues.apache.org/jira/browse/BIGTOP-1128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

jay vyas updated BIGTOP-1128:
-----------------------------

    Summary: FIX and modularize mahout sample data sets  (was: modularize and update mahout sample data sets)

> FIX and modularize mahout sample data sets
> ------------------------------------------
>
>                 Key: BIGTOP-1128
>                 URL: https://issues.apache.org/jira/browse/BIGTOP-1128
>             Project: Bigtop
>          Issue Type: Bug
>            Reporter: jay vyas
>
> The mahout smokes have a lot of dependencies.
> Concretely, we need to fix the MovieLens sample data URL, which has moved
> from http://www.grouplens.org/system/files/ml-1m.zip
> to http://files.grouplens.org/papers/ml-1m.zip
> Otherwise the mahout smokes break for obvious reasons.
> More generally, consolidating and verifying these download URLs in a separate function might make the tests simpler to debug. Otherwise, HTML documents get stored as .zip files, which causes a very hard-to-interpret error in the tests: an exception saying the zip file isn't formatted correctly.
> Any other thoughts on how to simplify and isolate the moving parts of the mahout tests?
> We can bundle them into a patch. It would be a shame if the only thing this JIRA resulted in was a fix to a single URL :)
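
To make the "separate function" idea concrete, here is a rough sketch of what consolidating and verifying the download URLs could look like. It is written in Python purely for illustration and is not the actual Bigtop smoke-test code. The names SAMPLE_DATA_URLS, BadDownloadError, verify_zip_payload and fetch_sample_data are hypothetical; only the ml-1m URL comes from this issue.

```python
import io
import urllib.request
import zipfile

# Hypothetical single registry of sample-data URLs, so a moved file
# only has to be fixed in one place. Only this URL is from the issue.
SAMPLE_DATA_URLS = {
    "movielens-1m": "http://files.grouplens.org/papers/ml-1m.zip",
}


class BadDownloadError(Exception):
    """Raised when a download is not the zip archive we expected."""


def verify_zip_payload(name, url, payload):
    """Return payload unchanged if it is a zip archive, else raise early."""
    if zipfile.is_zipfile(io.BytesIO(payload)):
        return payload
    # A moved URL often answers with an HTML page instead of the archive.
    head = payload[:64].lstrip().lower()
    hint = ""
    if head.startswith((b"<!doctype", b"<html")):
        hint = " (response looks like an HTML page; has the URL moved?)"
    raise BadDownloadError(
        "%s: %s did not return a zip archive%s" % (name, url, hint)
    )


def fetch_sample_data(name, opener=urllib.request.urlopen):
    """Download one registered data set and verify it before any test uses it.

    The opener parameter is an assumption of this sketch; it lets tests
    substitute a fake download for the network call.
    """
    url = SAMPLE_DATA_URLS[name]
    with opener(url) as response:
        payload = response.read()
    return verify_zip_payload(name, url, payload)
```

With something along these lines, a moved URL would fail at download time with a message naming the data set and the URL, instead of surfacing later as a malformed-zip exception inside the tests.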


