You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Reinis Vicups <ma...@orbit-x.de> on 2014/03/28 17:13:53 UTC

resplit not generating splits

Hi,

when I run "mahout resplit", I get this output:

support@hadoop1:~$ mahout resplit --input 
.../final/clusteredPoints/part-m-* --output .../final/split --numSplits 4
MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
Running on hadoop, using 
/opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/bin/../lib/hadoop/bin/hadoop 
and HADOOP_CONF_DIR=/etc/hadoop/conf
MAHOUT-JOB: 
/opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/lib/mahout/mahout-examples-0.8-cdh5.0.0-beta-2-job.jar
14/03/28 16:22:50 WARN driver.MahoutDriver: No resplit.props found on 
classpath, will use command-line arguments only
Writing 4 splits
Writing split 0
Writing split 1
Writing split 2
Writing split 3
14/03/28 16:22:52 INFO driver.MahoutDriver: Program took 2077 ms 
(Minutes: 0.034616666666666664)

but there are no files generated nor changed. I tried different path 
combos and different numSplits to a no avail.

My question is - what gives?

kind regards
reinis

Re: resplit not generating splits

Posted by Andrew Musselman <an...@gmail.com>.
I'll take a look this weekend.  Could you post the steps you took to get to
this step?

If you'd like to help further, you could create an account on the Apache
Jira page for Mahout and file a bug detailing these steps:
https://issues.apache.org/jira/browse/MAHOUT

Thanks
Andrew


On Fri, Mar 28, 2014 at 9:13 AM, Reinis Vicups <ma...@orbit-x.de> wrote:

> Hi,
>
> when I run "mahout resplit", I get this output:
>
> support@hadoop1:~$ mahout resplit --input .../final/clusteredPoints/part-m-*
> --output .../final/split --numSplits 4
> MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
> Running on hadoop, using /opt/cloudera/parcels/CDH-5.0.
> 0-0.cdh5b2.p0.27/bin/../lib/hadoop/bin/hadoop and
> HADOOP_CONF_DIR=/etc/hadoop/conf
> MAHOUT-JOB: /opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/lib/mahout/
> mahout-examples-0.8-cdh5.0.0-beta-2-job.jar
> 14/03/28 16:22:50 WARN driver.MahoutDriver: No resplit.props found on
> classpath, will use command-line arguments only
> Writing 4 splits
> Writing split 0
> Writing split 1
> Writing split 2
> Writing split 3
> 14/03/28 16:22:52 INFO driver.MahoutDriver: Program took 2077 ms (Minutes:
> 0.034616666666666664)
>
> but there are no files generated nor changed. I tried different path
> combos and different numSplits to a no avail.
>
> My question is - what gives?
>
> kind regards
> reinis
>