Posted to dev@mahout.apache.org by "Sebastian Schelter (JIRA)" <ji...@apache.org> on 2014/04/18 15:13:15 UTC

[jira] [Resolved] (MAHOUT-1497) mahout resplit not producing splited files

     [ https://issues.apache.org/jira/browse/MAHOUT-1497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Schelter resolved MAHOUT-1497.
----------------------------------------

    Resolution: Fixed
      Assignee: Sebastian Schelter

I added a test that checks whether the splits are generated; see https://svn.apache.org/viewvc/mahout/trunk/core/src/test/java/org/apache/mahout/clustering/streaming/tools/ResplitSequenceFilesTest.java?view=log

The tests pass and I couldn't find any error, so I'm resolving this. Please try again and reopen the issue if resplitting still doesn't work for you.

> mahout resplit not producing splited files
> ------------------------------------------
>
>                 Key: MAHOUT-1497
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1497
>             Project: Mahout
>          Issue Type: Bug
>          Components: CLI
>    Affects Versions: 0.8
>            Reporter: Reinis Vicups
>            Assignee: Sebastian Schelter
>             Fix For: 1.0
>
>
> When I run "mahout resplit", I get the output below, but no split files are produced.
> {code}
> support@hadoop1:~$ mahout resplit --input .../final/clusteredPoints/part-m-* --output .../final/split --numSplits 4
> MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
> Running on hadoop, using /opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/bin/../lib/hadoop/bin/hadoop and HADOOP_CONF_DIR=/etc/hadoop/conf
> MAHOUT-JOB: /opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/lib/mahout/mahout-examples-0.8-cdh5.0.0-beta-2-job.jar
> 14/03/28 16:22:50 WARN driver.MahoutDriver: No resplit.props found on classpath, will use command-line arguments only
> Writing 4 splits
> Writing split 0
> Writing split 1
> Writing split 2
> Writing split 3
> 14/03/28 16:22:52 INFO driver.MahoutDriver: Program took 2077 ms (Minutes: 0.034616666666666664)
> {code}
> The folder "clusteredPoints" passed to --input of resplit contains clustered points generated by Mahout's k-means algorithm.
