Posted to dev@mahout.apache.org by "Antonio Molins (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2012/02/03 23:58:54 UTC

[jira] [Issue Comment Edited] (MAHOUT-971) kmeans does not work in S3

    [ https://issues.apache.org/jira/browse/MAHOUT-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13200118#comment-13200118 ] 

Antonio Molins edited comment on MAHOUT-971 at 2/3/12 10:57 PM:
----------------------------------------------------------------

I am new and haven't figured out how to commit to SVN, but I was able to fix this by changing the offending lines at

core/src/main/java/org/apache/mahout/common/iterator/sequencefile/SequenceFileDirValueIterator.java:66

to

FileSystem fs = FileSystem.get(path.toUri(), conf);

and

core/src/main/java/org/apache/mahout/clustering/kmeans/KMeansDriver.java:298
core/src/main/java/org/apache/mahout/clustering/kmeans/KMeansDriver.java:322

to

FileSystem.get(output.toUri(), conf).rename(new Path(output, AbstractCluster.CLUSTERS_DIR + (iteration-1)), finalClustersIn);
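
To illustrate why passing the URI matters, here is a minimal sketch that uses only java.net.URI, so it runs without Hadoop on the classpath. It is a toy model of the lookup, not Hadoop's actual implementation: the class name, the helper names, and the hdfs://namenode:8020/ default are all assumptions made up for the example.

```java
import java.net.URI;

public class SchemeLookupSketch {
    // Assumed cluster default, standing in for the configured default filesystem (hypothetical value).
    static final URI DEFAULT_FS = URI.create("hdfs://namenode:8020/");

    // Models FileSystem.get(conf): the path is never consulted, so the default always wins.
    static String schemeWithoutUri() {
        return DEFAULT_FS.getScheme();
    }

    // Models FileSystem.get(uri, conf): use the URI's own scheme and fall back
    // to the default only when the URI has none.
    static String schemeForUri(URI uri) {
        String scheme = uri.getScheme();
        return scheme != null ? scheme : DEFAULT_FS.getScheme();
    }

    public static void main(String[] args) {
        URI output = URI.create("s3n://example-bucket/kmeans/output"); // hypothetical bucket
        System.out.println(schemeWithoutUri());                               // prints "hdfs"
        System.out.println(schemeForUri(output));                             // prints "s3n"
        System.out.println(schemeForUri(URI.create("/tmp/kmeans/output")));   // prints "hdfs"
    }
}
```

In real Hadoop code the same effect comes from FileSystem.get(path.toUri(), conf) as in the lines above; path.getFileSystem(conf) should be an equivalent shorthand.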


                
      was (Author: amolins):
    I am new and haven't figured out how to commit to SVN, but I was able to fix this by changing the offending lines at

core/src/main/java/org/apache/mahout/common/iterator/sequencefile/SequenceFileDirValueIterator.java:66
core/src/main/java/org/apache/mahout/common/iterator/sequencefile/SequenceFileDirValueIterator.java:89

to

FileSystem fs = FileSystem.get(path.toUri(), conf);

and

core/src/main/java/org/apache/mahout/clustering/kmeans/KMeansDriver.java:298
core/src/main/java/org/apache/mahout/clustering/kmeans/KMeansDriver.java:322

to

FileSystem.get(output.toUri(), conf).rename(new Path(output, AbstractCluster.CLUSTERS_DIR + (iteration-1)), finalClustersIn);


                  
> kmeans does not work in S3
> --------------------------
>
>                 Key: MAHOUT-971
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-971
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.6
>         Environment: amazon S3
>            Reporter: Antonio Molins
>              Labels: hadoop
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> s3n:// URIs do not work in kmeans because a couple of calls use FileSystem.get(conf) with no URI, which returns the default filesystem instead of the one the path refers to.
