Posted to dev@mahout.apache.org by "Suneel Marthi (JIRA)" <ji...@apache.org> on 2014/07/28 13:41:39 UTC
[jira] [Commented] (MAHOUT-1593) cluster-reuters.sh does not work complaining java.lang.IllegalStateException

    [ https://issues.apache.org/jira/browse/MAHOUT-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14076149#comment-14076149 ] 

Suneel Marthi commented on MAHOUT-1593:
---------------------------------------

[~jaehoon13.ko] Mahout 0.9 doesn't support Hadoop 2.x, hence the issues you are seeing. Nevertheless, these need to be fixed in the present trunk for Hadoop 2.x.

Since we are now using Git for source control, could you create a Git pull request instead of submitting a patch?
Also, I am assuming that you are generating the patch against the present trunk and not from the 0.9 codebase, correct?

> cluster-reuters.sh does not work complaining java.lang.IllegalStateException
> ----------------------------------------------------------------------------
>
>                 Key: MAHOUT-1593
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1593
>             Project: Mahout
>          Issue Type: Bug
>          Components: Examples
>    Affects Versions: 0.9
>         Environment: Hadoop version: 2.4.0.2.1.1.0-385
> Git hash: 2b65475c3ab682ebd47cffdc6b502698799cd2c8 (trunk)
>            Reporter: jaehoon ko
>            Priority: Minor
>              Labels: patch
>             Fix For: 1.0
>
>         Attachments: MAHOUT-1593.patch
>
>
> When I choose "kmeans clustering" in cluster-reuters.sh, clusterdump fails with a java.lang.IllegalStateException as follows:
> {code:borderStyle=solid}
> Exception in thread "main" java.lang.IllegalStateException: /tmp/mahout-work-user/reuters-kmeans/clusters-*-final
>         at org.apache.mahout.common.iterator.sequencefile.SequenceFileDirValueIterable.iterator(SequenceFileDirValueIterable.java:78)
>         at org.apache.mahout.clustering.evaluation.ClusterEvaluator.loadClusters(ClusterEvaluator.java:93)
>         at org.apache.mahout.clustering.evaluation.ClusterEvaluator.<init>(ClusterEvaluator.java:81)
>         at org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:208)
>         at org.apache.mahout.utils.clustering.ClusterDumper.run(ClusterDumper.java:157)
>         at org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:101)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
>         at org.apache.hadoop.util.ProgramDriver.run(ProgramDriver.java:145)
>         at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:153)
>         at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
> Caused by: java.io.FileNotFoundException: File /tmp/mahout-work-user/reuters-kmeans/clusters-*-final does not exist.
>         at org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:654)
>         at org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:102)
>         at org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:712)
>         at org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:708)
>         at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
>         at org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:708)
>         at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1483)
>         at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1523)
>         at org.apache.mahout.common.iterator.sequencefile.SequenceFileDirValueIterator.<init>(SequenceFileDirValueIterator.java:70)
>         at org.apache.mahout.common.iterator.sequencefile.SequenceFileDirValueIterable.iterator(SequenceFileDirValueIterable.java:76)
>         ... 18 more
> {code}
> Other clustering options run well.
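
The trace above indicates that the path containing `*` reached `FileSystem.listStatus`, which looked it up as a literal name and reported that it does not exist. The following is a minimal sketch of that difference using only `java.nio.file` on a local POSIX-style file system, not Hadoop's API. The class name, method names, and directory names are hypothetical and chosen only for illustration. Hadoop does offer `FileSystem.globStatus(Path)` for pattern expansion, but whether the attached patch takes that route is not stated here.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.NoSuchFileException;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical illustration: literal directory listing vs. glob expansion.
public class GlobListingSketch {

    // Lists the children of dir, treating dir as a literal path.
    // A '*' in the path is NOT expanded; a missing directory raises NoSuchFileException.
    public static List<String> listLiteral(Path dir) throws IOException {
        List<String> names = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir)) {
            for (Path p : stream) {
                names.add(p.getFileName().toString());
            }
        }
        Collections.sort(names);
        return names;
    }

    // Expands a glob pattern against the entry names directly under parent
    // and returns the matching names in sorted order.
    public static List<String> expandGlob(Path parent, String pattern) throws IOException {
        List<String> names = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(parent, pattern)) {
            for (Path p : stream) {
                names.add(p.getFileName().toString());
            }
        }
        Collections.sort(names);
        return names;
    }

    public static void main(String[] args) throws IOException {
        // Assumed layout, loosely modelled on the k-means output directory in the trace.
        Path work = Files.createTempDirectory("reuters-kmeans");
        Files.createDirectory(work.resolve("clusters-0"));
        Files.createDirectory(work.resolve("clusters-3-final"));

        try {
            // No directory is literally named "clusters-*-final", so this fails.
            listLiteral(work.resolve("clusters-*-final"));
        } catch (NoSuchFileException e) {
            System.out.println("Literal listing failed: " + e.getMessage());
        }

        // Expanding the pattern first finds the real directory.
        System.out.println("Glob matches: " + expandGlob(work, "clusters-*-final"));
    }
}
```

In this sketch the literal lookup fails for the same reason as the `FileNotFoundException` in the trace, while expanding the pattern first yields the actual final-clusters directory.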


