You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by Raghuveer <al...@yahoo.com.INVALID> on 2015/03/09 11:10:44 UTC

kmeans is throwing IllegalArgumentException

Hi All,
I am trying to run the following command:
./mahout kmeans -i hdfs://master:54310/user/netlog/upload/output4/tfidf-vectors/part-r-00000 -o  hdfs://master:54310//user/netlog/upload/output4/tfidf-vectors-kmeans-clusters-raghuveer -c  hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids -dm org.apache.mahout.common.distance.CosineDistanceMeasure -x 5 -ow -cl -k 25
getting the following exception:
IllegalStateException: No input clusters found in hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Check your -c argument.
kindly suggest how i can get ride of this exception. 

Note : i see a vector in part-r-00000 but why it says "no input" is not clear to me.
regards,.
 

Re: kmeans is throwing IllegalArgumentException

Posted by Suneel Marthi <su...@gmail.com>.
Yes, that's correct

On Mon, Mar 9, 2015 at 1:53 PM, Pat Ferrel <pa...@occamsmachete.com> wrote:

> I think you don’t want to supply a -c argument unless you have seed
> vectors in
> /user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Just
> leave it out and Mahout will use random seeds.
>
> BTW you’ll get help faster if you post to the user list
> On Mar 9, 2015, at 3:10 AM, Raghuveer <al...@yahoo.com.INVALID>
> wrote:
>
> Hi All,
> I am trying to run the following command:
> ./mahout kmeans -i
> hdfs://master:54310/user/netlog/upload/output4/tfidf-vectors/part-r-00000
> -o
> hdfs://master:54310//user/netlog/upload/output4/tfidf-vectors-kmeans-clusters-raghuveer
> -c
> hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids
> -dm org.apache.mahout.common.distance.CosineDistanceMeasure -x 5 -ow -cl -k
> 25
> getting the following exception:
> IllegalStateException: No input clusters found in
> hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed.
> Check your -c argument.
> kindly suggest how i can get ride of this exception.
>
> Note : i see a vector in part-r-00000 but why it says "no input" is not
> clear to me.
> regards,.
>
>
>

Re: kmeans is throwing IllegalArgumentException

Posted by Pat Ferrel <pa...@occamsmachete.com>.
I think you don’t want to supply a -c argument unless you have seed vectors in /user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Just leave it out and Mahout will use random seeds.
 
BTW you’ll get help faster if you post to the user list
On Mar 9, 2015, at 3:10 AM, Raghuveer <al...@yahoo.com.INVALID> wrote:

Hi All,
I am trying to run the following command:
./mahout kmeans -i hdfs://master:54310/user/netlog/upload/output4/tfidf-vectors/part-r-00000 -o  hdfs://master:54310//user/netlog/upload/output4/tfidf-vectors-kmeans-clusters-raghuveer -c  hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids -dm org.apache.mahout.common.distance.CosineDistanceMeasure -x 5 -ow -cl -k 25
getting the following exception:
IllegalStateException: No input clusters found in hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Check your -c argument.
kindly suggest how i can get ride of this exception. 

Note : i see a vector in part-r-00000 but why it says "no input" is not clear to me.
regards,.



Re: kmeans is throwing IllegalArgumentException

Posted by Pat Ferrel <pa...@occamsmachete.com>.
I think you don’t want to supply a -c argument unless you have seed vectors in /user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Just leave it out and Mahout will use random seeds.
 
BTW you’ll get help faster if you post to the user list
On Mar 9, 2015, at 3:10 AM, Raghuveer <al...@yahoo.com.INVALID> wrote:

Hi All,
I am trying to run the following command:
./mahout kmeans -i hdfs://master:54310/user/netlog/upload/output4/tfidf-vectors/part-r-00000 -o  hdfs://master:54310//user/netlog/upload/output4/tfidf-vectors-kmeans-clusters-raghuveer -c  hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids -dm org.apache.mahout.common.distance.CosineDistanceMeasure -x 5 -ow -cl -k 25
getting the following exception:
IllegalStateException: No input clusters found in hdfs://master:54310/user/netlog/upload/output4/uscensus-kmeans-centroids/part-randomSeed. Check your -c argument.
kindly suggest how i can get ride of this exception. 

Note : i see a vector in part-r-00000 but why it says "no input" is not clear to me.
regards,.