You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2010/09/24 14:34:33 UTC

[jira] Updated: (MAHOUT-504) Kmeans clustering error

     [ https://issues.apache.org/jira/browse/MAHOUT-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated MAHOUT-504:
-----------------------------

         Assignee: Robin Anil
    Fix Version/s: 0.5

> Kmeans clustering error
> -----------------------
>
>                 Key: MAHOUT-504
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-504
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Zhen Guo
>            Assignee: Robin Anil
>             Fix For: 0.5
>
>
> I tried the Kmeans algorithm on the Synthetic Control data. The following error appears. I tried the Canopy algorithm, it is fine. This error is from Mapper. I am using Trunk.
> 10/09/20 19:40:06 INFO mapred.JobClient: Task Id : attempt_201008261432_1324_m_000000_0, Status : FAILED
> java.lang.IllegalStateException: Cluster is empty!
> 	at org.apache.mahout.clustering.kmeans.KMeansClusterMapper.setup(KMeansClusterMapper.java:57)
> 	at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
> 	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:583)
> 	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
> 	at org.apache.hadoop.mapred.Child.main(Child.java:170)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Updated: (MAHOUT-504) Kmeans clustering error

Posted by Jeff Eastman <jd...@windwardsolutions.com>.
  My bad. The example should not even have a -c parameter as it uses 
Canopy to populate the initial clusters and they go into a default 
directory. I will fix asap.

On 9/24/10 10:05 AM, Jeff Eastman wrote:
>  This error was likely caused by incorrect -c parameter. The error 
> message was misleading. I committed a better message earlier this 
> week. Synthetic control works reliably with k-Means when the arguments 
> are given correctly. I think this can be closed.
>
> On 9/24/10 8:34 AM, Sean Owen (JIRA) wrote:
>>       [ 
>> https://issues.apache.org/jira/browse/MAHOUT-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel 
>> ]
>>
>> Sean Owen updated MAHOUT-504:
>> -----------------------------
>>
>>           Assignee: Robin Anil
>>      Fix Version/s: 0.5
>>
>>> Kmeans clustering error
>>> -----------------------
>>>
>>>                  Key: MAHOUT-504
>>>                  URL: https://issues.apache.org/jira/browse/MAHOUT-504
>>>              Project: Mahout
>>>           Issue Type: Bug
>>>             Reporter: Zhen Guo
>>>             Assignee: Robin Anil
>>>              Fix For: 0.5
>>>
>>>
>>> I tried the Kmeans algorithm on the Synthetic Control data. The 
>>> following error appears. I tried the Canopy algorithm, it is fine. 
>>> This error is from Mapper. I am using Trunk.
>>> 10/09/20 19:40:06 INFO mapred.JobClient: Task Id : 
>>> attempt_201008261432_1324_m_000000_0, Status : FAILED
>>> java.lang.IllegalStateException: Cluster is empty!
>>>     at 
>>> org.apache.mahout.clustering.kmeans.KMeansClusterMapper.setup(KMeansClusterMapper.java:57)
>>>     at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
>>>     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:583)
>>>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
>>>     at org.apache.hadoop.mapred.Child.main(Child.java:170)
>


Re: [jira] Updated: (MAHOUT-504) Kmeans clustering error

Posted by Jeff Eastman <jd...@windwardsolutions.com>.
  This error was likely caused by incorrect -c parameter. The error 
message was misleading. I committed a better message earlier this week. 
Synthetic control works reliably with k-Means when the arguments are 
given correctly. I think this can be closed.

On 9/24/10 8:34 AM, Sean Owen (JIRA) wrote:
>       [ https://issues.apache.org/jira/browse/MAHOUT-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
>
> Sean Owen updated MAHOUT-504:
> -----------------------------
>
>           Assignee: Robin Anil
>      Fix Version/s: 0.5
>
>> Kmeans clustering error
>> -----------------------
>>
>>                  Key: MAHOUT-504
>>                  URL: https://issues.apache.org/jira/browse/MAHOUT-504
>>              Project: Mahout
>>           Issue Type: Bug
>>             Reporter: Zhen Guo
>>             Assignee: Robin Anil
>>              Fix For: 0.5
>>
>>
>> I tried the Kmeans algorithm on the Synthetic Control data. The following error appears. I tried the Canopy algorithm, it is fine. This error is from Mapper. I am using Trunk.
>> 10/09/20 19:40:06 INFO mapred.JobClient: Task Id : attempt_201008261432_1324_m_000000_0, Status : FAILED
>> java.lang.IllegalStateException: Cluster is empty!
>> 	at org.apache.mahout.clustering.kmeans.KMeansClusterMapper.setup(KMeansClusterMapper.java:57)
>> 	at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
>> 	at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:583)
>> 	at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
>> 	at org.apache.hadoop.mapred.Child.main(Child.java:170)