Posted to dev@mahout.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2010/09/24 14:34:33 UTC
[jira] Updated: (MAHOUT-504) Kmeans clustering error
[ https://issues.apache.org/jira/browse/MAHOUT-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen updated MAHOUT-504:
-----------------------------
Assignee: Robin Anil
Fix Version/s: 0.5
> Kmeans clustering error
> -----------------------
>
> Key: MAHOUT-504
> URL: https://issues.apache.org/jira/browse/MAHOUT-504
> Project: Mahout
> Issue Type: Bug
> Reporter: Zhen Guo
> Assignee: Robin Anil
> Fix For: 0.5
>
>
> I tried the Kmeans algorithm on the Synthetic Control data and got the following error. The Canopy algorithm runs fine on the same data. The error comes from the Mapper. I am using trunk.
> 10/09/20 19:40:06 INFO mapred.JobClient: Task Id : attempt_201008261432_1324_m_000000_0, Status : FAILED
> java.lang.IllegalStateException: Cluster is empty!
> at org.apache.mahout.clustering.kmeans.KMeansClusterMapper.setup(KMeansClusterMapper.java:57)
> at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
> at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:583)
> at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
> at org.apache.hadoop.mapred.Child.main(Child.java:170)
Re: [jira] Updated: (MAHOUT-504) Kmeans clustering error
Posted by Jeff Eastman <jd...@windwardsolutions.com>.
My bad. The example should not even take a -c parameter, because it uses
Canopy to populate the initial clusters, which go into a default
directory. I will fix this ASAP.
On 9/24/10 10:05 AM, Jeff Eastman wrote:
> This error was likely caused by an incorrect -c parameter. The error
> message was misleading. I committed a better message earlier this
> week. Synthetic control works reliably with k-Means when the arguments
> are given correctly. I think this can be closed.
Re: [jira] Updated: (MAHOUT-504) Kmeans clustering error
Posted by Jeff Eastman <jd...@windwardsolutions.com>.
This error was likely caused by an incorrect -c parameter. The error
message was misleading. I committed a better message earlier this week.
Synthetic control works reliably with k-Means when the arguments are
given correctly. I think this can be closed.
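For illustration only, here is a rough sketch of the kind of check that
gives a less misleading message when the initial clusters directory turns
out to be empty. This is not the code that was committed to Mahout. The
class name InitialClustersCheck, the method requireNonEmpty, and the path
"output/clusters-0" are all made up for this example.

```java
import java.util.Collections;
import java.util.List;

public class InitialClustersCheck {

    // Hypothetical helper, not Mahout's actual code. It fails fast with a
    // message that names the path that was searched and the option that
    // usually controls it, instead of a bare "Cluster is empty!".
    public static <T> List<T> requireNonEmpty(List<T> clusters, String clustersPath) {
        if (clusters == null || clusters.isEmpty()) {
            throw new IllegalStateException(
                "No initial clusters found in '" + clustersPath
                + "'. Check that the -c parameter points at a directory"
                + " that contains the initial clusters.");
        }
        return clusters;
    }

    public static void main(String[] args) {
        try {
            // "output/clusters-0" is an assumed path used only for this demo.
            requireNonEmpty(Collections.<String>emptyList(), "output/clusters-0");
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

With a message like this, a wrong -c value shows up as a path problem
rather than looking like a bug in the mapper.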