You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Dave Byrne (JIRA)" <ji...@apache.org> on 2012/09/21 19:02:08 UTC
[jira] [Created] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has
no effect
Dave Byrne created MAHOUT-1072:
----------------------------------
Summary: FuzzyKmeansDriver emitMostLikely has no effect
Key: MAHOUT-1072
URL: https://issues.apache.org/jira/browse/MAHOUT-1072
Project: Mahout
Issue Type: Bug
Affects Versions: 0.8
Reporter: Dave Byrne
Attachments: fkmeans_emit.patch
FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Assigned] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely
has no effect
Posted by "Paritosh Ranjan (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Paritosh Ranjan reassigned MAHOUT-1072:
---------------------------------------
Assignee: Paritosh Ranjan
> FuzzyKmeansDriver emitMostLikely has no effect
> ----------------------------------------------
>
> Key: MAHOUT-1072
> URL: https://issues.apache.org/jira/browse/MAHOUT-1072
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Dave Byrne
> Assignee: Paritosh Ranjan
> Attachments: fkmeans_emit.patch
>
>
> FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has
no effect
Posted by "Dave Byrne (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dave Byrne updated MAHOUT-1072:
-------------------------------
Attachment: fkmeans_emit.patch
patch
> FuzzyKmeansDriver emitMostLikely has no effect
> ----------------------------------------------
>
> Key: MAHOUT-1072
> URL: https://issues.apache.org/jira/browse/MAHOUT-1072
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Dave Byrne
> Attachments: fkmeans_emit.patch
>
>
> FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has
no effect
Posted by "Paritosh Ranjan (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Paritosh Ranjan updated MAHOUT-1072:
------------------------------------
Resolution: Fixed
Fix Version/s: 0.8
Status: Resolved (was: Patch Available)
Patch committed.
> FuzzyKmeansDriver emitMostLikely has no effect
> ----------------------------------------------
>
> Key: MAHOUT-1072
> URL: https://issues.apache.org/jira/browse/MAHOUT-1072
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Dave Byrne
> Assignee: Paritosh Ranjan
> Fix For: 0.8
>
> Attachments: fkmeans_emit.patch
>
>
> FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely
has no effect
Posted by "Paritosh Ranjan (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461044#comment-13461044 ]
Paritosh Ranjan commented on MAHOUT-1072:
-----------------------------------------
Now there are two parameters to control outliers, one is threshold and the other one is emitMostLikely.
If possible, I would have liked to get rid of emitMostLikely in future. So, the default should have been false in that case.
Still, since we have not removed emitMostLikely yet, so, it would be good to just pass the value provided by the user for cluster classification.
The current build is not stable, I will commit it once the build becomes stable.
Thanks.
> FuzzyKmeansDriver emitMostLikely has no effect
> ----------------------------------------------
>
> Key: MAHOUT-1072
> URL: https://issues.apache.org/jira/browse/MAHOUT-1072
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Dave Byrne
> Attachments: fkmeans_emit.patch
>
>
> FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely
has no effect
Posted by "Jeff Eastman (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461460#comment-13461460 ]
Jeff Eastman commented on MAHOUT-1072:
--------------------------------------
Seems to me the outlier threshold and emitMostLikely do different things:
- Outlier threshold emits only points with pdf>threshold. This is relevant for all cluster types.
- EmitMostLikely emits only the point with max(pdf) and is only relevant for Dirichlet & FuzzyK that emit all points subject to the threshold above.
I'm not convinced removing EmitMostLikely makes sense but am not hung up and would like to hear more discussion from our user base.
> FuzzyKmeansDriver emitMostLikely has no effect
> ----------------------------------------------
>
> Key: MAHOUT-1072
> URL: https://issues.apache.org/jira/browse/MAHOUT-1072
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Dave Byrne
> Assignee: Paritosh Ranjan
> Attachments: fkmeans_emit.patch
>
>
> FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely
has no effect
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13471476#comment-13471476 ]
Hudson commented on MAHOUT-1072:
--------------------------------
Integrated in Mahout-Quality #1690 (See [https://builds.apache.org/job/Mahout-Quality/1690/])
MAHOUT-1072, Passing emitMostLikely to ClusterClassificationDriver now (Revision 1395479)
Result = SUCCESS
pranjan : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1395479
Files :
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/fuzzykmeans/FuzzyKMeansDriver.java
> FuzzyKmeansDriver emitMostLikely has no effect
> ----------------------------------------------
>
> Key: MAHOUT-1072
> URL: https://issues.apache.org/jira/browse/MAHOUT-1072
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Dave Byrne
> Assignee: Paritosh Ranjan
> Attachments: fkmeans_emit.patch
>
>
> FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has
no effect
Posted by "Dave Byrne (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dave Byrne updated MAHOUT-1072:
-------------------------------
Status: Patch Available (was: Open)
> FuzzyKmeansDriver emitMostLikely has no effect
> ----------------------------------------------
>
> Key: MAHOUT-1072
> URL: https://issues.apache.org/jira/browse/MAHOUT-1072
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.8
> Reporter: Dave Byrne
> Attachments: fkmeans_emit.patch
>
>
> FuzzyKmeansDriver.clusterData() has emitMostLikely hardcoded to true and ignores argument passed to function
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira