Posted to dev@mahout.apache.org by "Dave Byrne (JIRA)" <ji...@apache.org> on 2012/09/21 19:02:08 UTC

[jira] [Created] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Dave Byrne created MAHOUT-1072:
----------------------------------

             Summary: FuzzyKmeansDriver emitMostLikely has no effect
                 Key: MAHOUT-1072
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1072
             Project: Mahout
          Issue Type: Bug
    Affects Versions: 0.8
            Reporter: Dave Byrne
         Attachments: fkmeans_emit.patch

FuzzyKMeansDriver.clusterData() hardcodes emitMostLikely to true and ignores the argument passed to the function.
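
For illustration only, the shape of the reported bug can be sketched as below. This is not the Mahout source: the class EmitFlagSketch, the classify() stand-in, and both clusterData variants are hypothetical names, and the real method takes more parameters. It only shows how a caller's flag is lost when a literal is forwarded in its place.

```java
// Hypothetical, self-contained sketch; NOT the actual Mahout code.
class EmitFlagSketch {

    /** Stand-in for the downstream classification step: reports the flag it received. */
    static boolean classify(double threshold, boolean emitMostLikely) {
        return emitMostLikely;
    }

    /** Buggy shape: the parameter is accepted, but a literal true is forwarded. */
    static boolean clusterDataBuggy(double threshold, boolean emitMostLikely) {
        return classify(threshold, true);
    }

    /** Fixed shape: the caller's value is forwarded unchanged. */
    static boolean clusterDataFixed(double threshold, boolean emitMostLikely) {
        return classify(threshold, emitMostLikely);
    }

    public static void main(String[] args) {
        // The caller asks for emitMostLikely = false in both cases.
        System.out.println("buggy forwards: " + clusterDataBuggy(0.0, false));
        System.out.println("fixed forwards: " + clusterDataFixed(0.0, false));
    }
}
```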

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Assigned] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Posted by "Paritosh Ranjan (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paritosh Ranjan reassigned MAHOUT-1072:
---------------------------------------

    Assignee: Paritosh Ranjan
    

[jira] [Updated] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Posted by "Dave Byrne (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dave Byrne updated MAHOUT-1072:
-------------------------------

    Attachment: fkmeans_emit.patch

patch
                

[jira] [Updated] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Posted by "Paritosh Ranjan (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paritosh Ranjan updated MAHOUT-1072:
------------------------------------

       Resolution: Fixed
    Fix Version/s: 0.8
           Status: Resolved  (was: Patch Available)

Patch committed.
                

[jira] [Commented] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Posted by "Paritosh Ranjan (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461044#comment-13461044 ] 

Paritosh Ranjan commented on MAHOUT-1072:
-----------------------------------------

There are now two parameters that control outliers: threshold and emitMostLikely.

If possible, I would like to remove emitMostLikely in the future, in which case the default should have been false.

Still, since emitMostLikely has not been removed yet, it makes sense to pass the user-provided value through to cluster classification.

The current build is not stable; I will commit the patch once the build becomes stable.

Thanks.
                

[jira] [Commented] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Posted by "Jeff Eastman (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13461460#comment-13461460 ] 

Jeff Eastman commented on MAHOUT-1072:
--------------------------------------

Seems to me the outlier threshold and emitMostLikely do different things:
- Outlier threshold emits only points with pdf>threshold. This is relevant for all cluster types.
- EmitMostLikely emits only the point with max(pdf) and is only relevant for Dirichlet & FuzzyK that emit all points subject to the threshold above.

I'm not convinced removing EmitMostLikely makes sense but am not hung up and would like to hear more discussion from our user base.
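
A minimal sketch of the distinction drawn above, under stated assumptions: each point has one pdf value per cluster, the threshold is a strict "pdf > threshold" test as worded in the comment, and the threshold also applies to the most likely cluster. The class EmitPolicySketch and the method clustersToEmit are hypothetical names, not Mahout API.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of the two controls; NOT the Mahout implementation.
class EmitPolicySketch {

    /**
     * Returns the indices of the clusters a single point would be emitted to.
     * pdf[i] is the point's pdf value for cluster i (assumed layout).
     */
    static List<Integer> clustersToEmit(double[] pdf, double threshold, boolean emitMostLikely) {
        List<Integer> emitted = new ArrayList<>();
        if (pdf.length == 0) {
            return emitted;
        }
        if (emitMostLikely) {
            // Keep only the cluster with max(pdf), still subject to the threshold (assumption).
            int best = 0;
            for (int i = 1; i < pdf.length; i++) {
                if (pdf[i] > pdf[best]) {
                    best = i;
                }
            }
            if (pdf[best] > threshold) {
                emitted.add(best);
            }
        } else {
            // Keep every cluster whose pdf exceeds the threshold.
            for (int i = 0; i < pdf.length; i++) {
                if (pdf[i] > threshold) {
                    emitted.add(i);
                }
            }
        }
        return emitted;
    }

    public static void main(String[] args) {
        double[] pdf = {0.1, 0.6, 0.3};
        System.out.println(clustersToEmit(pdf, 0.2, true));
        System.out.println(clustersToEmit(pdf, 0.2, false));
    }
}
```

Under these assumptions the threshold decides whether a point is an outlier at all, while emitMostLikely decides whether a non-outlier point is written once or once per qualifying cluster.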
                

[jira] [Commented] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13471476#comment-13471476 ] 

Hudson commented on MAHOUT-1072:
--------------------------------

Integrated in Mahout-Quality #1690 (See [https://builds.apache.org/job/Mahout-Quality/1690/])
    MAHOUT-1072, Passing emitMostLikely to ClusterClassificationDriver now (Revision 1395479)

     Result = SUCCESS
pranjan : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1395479
Files : 
* /mahout/trunk/core/src/main/java/org/apache/mahout/clustering/fuzzykmeans/FuzzyKMeansDriver.java

                

[jira] [Updated] (MAHOUT-1072) FuzzyKmeansDriver emitMostLikely has no effect

Posted by "Dave Byrne (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dave Byrne updated MAHOUT-1072:
-------------------------------

    Status: Patch Available  (was: Open)
    