You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Dave DeBarr (JIRA)" <ji...@apache.org> on 2013/11/06 00:00:18 UTC
[jira] [Updated] (MAHOUT-1351) Adding DenseVector support to
AbstractCluster
[ https://issues.apache.org/jira/browse/MAHOUT-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dave DeBarr updated MAHOUT-1351:
--------------------------------
Attachment: MAHOUT-1351.patch
An "svn diff" to resolve issue MAHOUT-1351
> Adding DenseVector support to AbstractCluster
> ---------------------------------------------
>
> Key: MAHOUT-1351
> URL: https://issues.apache.org/jira/browse/MAHOUT-1351
> Project: Mahout
> Issue Type: Improvement
> Components: Clustering
> Affects Versions: 0.8
> Reporter: Dave DeBarr
> Priority: Minor
> Labels: performance
> Fix For: 0.9
>
> Attachments: MAHOUT-1351.patch
>
> Original Estimate: 1h
> Remaining Estimate: 1h
>
> This improvement reduces runtime by 80% when performing k-means clustering of Scale Invariant Feature Transform (SIFT) descriptors to derive visual words for computer vision. Unlike sparse document vectors, SIFT descriptors are dense. This improvement involves updating the org.apache.mahout.clustering.AbstractCluster(Vector point, int id2) constructor to use "point.clone()" instead of "new RandomAccessSparseVector(point)" for creating the centroid. Also added testKMeansSeqJobDenseVector() test for DenseVector processing.
--
This message was sent by Atlassian JIRA
(v6.1#6144)