You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Ted Dunning (JIRA)" <ji...@apache.org> on 2012/09/04 03:10:07 UTC
[jira] [Commented] (MAHOUT-1059) New matrix extensions
[ https://issues.apache.org/jira/browse/MAHOUT-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13447442#comment-13447442 ]
Ted Dunning commented on MAHOUT-1059:
-------------------------------------
I have added a fix/enhancement to the basic matrix to make handling of size caching more generic. The basic issue was that caching the squared length of a vector makes distance calculations with L_2 faster for sparse vectors. This should apply (or not) to all view-like vectors like DelegatingVector. So now it does.
> New matrix extensions
> ---------------------
>
> Key: MAHOUT-1059
> URL: https://issues.apache.org/jira/browse/MAHOUT-1059
> Project: Mahout
> Issue Type: Improvement
> Components: Math
> Reporter: Ted Dunning
> Fix For: 0.8
>
> Attachments: 0001-MAHOUT-1059-Added-Centroid-WeightedVector-Delegating.patch, 0001-MAHOUT-1059-Added-Centroid-WeightedVector-Delegating.patch, 0001-MAHOUT-1059-Added-Centroid-WeightedVector-Delegating.patch, 0001-MAHOUT-1059-Added-Centroid-WeightedVector-Delegating.patch, 0002-MAHOUT-1059-Stylistic-cleanups.patch, 0002-MAHOUT-1059-Stylistic-cleanups.patch, 0002-MAHOUT-1059-Stylistic-cleanups.patch, 0003-MAHOUT-1059-Add-generic-vector-test.patch, 0003-MAHOUT-1059-Add-generic-vector-test.patch, 0004-MAHOUT-1059-Indentation.patch, 0004-MAHOUT-1059-Indentation.patch, 0005-MAHOUT-1059-Abstract-the-idea-of-a-cached-length.patch, 0006-MAHOUT-1059-Additional-test-for-weighted-vectors.patch, DelegatingVectorTest.java
>
>
> The upcoming clustering needs several capabilities to support different operations. These include some matrix extensions for adding behaviors to different kinds of matrices. Also there is a file based matrix that uses mmap to access a file as if it were a matrix in shared memory. Since this is off-heap and shared between processes, it can seriously help some programs.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira