Posted to dev@mahout.apache.org by "Lance Norskog (Issue Comment Edited) (JIRA)" <ji...@apache.org> on 2011/12/28 10:01:32 UTC

[jira] [Issue Comment Edited] (MAHOUT-884) Matrix Concatenate utility

    [ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13176527#comment-13176527 ] 

Lance Norskog edited comment on MAHOUT-884 at 12/28/11 9:01 AM:
----------------------------------------------------------------

Completely redone. It is now a Hadoop job that uses Jake's trick of caching the row widths. It supports any Writable as the key class. Input vectors can be spread across multiple files and can arrive out of order. Named vectors are supported.

Minuses:
* Only concatenates two matrices
* Hard-coded to SequentialAccessSparseVector with no compression
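
For readers who have not opened the patch, here is a rough sketch of the core idea in plain Java. It is not the code in MAHOUT-884.patch. It assumes the concatenation is side by side (each output row is the row from the first matrix followed by the row from the second), that the width of the first matrix is already known (the "cached row width"), and that a row missing from either input is treated as empty. The class ConcatSketch, its methods, and the use of a TreeMap as a stand-in for a sparse vector are all hypothetical; no Hadoop or Mahout classes are used.

```java
import java.util.Map;
import java.util.TreeMap;

public class ConcatSketch {

    // Hypothetical stand-in for a sparse row: column index -> value.
    // Either input may be null when that matrix skips the row.
    // Columns from b are shifted right by widthA, the known width of the first matrix.
    static TreeMap<Integer, Double> concatRow(Map<Integer, Double> a, int widthA,
                                              Map<Integer, Double> b) {
        TreeMap<Integer, Double> out = new TreeMap<>();
        if (a != null) {
            out.putAll(a);
        }
        if (b != null) {
            for (Map.Entry<Integer, Double> e : b.entrySet()) {
                out.put(e.getKey() + widthA, e.getValue());
            }
        }
        return out;
    }

    // Joins two matrices by row key. The key type is generic, mirroring the
    // "any Writable as the key class" point; input order does not matter
    // because rows are matched by key, not by position.
    static <K extends Comparable<K>> TreeMap<K, TreeMap<Integer, Double>> concat(
            Map<K, Map<Integer, Double>> a, int widthA,
            Map<K, Map<Integer, Double>> b) {
        TreeMap<K, TreeMap<Integer, Double>> out = new TreeMap<>();
        // Rows present in a (with or without a partner in b).
        for (K key : a.keySet()) {
            out.put(key, concatRow(a.get(key), widthA, b.get(key)));
        }
        // Rows present only in b.
        for (K key : b.keySet()) {
            if (!out.containsKey(key)) {
                out.put(key, concatRow(null, widthA, b.get(key)));
            }
        }
        return out;
    }
}
```

In a real job the join by key would be done by the shuffle, with each reducer seeing the vectors that share a key; the in-memory maps above only illustrate the column-offset step and the handling of skipped rows.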

                
      was (Author: lancenorskog):
    Completely redone. Now a Hadoop job which uses Jake's trick of caching the row widths.
                  
> Matrix Concatenate utility
> --------------------------
>
>                 Key: MAHOUT-884
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-884
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Integration
>            Reporter: Lance Norskog
>            Priority: Minor
>         Attachments: MAHOUT-884.patch, MAHOUT-884.patch, MAHOUT-884.patch
>
>
> Utility to concatenate matrices stored as SequenceFiles of vectors.
> Each pair in the SequenceFile is the IntWritable row number and a VectorWritable.
> The input and output files may skip rows. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira