You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@mahout.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2010/01/03 16:41:55 UTC

[jira] Resolved: (MAHOUT-71) Dataset to Matrix Reader

     [ https://issues.apache.org/jira/browse/MAHOUT-71?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen resolved MAHOUT-71.
-----------------------------

       Resolution: Later
    Fix Version/s:     (was: 0.3)

Looks like this is inactive now?

> Dataset to Matrix Reader
> ------------------------
>
>                 Key: MAHOUT-71
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-71
>             Project: Mahout
>          Issue Type: New Feature
>            Reporter: Deneche A. Hakim
>            Assignee: Deneche A. Hakim
>            Priority: Minor
>
> This component should allow the input datasets to be read as Matrix Rows.
> A Map-Reduce Algorithm should handle any dataset in a matrix format, where the collumns are the attributes (and one of them is the Label) and the rows are the datas.
> Working with Hadoop, we'll need to pass the dataset in the mapper's input, so it must be a file (or many files). We'll then need a custom InputFormat to feed the mappers with the data, and here comes the lovely-named "row-wise splitting matrix input format".
> Now we want to be able to work with any given dataset file format (including the ARFF and my custom format), and thus the InputFormat needs a decoder that converts the dataset lines into matrix rows.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: [jira] Resolved: (MAHOUT-71) Dataset to Matrix Reader

Posted by deneche abdelhakim <ad...@apache.org>.

yep :p

On Sun, Jan 3, 2010 at 4:41 PM, Sean Owen (JIRA) <ji...@apache.org> wrote:
>
>     [ https://issues.apache.org/jira/browse/MAHOUT-71?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
>
> Sean Owen resolved MAHOUT-71.
> -----------------------------
>
>       Resolution: Later
>    Fix Version/s:     (was: 0.3)
>
> Looks like this is inactive now?
>
>> Dataset to Matrix Reader
>> ------------------------
>>
>>                 Key: MAHOUT-71
>>                 URL: https://issues.apache.org/jira/browse/MAHOUT-71
>>             Project: Mahout
>>          Issue Type: New Feature
>>            Reporter: Deneche A. Hakim
>>            Assignee: Deneche A. Hakim
>>            Priority: Minor
>>
>> This component should allow the input datasets to be read as Matrix Rows.
>> A Map-Reduce Algorithm should handle any dataset in a matrix format, where the collumns are the attributes (and one of them is the Label) and the rows are the datas.
>> Working with Hadoop, we'll need to pass the dataset in the mapper's input, so it must be a file (or many files). We'll then need a custom InputFormat to feed the mappers with the data, and here comes the lovely-named "row-wise splitting matrix input format".
>> Now we want to be able to work with any given dataset file format (including the ARFF and my custom format), and thus the InputFormat needs a decoder that converts the dataset lines into matrix rows.
>
> --
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>
>