Posted to dev@mahout.apache.org by "Sean Owen (JIRA)" <ji...@apache.org> on 2010/01/03 16:41:55 UTC
[jira] Resolved: (MAHOUT-71) Dataset to Matrix Reader
[ https://issues.apache.org/jira/browse/MAHOUT-71?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen resolved MAHOUT-71.
-----------------------------
Resolution: Later
Fix Version/s: (was: 0.3)
Looks like this is inactive now?
> Dataset to Matrix Reader
> ------------------------
>
> Key: MAHOUT-71
> URL: https://issues.apache.org/jira/browse/MAHOUT-71
> Project: Mahout
> Issue Type: New Feature
> Reporter: Deneche A. Hakim
> Assignee: Deneche A. Hakim
> Priority: Minor
>
> This component should allow the input datasets to be read as Matrix Rows.
> A Map-Reduce Algorithm should handle any dataset in a matrix format, where the columns are the attributes (one of them being the Label) and the rows are the data instances.
> Working with Hadoop, we'll need to pass the dataset in the mapper's input, so it must be a file (or many files). We'll then need a custom InputFormat to feed the mappers with the data, and here comes the lovely-named "row-wise splitting matrix input format".
> Now we want to be able to work with any given dataset file format (including ARFF and my custom format), and thus the InputFormat needs a decoder that converts the dataset lines into matrix rows.
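
To make the decoder idea in the description concrete, here is a minimal sketch in plain Java, with no Hadoop or Mahout dependency. The names RowDecoder, CsvRowDecoder and DecoderSketch are hypothetical and do not come from the issue or from Mahout. The comma-separated, all-numeric line format and the label sitting in the last column are assumptions made only for this illustration; ARFF and the reporter's custom format would each need their own decoder.

```java
import java.util.Arrays;

/** Hypothetical decoder: turns one line of a dataset file into one matrix row. */
interface RowDecoder {
    double[] decode(String line);
}

/** Assumed format: comma-separated numeric attributes, one instance per line. */
class CsvRowDecoder implements RowDecoder {
    @Override
    public double[] decode(String line) {
        String[] tokens = line.trim().split(",");
        double[] row = new double[tokens.length];
        for (int i = 0; i < tokens.length; i++) {
            // Each token becomes one column (attribute) of the row.
            row[i] = Double.parseDouble(tokens[i].trim());
        }
        return row;
    }
}

public class DecoderSketch {
    public static void main(String[] args) {
        RowDecoder decoder = new CsvRowDecoder();
        String[] lines = {"5.1,3.5,1.4,0", "6.2,2.9,4.3,1"};
        int labelIndex = 3; // assumption: the last column holds the label
        for (String line : lines) {
            double[] row = decoder.decode(line);
            System.out.println(Arrays.toString(row) + " label=" + row[labelIndex]);
        }
    }
}
```

In a real job, a custom InputFormat would presumably hand each input line to such a decoder before passing the resulting row to the mapper. That wiring is left out here because the issue does not specify it.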
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
Re: [jira] Resolved: (MAHOUT-71) Dataset to Matrix Reader
Posted by deneche abdelhakim <ad...@apache.org>.
yep :p