Posted to dev@mahout.apache.org by "Robin Anil (JIRA)" <ji...@apache.org> on 2013/06/02 17:06:20 UTC

[jira] [Updated] (MAHOUT-1163) Make random forest classifier meta-data file human readable

     [ https://issues.apache.org/jira/browse/MAHOUT-1163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Anil updated MAHOUT-1163:
-------------------------------

    Assignee: Ted Dunning
    
> Make random forest classifier meta-data file human readable
> -----------------------------------------------------------
>
>                 Key: MAHOUT-1163
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1163
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.7
>            Reporter: Marty Kube
>            Assignee: Ted Dunning
>             Fix For: 0.8
>
>         Attachments: MAHOUT-1163.patch
>
>
> The RF classifier has a Describe utility that generates a description of a data set (the number of attributes, their types, enumerated values, etc.) and writes this meta-data to a file for later use during training or testing.
> The file format is binary. That means the only way to generate the file is with the Describe utility, and the file is hard to modify. If the format were human readable, it would be possible to modify or generate the meta-data by hand.
> A human-readable format would also make it easier to support standard formats such as ARFF.
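
To illustrate the idea of a human-readable description file, here is a minimal sketch in Python. It is not the attached patch and not Mahout's actual format: the JSON layout, the field names ("type", "values", "label"), and the functions describe, to_text and from_text are all assumptions made up for this example.

```python
import json

# Hypothetical attribute kinds for this sketch; not Mahout's own names.
KINDS = {"numerical", "categorical"}


def describe(rows, label_index):
    """Infer a per-column description from rows of string values.

    A column is "numerical" if every value parses as a float,
    otherwise "categorical" with its sorted set of distinct values.
    """
    attrs = []
    for i, col in enumerate(zip(*rows)):
        try:
            for value in col:
                float(value)
            attr = {"type": "numerical"}
        except ValueError:
            attr = {"type": "categorical", "values": sorted(set(col))}
        attr["label"] = (i == label_index)
        attrs.append(attr)
    return attrs


def to_text(attrs):
    """Serialize the description as indented JSON that can be edited by hand."""
    return json.dumps(attrs, indent=2)


def from_text(text):
    """Parse a description, rejecting attribute kinds this sketch does not know."""
    attrs = json.loads(text)
    for attr in attrs:
        if attr.get("type") not in KINDS:
            raise ValueError("unknown attribute type: %r" % attr.get("type"))
    return attrs


if __name__ == "__main__":
    sample = [["5.1", "red", "yes"], ["4.9", "blue", "no"]]
    print(to_text(describe(sample, label_index=2)))
```

Because the output is plain text, a description like this could be written or corrected in any editor (for example, adding an enumerated value that does not occur in the sample data) and read back with from_text, which is the kind of workflow the binary format prevents.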
