You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Ted Dunning (JIRA)" <ji...@apache.org> on 2010/09/24 18:49:35 UTC

[jira] Commented: (MAHOUT-286) Need to be able to run classifiers from non-text input (such as ARFF data)

    [ https://issues.apache.org/jira/browse/MAHOUT-286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12914534#action_12914534 ] 

Ted Dunning commented on MAHOUT-286:
------------------------------------

Aside from the arff part, this has been happening off topic in the SGD side of the world.

> Need to be able to run classifiers from non-text input (such as ARFF data)
> --------------------------------------------------------------------------
>
>                 Key: MAHOUT-286
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-286
>             Project: Mahout
>          Issue Type: Improvement
>    Affects Versions: 0.3
>            Reporter: Ted Dunning
>            Priority: Minor
>             Fix For: 0.5
>
>         Attachments: data.arff, data.training.arff, mahout.log, run.sh, weka.log
>
>
> Martin Haeger wrote this:
> {quote}
> We're experimenting a bit with Weka and Mahout. Our input data is a
> relation in ARFF format (see attached data.training.arff), and we'd
> like to classify it using Mahout. However, it seems (to us, at first)
> that the Mahout classifier.bayes.interfaces.Algorithm interface is
> centered around documents of text, and not general attribute data.
> Thus, running the classifier causes our ARFF data to be interpreted as
> a document of words, with not very useful results (see attached
> mahout.log).
> With Weka, we're able to get the results we want (see attached weka.log).
> Any suggestions for how to get this working?
> {quote}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.