Posted to dev@mahout.apache.org by "Joe Prasanna Kumar (JIRA)" <ji...@apache.org> on 2010/10/11 01:21:30 UTC

[jira] Commented: (MAHOUT-520) provide build-20news-bayes.sh example (in the same vein as build-reuters.sh)

    [ https://issues.apache.org/jira/browse/MAHOUT-520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12919688#action_12919688 ] 

Joe Prasanna Kumar commented on MAHOUT-520:
-------------------------------------------

Drew,

I was just looking at your patch. A couple of observations:

1. type, ng and source are optional parameters in TrainClassifier, and by default they are set to the values you have in the script.
2. type, ng, source and method are optional parameters in TestClassifier, and they are likewise set to the values you have in the script.
3. After checking whether HADOOP_HOME is set, should we run TrainClassifier and TestClassifier in parallel mode by setting the method parameter to mapreduce? If HADOOP_HOME is not set, we could just invoke TrainClassifier and TestClassifier without setting those parameters, so that they run in sequential mode.
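
To make point 3 concrete, here is a rough sketch of the check I have in mind. The function name classifier_mode_args is made up for illustration, and the "-method mapreduce" spelling is an assumption based on the parameter name above, so please check it against the usage output of the classifier drivers before relying on it.

```shell
#!/bin/sh
# Sketch only: pick the extra classifier arguments based on HADOOP_HOME.
# classifier_mode_args is a hypothetical helper, not part of the patch.
classifier_mode_args() {
  if [ -n "${HADOOP_HOME:-}" ]; then
    # HADOOP_HOME is set: ask for the parallel implementation.
    # ASSUMPTION: the option is spelled "-method mapreduce".
    echo "-method mapreduce"
  else
    # HADOOP_HOME is unset or empty: pass nothing and rely on the defaults.
    echo ""
  fi
}

# The script would append the result to each classifier invocation.
MODE_ARGS=$(classifier_mode_args)
```

With this in place the script would append $MODE_ARGS to the TrainClassifier and TestClassifier command lines, so the rest of the script stays identical in both modes.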

regards
Joe.

> provide build-20news-bayes.sh example (in the same vein as build-reuters.sh)
> ----------------------------------------------------------------------------
>
>                 Key: MAHOUT-520
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-520
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>    Affects Versions: 0.4
>            Reporter: Drew Farris
>            Assignee: Drew Farris
>            Priority: Minor
>         Attachments: MAHOUT-520.patch
>
>
> Provide a build-20news-bayes.sh example (in the same vein as build-reuters.sh) that follows https://cwiki.apache.org/MAHOUT/twenty-newsgroups.html
