You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Robin Swezey (JIRA)" <ji...@apache.org> on 2010/11/14 15:29:14 UTC

[jira] Created: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
-----------------------------------------------------------------------------------------------------------

                 Key: MAHOUT-543
                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
             Project: Mahout
          Issue Type: Bug
          Components: Classification
    Affects Versions: 0.4
         Environment: Mac OS X, Hadoop 0.20.2
            Reporter: Robin Swezey
             Fix For: 0.5


This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)

When launching TestClassifier using:

$MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase

The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.

After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Created: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by Ted Dunning <te...@gmail.com>.
Robin,

Since you see the error clearly, can you add a patch to this very nice
report to make sure that a committer knows what you suggest to fix the
problem very precisely?

On Sun, Nov 14, 2010 at 6:29 AM, Robin Swezey (JIRA) <ji...@apache.org>wrote:

> classifier type and data source not taken into account in
> org.apache.mahout.classifier.bayes.TestClassifier
>
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.5
>
>
> This concerns the latest version of Mahout checked out from the SVN repo (I
> believe 0.4)
>
> When launching TestClassifier using:
>
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes
> -ng 1 -source hbase
>
> The output says that the classifier type is Bayes (not Complementary) and
> the data source is HDFS.
>
> After a quick investigation in TestClassifer.java source, there are
> mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of
> dataSourceOpt
>
> --
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>
>

[jira] Updated: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Robin Swezey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Swezey updated MAHOUT-543:
--------------------------------

    Comment: was deleted

(was: Fixes the issue of classifier type and data source command line parameters not taken into account in org.apache.mahout.classifier.bayes.TestClassifier)

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Robin Swezey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Swezey updated MAHOUT-543:
--------------------------------

    Attachment: MAHOUT-543.patch

Joe,

Thank you for the link. I have reuploaded the file with the proper name.

Regards,
Robin

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>         Attachments: MAHOUT-543.patch
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Robin Swezey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Swezey updated MAHOUT-543:
--------------------------------

    Attachment:     (was: TestClassifier-type-datasource.patch)

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>         Attachments: MAHOUT-543.patch
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Joe Prasanna Kumar (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931920#action_12931920 ] 

Joe Prasanna Kumar commented on MAHOUT-543:
-------------------------------------------

Robin,

Good catch. I just verified that this issue occurs. Also using the typeOpt / dataSourceOpt instead of  classifierType  / dataSource fixes it. 
Do you want to upload a patch with the fix you have proposed ?

regards,
Joe.

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.5
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Robin Swezey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Swezey updated MAHOUT-543:
--------------------------------

    Attachment: TestClassifier-type-datasource.patch

Fixes issue of classifier type and data source command line parameters not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>         Attachments: TestClassifier-type-datasource.patch
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Joe Prasanna Kumar (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931944#action_12931944 ] 

Joe Prasanna Kumar commented on MAHOUT-543:
-------------------------------------------

Robin,

Thanks for posting the patch.

A quick thought..
generally we create patch with the issue #. So like MAHOUT-543.patch.
You can refer the guidelines at https://cwiki.apache.org/confluence/display/MAHOUT/How+To+Contribute#HowToContribute-Creatingthepatchfile

regards
Joe.

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>         Attachments: TestClassifier-type-datasource.patch
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Robin Swezey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Swezey updated MAHOUT-543:
--------------------------------

    Comment: was deleted

(was: Fixes issue of classifier type and data source command line parameters not taken into account in org.apache.mahout.classifier.bayes.TestClassifier)

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>         Attachments: TestClassifier-type-datasource.patch
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Robin Swezey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Swezey updated MAHOUT-543:
--------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

Patch posted which fixes the issue

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>         Attachments: TestClassifier-type-datasource.patch
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-543) classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

Posted by "Robin Swezey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Swezey updated MAHOUT-543:
--------------------------------

    Fix Version/s:     (was: 0.5)
                   0.4
           Status: Patch Available  (was: Open)

Fixes the issue of classifier type and data source command line parameters not taken into account in org.apache.mahout.classifier.bayes.TestClassifier

> classifier type and data source not taken into account in org.apache.mahout.classifier.bayes.TestClassifier
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-543
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-543
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Mac OS X, Hadoop 0.20.2
>            Reporter: Robin Swezey
>             Fix For: 0.4
>
>   Original Estimate: 0.08h
>  Remaining Estimate: 0.08h
>
> This concerns the latest version of Mahout checked out from the SVN repo (I believe 0.4)
> When launching TestClassifier using:
> $MAHOUT_HOME/bin/mahout testclassifier -m model -d test-input -type cbayes -ng 1 -source hbase
> The output says that the classifier type is Bayes (not Complementary) and the data source is HDFS.
> After a quick investigation in TestClassifer.java source, there are mistakes on lines: 142, 147, 148
> with classifierType instead of typeOpt, and dataSource instead of dataSourceOpt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.