Posted to dev@mahout.apache.org by "mansur (JIRA)" <ji...@apache.org> on 2014/01/08 10:34:50 UTC
[jira] [Created] (MAHOUT-1391) Possibility to disable confusion matrix in naive bayes
mansur created MAHOUT-1391:
------------------------------
Summary: Possibility to disable confusion matrix in naive bayes
Key: MAHOUT-1391
URL: https://issues.apache.org/jira/browse/MAHOUT-1391
Project: Mahout
Issue Type: New Feature
Components: Classification
Affects Versions: 1.0
Reporter: mansur
Sometimes the confusion matrix is too big and not really necessary.
There is also a second case where being able to disable it would help:
If you split a dataset with many labels into a test dataset and a training dataset by random selection percentage, it can happen that some classes/labels in the test data do not appear in the training data. A model built from the training data then has a label index that does not include those labels. If you test this model with the test data, Mahout tries to create a confusion matrix using labels from the test data that are not in the label index, and throws an exception.
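The mismatch described above can be reproduced without Mahout. The following is only a minimal sketch of the idea, not Mahout's implementation: the class LabelIndexCheck and its methods buildLabelIndex and unseenLabels are hypothetical names made up for illustration. It builds a label index from the training labels only and then lists the test labels that the index does not know about, which are the labels a confusion matrix keyed on that index could not place.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper for illustration only; not part of Mahout.
public class LabelIndexCheck {

    // Assign each distinct training label the next free index,
    // in order of first appearance.
    static Map<String, Integer> buildLabelIndex(List<String> trainingLabels) {
        Map<String, Integer> index = new LinkedHashMap<>();
        for (String label : trainingLabels) {
            index.putIfAbsent(label, index.size());
        }
        return index;
    }

    // Collect the test labels that have no entry in the label index.
    static Set<String> unseenLabels(Map<String, Integer> labelIndex,
                                    List<String> testLabels) {
        Set<String> unseen = new LinkedHashSet<>();
        for (String label : testLabels) {
            if (!labelIndex.containsKey(label)) {
                unseen.add(label);
            }
        }
        return unseen;
    }

    public static void main(String[] args) {
        // A random split left "science" only in the test data.
        List<String> train = Arrays.asList("sports", "politics", "sports");
        List<String> test = Arrays.asList("politics", "science", "sports");

        Map<String, Integer> index = buildLabelIndex(train);
        System.out.println(unseenLabels(index, test)); // prints [science]
    }
}
```

A check of this kind, run before testing, would show whether the split produced labels that exist only in the test data. Whether to drop those instances, re-split, or skip the confusion matrix is a separate decision that this sketch does not make.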
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)