You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@opennlp.apache.org by "William Colen (JIRA)" <ji...@apache.org> on 2014/04/11 02:33:16 UTC

[jira] [Commented] (OPENNLP-81) Add a cli tool for the doccat evaluation support

    [ https://issues.apache.org/jira/browse/OPENNLP-81?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13966070#comment-13966070 ] 

William Colen commented on OPENNLP-81:
--------------------------------------

Created the evaluation CLI:

$ bin/opennlp DoccatEvaluator
Usage: opennlp DoccatEvaluator[.leipzig] [-reportOutputFile outputFile] [-misclassified true|false] -model model [-detailedF true|false] -data sampleData [-encoding charsetName]

Arguments description:
	-reportOutputFile outputFile
		the path of the fine-grained report file.
	-misclassified true|false
		if true will print false negatives and false positives.
	-model model
		the model file to be evaluated.
	-data sampleData
		data to be used, usually a file name.
	-encoding charsetName
		encoding for reading and writing text, if absent the system default is used.

The reportOutputFile includes F-Measure for each category and a confusion matrix.

> Add a cli tool for the doccat evaluation support
> ------------------------------------------------
>
>                 Key: OPENNLP-81
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-81
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Command Line Interface, Doccat
>            Reporter: Joern Kottmann
>            Priority: Minor
>
> There should be a command line tool which can be used to evaluate the document categorizer model
> on a test file.



--
This message was sent by Atlassian JIRA
(v6.2#6252)