You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Jeff Zemerick (JIRA)" <ji...@apache.org> on 2016/03/09 19:15:41 UTC
[jira] [Updated] (OPENNLP-837) Let Doccat fail when non-sufficient
amounts of training data are provided for training
[ https://issues.apache.org/jira/browse/OPENNLP-837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jeff Zemerick updated OPENNLP-837:
----------------------------------
Attachment: OPENNLP-837.patch
Attached a patch that throws a new exception (InsufficientTrainingDataException) if the number of unique events is 0. Model creation does not continue in this case.
> Let Doccat fail when non-sufficient amounts of training data are provided for training
> --------------------------------------------------------------------------------------
>
> Key: OPENNLP-837
> URL: https://issues.apache.org/jira/browse/OPENNLP-837
> Project: OpenNLP
> Issue Type: Bug
> Components: Doccat
> Reporter: Tommaso Teofili
> Fix For: 1.6.1
>
> Attachments: OPENNLP-837.patch
>
>
> When the amounts of training data are not sufficient in order to train a Doccat model the user should be made aware of that with an informative message, e.g. a warning when using the command line, an exception when calling the APIs programmatically.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)