Posted to issues@opennlp.apache.org by "Joern Kottmann (JIRA)" <ji...@apache.org> on 2012/07/17 17:25:34 UTC
[jira] [Commented] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
[ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416284#comment-13416284 ]
Joern Kottmann commented on OPENNLP-524:
----------------------------------------
Can this be fixed by simply removing the check in TokenizerFactory.validateArtifactMap? It seems it can already handle the case where alphaNumericPattern is not available (null).
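To make the suggestion concrete, here is a minimal, self-contained sketch of the idea. It is not the actual OpenNLP code: the class name, the map of properties, and the fallback pattern are all assumptions made up for illustration. It contrasts a strict check that rejects a model without alphaNumericPattern with a lenient lookup that falls back to a default when the property is null.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical stand-in for illustration only; not an OpenNLP class.
final class LenientTokenizerProperties {

    // Property key as it appears in the exception message of this issue.
    static final String ALPHA_NUMERIC_PATTERN = "alphaNumericPattern";

    // Assumed fallback, chosen for this sketch; the real default may differ.
    static final String ASSUMED_DEFAULT_PATTERN = "^[A-Za-z0-9]+$";

    // Strict behaviour: a model without the property is rejected outright.
    static void validateStrict(Map<String, String> props) {
        if (props.get(ALPHA_NUMERIC_PATTERN) == null) {
            throw new IllegalArgumentException(
                ALPHA_NUMERIC_PATTERN + " is a mandatory property!");
        }
    }

    // Lenient behaviour: a missing property (null) falls back to the default,
    // so older models without the property can still be used.
    static Pattern resolveAlphaNumericPattern(Map<String, String> props) {
        String value = props.get(ALPHA_NUMERIC_PATTERN);
        return Pattern.compile(value != null ? value : ASSUMED_DEFAULT_PATTERN);
    }

    public static void main(String[] args) {
        // Simulates an old model whose manifest lacks the property.
        Map<String, String> oldModel = new HashMap<>();
        Pattern pattern = resolveAlphaNumericPattern(oldModel);
        System.out.println(pattern.matcher("abc123").matches());
    }
}
```

With the strict check removed, loading would depend only on the lenient lookup, which is the behaviour the question above is asking about.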
> Tokenizer does not load 1.5.0 sourceforge model
> -----------------------------------------------
>
> Key: OPENNLP-524
> URL: https://issues.apache.org/jira/browse/OPENNLP-524
> Project: OpenNLP
> Issue Type: Bug
> Components: Tokenizer
> Reporter: Joern Kottmann
> Assignee: William Colen
> Fix For: tools-1.5.3
>
>
> I am doing some testing (of trunk) and ran into this issue.
> The tokenizer refuses to load the model from the SourceForge site.
> I am getting this exception:
> Caused by: java.lang.IllegalArgumentException: opennlp.tools.util.InvalidFormatException: alphaNumericPattern is a mandatory property!
> at opennlp.tools.util.model.BaseModel.checkArtifactMap(BaseModel.java:470)
> at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:241)
> at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:181)
> at opennlp.tools.tokenize.TokenizerModel.<init>(TokenizerModel.java:125)
> at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:39)
> at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:31)
> at opennlp.tools.cmdline.ModelLoader.load(ModelLoader.java:62)
> at opennlp.tools.cmdline.tokenizer.TokenizerMETool.run(TokenizerMETool.java:41)
> at opennlp.tools.cmdline.CLI.main(CLI.java:225)
> ... 6 more
> Caused by: opennlp.tools.util.InvalidFormatException: alphaNumericPattern is a mandatory property!
> at opennlp.tools.tokenize.TokenizerFactory.validateArtifactMap(TokenizerFactory.java:98)
> at opennlp.tools.util.model.BaseModel.validateArtifactMap(BaseModel.java:451)
> at opennlp.tools.tokenize.TokenizerModel.validateArtifactMap(TokenizerModel.java:148)
> at opennlp.tools.util.model.BaseModel.checkArtifactMap(BaseModel.java:468)
> ... 14 more