Posted to issues@opennlp.apache.org by "Joern Kottmann (JIRA)" <ji...@apache.org> on 2012/07/13 16:25:33 UTC
[jira] [Created] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
Joern Kottmann created OPENNLP-524:
--------------------------------------
Summary: Tokenizer does not load 1.5.0 sourceforge model
Key: OPENNLP-524
URL: https://issues.apache.org/jira/browse/OPENNLP-524
Project: OpenNLP
Issue Type: Improvement
Components: Tokenizer
Reporter: Joern Kottmann
Fix For: tools-1.5.3
I am doing some testing (of trunk) and ran into this issue.
The tokenizer refuses to load the 1.5.0 model from the SourceForge site.
I am getting this exception:
Caused by: java.lang.IllegalArgumentException: opennlp.tools.util.InvalidFormatException: alphaNumericPattern is a mandatory property!
at opennlp.tools.util.model.BaseModel.checkArtifactMap(BaseModel.java:470)
at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:241)
at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:181)
at opennlp.tools.tokenize.TokenizerModel.<init>(TokenizerModel.java:125)
at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:39)
at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:31)
at opennlp.tools.cmdline.ModelLoader.load(ModelLoader.java:62)
at opennlp.tools.cmdline.tokenizer.TokenizerMETool.run(TokenizerMETool.java:41)
at opennlp.tools.cmdline.CLI.main(CLI.java:225)
... 6 more
Caused by: opennlp.tools.util.InvalidFormatException: alphaNumericPattern is a mandatory property!
at opennlp.tools.tokenize.TokenizerFactory.validateArtifactMap(TokenizerFactory.java:98)
at opennlp.tools.util.model.BaseModel.validateArtifactMap(BaseModel.java:451)
at opennlp.tools.tokenize.TokenizerModel.validateArtifactMap(TokenizerModel.java:148)
at opennlp.tools.util.model.BaseModel.checkArtifactMap(BaseModel.java:468)
... 14 more
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416284#comment-13416284 ]
Joern Kottmann commented on OPENNLP-524:
----------------------------------------
Can this be fixed by simply removing the check in TokenizerFactory.validateArtifactMap? The rest of the code seems to handle the case where alphaNumericPattern is not available (null) well.
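The idea of relaxing the check can be sketched roughly as follows. This is not OpenNLP source: the class name, the use of a plain Map as a stand-in for the model manifest, and the choice to still reject a present-but-malformed value are all assumptions made for illustration. Only the property name alphaNumericPattern comes from the exception message in the report.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

// Hypothetical stand-in for the factory's validation step; not OpenNLP code.
final class LenientValidationSketch {

    // Property name as it appears in the reported exception message.
    static final String ALPHA_NUMERIC_PATTERN = "alphaNumericPattern";

    // Tolerates a manifest without the property (as with older models),
    // but still rejects a value that is present and not a valid regex.
    static void validate(Map<String, String> manifest) {
        String value = manifest.get(ALPHA_NUMERIC_PATTERN);
        if (value == null) {
            return; // missing property is no longer treated as an error
        }
        try {
            Pattern.compile(value);
        } catch (PatternSyntaxException e) {
            throw new IllegalArgumentException(
                    ALPHA_NUMERIC_PATTERN + " is not a valid pattern: " + value, e);
        }
    }

    public static void main(String[] args) {
        Map<String, String> oldModelManifest = new HashMap<>();
        validate(oldModelManifest); // passes: property absent

        oldModelManifest.put(ALPHA_NUMERIC_PATTERN, "^[A-Za-z0-9]+$");
        validate(oldModelManifest); // passes: property present and well formed
        System.out.println("validation passed");
    }
}
```

Whether a malformed value should still be rejected is a design choice the thread does not discuss; the sketch keeps that check only to show that relaxing "mandatory" need not mean dropping validation entirely.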
[jira] [Updated] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joern Kottmann updated OPENNLP-524:
-----------------------------------
Issue Type: Bug (was: Improvement)
[jira] [Closed] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joern Kottmann closed OPENNLP-524.
----------------------------------
[jira] [Resolved] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
Posted by "William Colen (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
William Colen resolved OPENNLP-524.
-----------------------------------
Resolution: Fixed
Loading no longer aborts if the alphaNumericPattern property is missing. If the model does not provide it, the default alphanumeric pattern is used instead.
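A minimal sketch of the fallback described in this resolution, under stated assumptions: the class and method names are hypothetical, a plain Map stands in for the model's manifest properties, and the default pattern shown is an assumed placeholder, not confirmed here as the value OpenNLP uses.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical illustration of "use the default if the model has none".
final class DefaultPatternSketch {

    // Assumed placeholder default; the real default lives in OpenNLP itself.
    static final String ASSUMED_DEFAULT = "^[A-Za-z0-9]+$";

    // Returns the pattern stored in the manifest, or the default when the
    // property is absent (as with models trained before it existed).
    static Pattern alphaNumericPattern(Map<String, String> manifest) {
        String value = manifest.get("alphaNumericPattern");
        return Pattern.compile(value != null ? value : ASSUMED_DEFAULT);
    }

    public static void main(String[] args) {
        Map<String, String> oldModelManifest = new HashMap<>(); // no property set
        Pattern fallback = alphaNumericPattern(oldModelManifest);
        System.out.println(fallback.pattern());

        Map<String, String> newModelManifest = new HashMap<>();
        newModelManifest.put("alphaNumericPattern", "^[a-z]+$");
        System.out.println(alphaNumericPattern(newModelManifest).pattern());
    }
}
```

Resolving the default at read time, rather than requiring it at validation time, is what lets older model files keep loading without being retrained or rewritten.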
[jira] [Commented] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
Posted by "William Colen (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416550#comment-13416550 ]
William Colen commented on OPENNLP-524:
---------------------------------------
Yes, sure, it can be null. We can solve it by removing the validation. I will do it.
[jira] [Assigned] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model
Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joern Kottmann reassigned OPENNLP-524:
--------------------------------------
Assignee: William Colen
Would you mind having a look at this?