Posted to issues@opennlp.apache.org by "Joern Kottmann (JIRA)" <ji...@apache.org> on 2012/07/13 16:25:33 UTC

[jira] [Created] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model

Joern Kottmann created OPENNLP-524:
--------------------------------------

             Summary: Tokenizer does not load 1.5.0 sourceforge model
                 Key: OPENNLP-524
                 URL: https://issues.apache.org/jira/browse/OPENNLP-524
             Project: OpenNLP
          Issue Type: Improvement
          Components: Tokenizer
            Reporter: Joern Kottmann
             Fix For: tools-1.5.3


I am doing some testing (of trunk) and ran into this issue.
The tokenizer refuses to load the model from the SourceForge
site.

I am getting this exception:
Caused by: java.lang.IllegalArgumentException: opennlp.tools.util.InvalidFormatException: alphaNumericPattern is a mandatory property!
    at opennlp.tools.util.model.BaseModel.checkArtifactMap(BaseModel.java:470)
    at opennlp.tools.util.model.BaseModel.loadModel(BaseModel.java:241)
    at opennlp.tools.util.model.BaseModel.<init>(BaseModel.java:181)
    at opennlp.tools.tokenize.TokenizerModel.<init>(TokenizerModel.java:125)
    at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:39)
    at opennlp.tools.cmdline.tokenizer.TokenizerModelLoader.loadModel(TokenizerModelLoader.java:31)
    at opennlp.tools.cmdline.ModelLoader.load(ModelLoader.java:62)
    at opennlp.tools.cmdline.tokenizer.TokenizerMETool.run(TokenizerMETool.java:41)
    at opennlp.tools.cmdline.CLI.main(CLI.java:225)
    ... 6 more
Caused by: opennlp.tools.util.InvalidFormatException: alphaNumericPattern is a mandatory property!
    at opennlp.tools.tokenize.TokenizerFactory.validateArtifactMap(TokenizerFactory.java:98)
    at opennlp.tools.util.model.BaseModel.validateArtifactMap(BaseModel.java:451)
    at opennlp.tools.tokenize.TokenizerModel.validateArtifactMap(TokenizerModel.java:148)
    at opennlp.tools.util.model.BaseModel.checkArtifactMap(BaseModel.java:468)
    ... 14 more

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model

Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416284#comment-13416284 ] 

Joern Kottmann commented on OPENNLP-524:
----------------------------------------

Can this be fixed by simply removing the check in TokenizerFactory.validateArtifactMap? It seems the tokenizer can handle the case where alphaNumericPattern is not available (null) well.
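
The proposal above can be illustrated with a minimal sketch. This is not the OpenNLP source: the class and method names are hypothetical stand-ins, the manifest is modelled as a plain java.util.Properties, and the regex validity check in the lenient variant is an illustrative extra that is not discussed in this issue. Only the property name and the error message are taken from the stack trace.

```java
import java.util.Properties;
import java.util.regex.Pattern;

public class ValidationSketch {

    // Property name taken from the exception message in this issue.
    static final String ALPHA_NUMERIC_PATTERN = "alphaNumericPattern";

    // Hypothetical stand-in for the strict check: a missing property is an error.
    static void validateStrict(Properties manifest) {
        if (manifest.getProperty(ALPHA_NUMERIC_PATTERN) == null) {
            throw new IllegalArgumentException(
                    ALPHA_NUMERIC_PATTERN + " is a mandatory property!");
        }
    }

    // Hypothetical lenient variant: a missing property is accepted.
    // If a value is present, it is only checked for being a valid regex
    // (illustrative extra, not part of the discussion in this issue).
    static void validateLenient(Properties manifest) {
        String value = manifest.getProperty(ALPHA_NUMERIC_PATTERN);
        if (value != null) {
            Pattern.compile(value); // throws PatternSyntaxException if invalid
        }
    }

    public static void main(String[] args) {
        // Manifest of an older model: no alphaNumericPattern entry.
        Properties oldModelManifest = new Properties();

        validateLenient(oldModelManifest);
        System.out.println("lenient check passed");

        try {
            validateStrict(oldModelManifest);
        } catch (IllegalArgumentException e) {
            System.out.println("strict check failed: " + e.getMessage());
        }
    }
}
```

With the lenient variant, a manifest written before the property existed is no longer rejected at load time.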
                


        

[jira] [Updated] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model

Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joern Kottmann updated OPENNLP-524:
-----------------------------------

    Issue Type: Bug  (was: Improvement)
    


        

[jira] [Closed] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model

Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joern Kottmann closed OPENNLP-524.
----------------------------------

    


        

[jira] [Resolved] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model

Posted by "William Colen (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

William Colen resolved OPENNLP-524.
-----------------------------------

    Resolution: Fixed

Loading no longer aborts if the property is missing. If the model does not contain an alphanumeric pattern, the default one is used.
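
A minimal sketch of the fallback behaviour described in this resolution, under stated assumptions: the class and method names are hypothetical, the manifest is modelled as java.util.Properties, and the default pattern string is an assumed value chosen for illustration. The actual default is not stated in this thread.

```java
import java.util.Properties;
import java.util.regex.Pattern;

public class AlphaNumericFallback {

    // Assumed default for illustration only; not confirmed in this issue.
    static final String DEFAULT_ALPHANUMERIC = "^[A-Za-z0-9]+$";

    // Returns the pattern stored in the manifest, or the default when
    // the property is absent (as in models created before it existed).
    static Pattern resolvePattern(Properties manifest) {
        String value = manifest.getProperty("alphaNumericPattern");
        return Pattern.compile(value != null ? value : DEFAULT_ALPHANUMERIC);
    }

    public static void main(String[] args) {
        Properties oldModel = new Properties(); // property missing

        Properties newModel = new Properties();
        newModel.setProperty("alphaNumericPattern", "^[a-z]+$");

        System.out.println(resolvePattern(oldModel).pattern());
        System.out.println(resolvePattern(newModel).pattern());
    }
}
```

Resolving the pattern in one place keeps the fallback out of the validation step, so older models load without any special casing elsewhere.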
                


        

[jira] [Commented] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model

Posted by "William Colen (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13416550#comment-13416550 ] 

William Colen commented on OPENNLP-524:
---------------------------------------

Yes, sure, it can be null. We can solve it by removing the validation. I will do it.
                


        

[jira] [Assigned] (OPENNLP-524) Tokenizer does not load 1.5.0 sourceforge model

Posted by "Joern Kottmann (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/OPENNLP-524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joern Kottmann reassigned OPENNLP-524:
--------------------------------------

    Assignee: William Colen

Would you mind having a look at this?
                
