You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Ioan Barbulescu (JIRA)" <ji...@apache.org> on 2013/09/17 16:36:56 UTC

[jira] [Created] (OPENNLP-597) Code in tools/parser throws some NullPointerExceptions when dealing with poor training data

Ioan Barbulescu created OPENNLP-597:
---------------------------------------

             Summary: Code in tools/parser throws some NullPointerExceptions when dealing with poor training data
                 Key: OPENNLP-597
                 URL: https://issues.apache.org/jira/browse/OPENNLP-597
             Project: OpenNLP
          Issue Type: Bug
          Components: Parser
    Affects Versions: tools-1.5.3
         Environment: Windows 7 + java 1.7.0_21 
            Reporter: Ioan Barbulescu
            Priority: Minor
             Fix For: 1.6.0


I was trying to train the Treebank Parser with some new data.

Truth to be told, the data was in poor format. Specifically, instead of "(-RRB- -RRB-)", it contained "( -RRB-)".
The same for -LRB- constructions.

Due to this input data, the parsing code was throwing some NullPointerException errors.

The fixes consist in some supplementary "if()"s, to safeguard against null pointers.

Fixes are in 3 files, attached as diff. The diff was created by svn, run in the opennlp-tool/.../parser directory.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira