You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Ioan Barbulescu (JIRA)" <ji...@apache.org> on 2013/09/17 16:36:56 UTC
[jira] [Created] (OPENNLP-597) Code in tools/parser throws some
NullPointerExceptions when dealing with poor training data
Ioan Barbulescu created OPENNLP-597:
---------------------------------------
Summary: Code in tools/parser throws some NullPointerExceptions when dealing with poor training data
Key: OPENNLP-597
URL: https://issues.apache.org/jira/browse/OPENNLP-597
Project: OpenNLP
Issue Type: Bug
Components: Parser
Affects Versions: tools-1.5.3
Environment: Windows 7 + java 1.7.0_21
Reporter: Ioan Barbulescu
Priority: Minor
Fix For: 1.6.0
I was trying to train the Treebank Parser with some new data.
Truth to be told, the data was in poor format. Specifically, instead of "(-RRB- -RRB-)", it contained "( -RRB-)".
The same for -LRB- constructions.
Due to this input data, the parsing code was throwing some NullPointerException errors.
The fixes consist in some supplementary "if()"s, to safeguard against null pointers.
Fixes are in 3 files, attached as diff. The diff was created by svn, run in the opennlp-tool/.../parser directory.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira