You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Jim Regan (JIRA)" <ji...@apache.org> on 2017/05/24 21:16:04 UTC

[jira] [Commented] (OPENNLP-1078) Irish Sentence Bank format support throws NPE

    [ https://issues.apache.org/jira/browse/OPENNLP-1078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16023704#comment-16023704 ] 

Jim Regan commented on OPENNLP-1078:
------------------------------------

Ok, I've run a quick perl script over the whole sentence bank and there's a fairly significant number of sentences where there are more tokens than lemmas. I'll put in a check to skip those tomorrow, and work with the creator to fix the errors in the original.

> Irish Sentence Bank format support throws NPE
> ---------------------------------------------
>
>                 Key: OPENNLP-1078
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1078
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Formats
>            Reporter: Joern Kottmann
>            Priority: Minor
>             Fix For: 1.8.1
>
>
> It happens when trying to match the surface form with its lemma, on line 245 of
> IrishSentenceBankDocument.java:
> int rsize = flx.get(flexidx).size();



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)