You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Jeff Zemerick (Jira)" <ji...@apache.org> on 2022/10/13 13:45:00 UTC

[jira] [Updated] (OPENNLP-1388) Inconsistency in span.getCoveredText()

     [ https://issues.apache.org/jira/browse/OPENNLP-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jeff Zemerick updated OPENNLP-1388:
-----------------------------------
    Summary: Inconsistency in span.getCoveredText()  (was: Fix span.getCoveredText())

> Inconsistency in span.getCoveredText()
> --------------------------------------
>
>                 Key: OPENNLP-1388
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1388
>             Project: OpenNLP
>          Issue Type: Task
>    Affects Versions: 1.9.4, 2.0.0
>            Reporter: Jeff Zemerick
>            Priority: Major
>
> Span.getCoveredText() is getting the string based on the character start/end and not the token start/end.
> Example:
> string = "Neil Abercrombie Anibal Acevedo-Vila Gary Ackerman"
> span = [0..2) person
> span.getCoveredText(sentence)) returns "Ne" and not "Neil Abercrombie"



--
This message was sent by Atlassian Jira
(v8.20.10#820010)