You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Jörn Kottmann (JIRA)" <ji...@apache.org> on 2011/06/27 10:17:47 UTC

[jira] [Created] (OPENNLP-205) Refactor the SentenceDetectorME class to do the mapping of end-of-sent positions to spans better

Refactor the SentenceDetectorME class to do the mapping of end-of-sent positions to spans better
------------------------------------------------------------------------------------------------

                 Key: OPENNLP-205
                 URL: https://issues.apache.org/jira/browse/OPENNLP-205
             Project: OpenNLP
          Issue Type: Improvement
          Components: Sentence Detector
            Reporter: Jörn Kottmann


The SentenceDectorME class should be refactored to improve the mapping of end-of-sent positions to spans better. The current code tries to eliminate white spaces between to sentences, but this code fails in case the UseTokenEnd option is set to false. If set to true the sentence detector might not work correctly in all cases.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira