You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@opennlp.apache.org by "Joern Kottmann (Created) (JIRA)" <ji...@apache.org> on 2011/09/29 13:08:45 UTC

[jira] [Created] (OPENNLP-303) Name finder should use token annoations from CAS instead of Simple Tokenizer

Name finder should use token annoations from CAS instead of Simple Tokenizer
----------------------------------------------------------------------------

                 Key: OPENNLP-303
                 URL: https://issues.apache.org/jira/browse/OPENNLP-303
             Project: OpenNLP
          Issue Type: Improvement
          Components: Cas Editor OpenNLP Plugin
            Reporter: Joern Kottmann
            Assignee: Joern Kottmann


The name finder currently uses the Simple Tokenizer to split a sentence into its tokens. This does not work in many cases, because the tokenization must be done differently.

To fix this, the name finder plugin should use the token annotations which are in the CAS.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Closed] (OPENNLP-303) Name finder should use token annoations from CAS instead of Simple Tokenizer

Posted by "Joern Kottmann (Closed) (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/OPENNLP-303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joern Kottmann closed OPENNLP-303.
----------------------------------

    Resolution: Fixed
    
> Name finder should use token annoations from CAS instead of Simple Tokenizer
> ----------------------------------------------------------------------------
>
>                 Key: OPENNLP-303
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-303
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Cas Editor OpenNLP Plugin
>            Reporter: Joern Kottmann
>            Assignee: Joern Kottmann
>
> The name finder currently uses the Simple Tokenizer to split a sentence into its tokens. This does not work in many cases, because the tokenization must be done differently.
> To fix this, the name finder plugin should use the token annotations which are in the CAS.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira