Posted to issues@opennlp.apache.org by "Joern Kottmann (JIRA)" <ji...@apache.org> on 2013/04/03 13:39:15 UTC

[jira] [Commented] (OPENNLP-564) DeTokenizer Rule File for german language

    [ https://issues.apache.org/jira/browse/OPENNLP-564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13620860#comment-13620860 ] 

Joern Kottmann commented on OPENNLP-564:
----------------------------------------

Please open a new issue for the abb_dict contribution. We need to close this one to be able to include it in the 1.5.3 release.
                
> DeTokenizer Rule File for german language
> -----------------------------------------
>
>                 Key: OPENNLP-564
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-564
>             Project: OpenNLP
>          Issue Type: New Feature
>          Components: Tokenizer
>    Affects Versions: tools-1.5.2-incubating
>            Reporter: Andreas Niekler
>            Assignee: Joern Kottmann
>              Labels: detokenizer, rules, tokenizer
>             Fix For: tools-1.5.3
>
>         Attachments: abb_dict.txt, special_char_dict.txt
>
>
> Producing training data for the German language requires a rule file for the detokenizer.
