You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Joern Kottmann (JIRA)" <ji...@apache.org> on 2013/04/03 13:39:15 UTC
[jira] [Commented] (OPENNLP-564) DeTokenizer Rule File for german
language
[ https://issues.apache.org/jira/browse/OPENNLP-564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13620860#comment-13620860 ]
Joern Kottmann commented on OPENNLP-564:
----------------------------------------
Please open a new issue for the abb_dict contribution. We need to close this one to be able to include it in the 1.5.3 release.
> DeTokenizer Rule File for german language
> -----------------------------------------
>
> Key: OPENNLP-564
> URL: https://issues.apache.org/jira/browse/OPENNLP-564
> Project: OpenNLP
> Issue Type: New Feature
> Components: Tokenizer
> Affects Versions: tools-1.5.2-incubating
> Reporter: Andreas Niekler
> Assignee: Joern Kottmann
> Labels: detokenizer, rules, tokenizer
> Fix For: tools-1.5.3
>
> Attachments: abb_dict.txt, special_char_dict.txt
>
>
> Producing training data for german language needs a pattern file for detokenizer.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira