Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2012/12/21 09:03:12 UTC

[jira] [Created] (STANBOL-849) Implement Lucene Tokenizer based LabelTokenizer

Rupert Westenthaler created STANBOL-849:
-------------------------------------------

             Summary: Implement Lucene Tokenizer based LabelTokenizer
                 Key: STANBOL-849
                 URL: https://issues.apache.org/jira/browse/STANBOL-849
             Project: Stanbol
          Issue Type: New Feature
          Components: Engine - Entity Linking
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler
            Priority: Minor


Lucene provides Tokenizers for many languages. While the OpenNLP and whitespace-based Tokenizers are fine for most languages, a LabelTokenizer backed by a Lucene Tokenizer would allow users to configure a specialized one where needed (e.g. the smartcn analyzer package for Chinese).
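
The idea can be sketched as a per-language dispatch with a whitespace fallback. This is only an illustration under stated assumptions: the names LabelTokenizerSketch, LabelTokenizer, WhitespaceLabelTokenizer, BreakIteratorLabelTokenizer and LanguageAwareTokenizer are hypothetical and not taken from the Stanbol code base, and java.text.BreakIterator is used as a JDK-only stand-in for a Lucene Tokenizer so that the example runs without extra dependencies.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

/** Hypothetical sketch; none of these types are the actual Stanbol API. */
class LabelTokenizerSketch {

    /** Assumed shape of a label tokenizer: label and language in, tokens out. */
    interface LabelTokenizer {
        String[] tokenize(String label, String language);
    }

    /** Default behaviour: split the label on runs of whitespace. */
    static final class WhitespaceLabelTokenizer implements LabelTokenizer {
        @Override
        public String[] tokenize(String label, String language) {
            String trimmed = label == null ? "" : label.trim();
            return trimmed.isEmpty() ? new String[0] : trimmed.split("\\s+");
        }
    }

    /**
     * Stand-in for a language-specific tokenizer. A Lucene-based version
     * would wrap a Lucene Tokenizer here instead of BreakIterator.
     */
    static final class BreakIteratorLabelTokenizer implements LabelTokenizer {
        @Override
        public String[] tokenize(String label, String language) {
            if (label == null) {
                return new String[0];
            }
            Locale locale = language == null ? Locale.ROOT : Locale.forLanguageTag(language);
            BreakIterator words = BreakIterator.getWordInstance(locale);
            words.setText(label);
            List<String> tokens = new ArrayList<>();
            int start = words.first();
            for (int end = words.next(); end != BreakIterator.DONE; start = end, end = words.next()) {
                String candidate = label.substring(start, end);
                // Keep only segments with a letter or digit (drops spaces and punctuation).
                if (candidate.codePoints().anyMatch(Character::isLetterOrDigit)) {
                    tokens.add(candidate);
                }
            }
            return tokens.toArray(new String[0]);
        }
    }

    /** Picks the tokenizer registered for a language, else the fallback. */
    static final class LanguageAwareTokenizer implements LabelTokenizer {
        private final Map<String, LabelTokenizer> byLanguage = new HashMap<>();
        private final LabelTokenizer fallback;

        LanguageAwareTokenizer(LabelTokenizer fallback) {
            this.fallback = fallback;
        }

        void register(String language, LabelTokenizer tokenizer) {
            byLanguage.put(language, tokenizer);
        }

        @Override
        public String[] tokenize(String label, String language) {
            return byLanguage.getOrDefault(language, fallback).tokenize(label, language);
        }
    }
}
```

With this layout, a tokenizer registered for "zh" would handle Chinese labels while every other language keeps using the whitespace fallback. How the real component is configured and registered is left open by this issue.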

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira