You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Jörn Kottmann (JIRA)" <ji...@apache.org> on 2011/02/01 13:10:29 UTC

[jira] Created: (OPENNLP-116) Define low level Classifier API which only works on ordered int features

Define low level Classifier API which only works on ordered int features
------------------------------------------------------------------------

                 Key: OPENNLP-116
                 URL: https://issues.apache.org/jira/browse/OPENNLP-116
             Project: OpenNLP
          Issue Type: Improvement
          Components: Maxent
            Reporter: Jörn Kottmann


The maxent/perceptron code currently performs a mapping from String features to low level int features. Most of the code is clearly separated between these two different features, but the separation is not complete. There should be a clearly separated API for 
dealing with high level features and low level features. The API should also contain support to map high level features to low level features.

Goal of the separation is to allow also non-string features to be mapped to the low level int features, non string features could be hash int features, or hash long features. Or a different representation of a string e.g. UTF-8 bytes.

In previous discussions it turned out that having both levels of API are valuable.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira