You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Simon Willnauer (JIRA)" <ji...@apache.org> on 2013/03/07 21:59:11 UTC

[jira] [Created] (LUCENE-4817) Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword

Simon Willnauer created LUCENE-4817:
---------------------------------------

             Summary: Add KeywordRepeaterFilter to emit tokens twice once as keyword and once not as keyword
                 Key: LUCENE-4817
                 URL: https://issues.apache.org/jira/browse/LUCENE-4817
             Project: Lucene - Core
          Issue Type: Improvement
          Components: modules/analysis
    Affects Versions: 4.1
            Reporter: Simon Willnauer
            Priority: Minor
             Fix For: 5.0, 4.3


if you want to have a stemmed and an unstemmed version of a token one for recall and one for precision you have to do two fields today in most of the cases. Yet, most of the stemmers respect the keyword attribute so we could add a token filter that emits the same token twice once as keyword and once plain. Folks would most likely need to combine this RemoveDuplicatesTokenFilter but that way we can have stemmed and unstemmed version in the same field.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org