You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Jan Høydahl (Jira)" <ji...@apache.org> on 2020/05/18 15:00:00 UTC

[jira] [Resolved] (SOLR-1978) Create MappingTokenFilterFactory

     [ https://issues.apache.org/jira/browse/SOLR-1978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jan Høydahl resolved SOLR-1978.
-------------------------------
    Resolution: Won't Do

> Create MappingTokenFilterFactory
> --------------------------------
>
>                 Key: SOLR-1978
>                 URL: https://issues.apache.org/jira/browse/SOLR-1978
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: Jan Høydahl
>            Priority: Minor
>
> There is a need for a mapping filter as a counterpart for the MappingCharFilterFactory, but designed to run after tokenization. It should read the same config file format as the MappingCharFilterFactory does.
> This will be a more generic approach to accent normalization than the ISOLatin1AccentFilterFactory which is hard coded.
> The reason why we need it as a TokenFilter is that sometimes the normalization needs to happen far down in the analysis chain, because previous filters rely on the original value of the token, such as stemming, synonyms or other dictionary lookups.
> This patch would require a MappingTokenFilter in Lucene as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org