You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@jena.apache.org by "Osma Suominen (JIRA)" <ji...@apache.org> on 2015/10/29 16:14:27 UTC

[jira] [Created] (JENA-1058) add ASCIIFoldingLowerCaseKeywordAnalyzer to jena-text

Osma Suominen created JENA-1058:
-----------------------------------

             Summary: add ASCIIFoldingLowerCaseKeywordAnalyzer to jena-text
                 Key: JENA-1058
                 URL: https://issues.apache.org/jira/browse/JENA-1058
             Project: Apache Jena
          Issue Type: New Feature
          Components: Text
            Reporter: Osma Suominen
            Assignee: Osma Suominen


I'd like to have an Analyzer for jena-text which is otherwise like LowerCaseKeywordAnalyzer that I've implemented before, but also includes the ASCIIFoldingFilter from Lucene. This means that the comparison will ignore accents, so that for example "deja vu" will match "déjà vu".

For some background on why I need this, see https://github.com/NatLibFi/Skosmos/issues/313

I already have an implementation of this ready, will make a PR shortly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)