You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@jena.apache.org by "Osma Suominen (JIRA)" <ji...@apache.org> on 2015/10/29 16:14:27 UTC
[jira] [Created] (JENA-1058) add
ASCIIFoldingLowerCaseKeywordAnalyzer to jena-text
Osma Suominen created JENA-1058:
-----------------------------------
Summary: add ASCIIFoldingLowerCaseKeywordAnalyzer to jena-text
Key: JENA-1058
URL: https://issues.apache.org/jira/browse/JENA-1058
Project: Apache Jena
Issue Type: New Feature
Components: Text
Reporter: Osma Suominen
Assignee: Osma Suominen
I'd like to have an Analyzer for jena-text which is otherwise like LowerCaseKeywordAnalyzer that I've implemented before, but also includes the ASCIIFoldingFilter from Lucene. This means that the comparison will ignore accents, so that for example "deja vu" will match "déjà vu".
For some background on why I need this, see https://github.com/NatLibFi/Skosmos/issues/313
I already have an implementation of this ready, will make a PR shortly.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)