You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@jena.apache.org by "Andy Seaborne (Jira)" <ji...@apache.org> on 2019/09/22 17:55:00 UTC

[jira] [Comment Edited] (JENA-1759) normalization of lang tags in text query

    [ https://issues.apache.org/jira/browse/JENA-1759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16935381#comment-16935381 ] 

Andy Seaborne edited comment on JENA-1759 at 9/22/19 5:54 PM:
--------------------------------------------------------------


Releated: the parsers have an option to canonicalize language tags, either to lower case or to the RFC canonical form.


was (Author: andy.seaborne):


Releated: the parsers have an option to canonicalize language tags, either to lower case or to the RDF form.

> normalization of lang tags in text query
> ----------------------------------------
>
>                 Key: JENA-1759
>                 URL: https://issues.apache.org/jira/browse/JENA-1759
>             Project: Apache Jena
>          Issue Type: Improvement
>          Components: Jena
>    Affects Versions: Jena 3.12.0
>            Reporter: Elie Roux
>            Priority: Minor
>
> It would be helpful if there was an option in the assembler option of the text query to normalize the lang tags of the query and the lucene indexes. In the current state, a literal with a lang tag \{{zh-Hant}} will not be found by a search on \{{zh-hant}}. I understand that in some cases the distinction might be important (although I have a hard time finding a real life use case), but I think in many cases it's helpful to ignore the case of the lang tag.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)