Posted to dev@lucene.apache.org by "Zyan (JIRA)" <ji...@apache.org> on 2018/05/08 14:22:00 UTC

[jira] [Created] (SOLR-12329) Default Analyzer for known and unknown fields

Zyan created SOLR-12329:
---------------------------

             Summary: Default Analyzer for known and unknown fields
                 Key: SOLR-12329
                 URL: https://issues.apache.org/jira/browse/SOLR-12329
             Project: Solr
          Issue Type: Improvement
      Security Level: Public (Default Security Level. Issues are Public)
          Components: Schema and Analysis
            Reporter: Zyan


Hi

Is there any way to make KeywordTokenizer, rather than StandardTokenizer, the default tokenizer for all known fields as well as unknown fields (in schemaless mode)? Basically, I don't want Solr to break the values I index into multiple terms. The splitting also affects faceting: when I facet on a field CPU in an example data set (CPU="Quad 2 Core", CPU="Intel I3"), I get 5 facets (Quad, 2, Core, Intel and I3), which is not the behavior I'm expecting. I'm expecting only 2 facets.
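
To illustrate the difference, here is a rough Python sketch. It is not Solr code: word_tokens is only a simplified stand-in for a word-splitting tokenizer, keyword_token stands in for keyword-style tokenization, and all function names are made up for this example.

```python
import re
from collections import Counter

# The two example CPU values from the data set described above.
docs = ["Quad 2 Core", "Intel I3"]

def word_tokens(value):
    # Simplified stand-in for a word-splitting tokenizer:
    # every run of letters/digits becomes its own term.
    return re.findall(r"[A-Za-z0-9]+", value)

def keyword_token(value):
    # Keyword-style tokenization: the whole value is a single term.
    return [value]

def facet_counts(values, tokenize):
    # Count, per indexed term, how many documents contain it.
    counts = Counter()
    for value in values:
        counts.update(set(tokenize(value)))
    return counts

split_facets = facet_counts(docs, word_tokens)    # what I see today
whole_facets = facet_counts(docs, keyword_token)  # what I would like

print(len(split_facets), sorted(split_facets))
print(len(whole_facets), sorted(whole_facets))
```

With word splitting the two values turn into five facet terms; with keyword-style tokenization they stay as two.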

Can this be made configurable too? For example, could a user define the analyzers for all unknown fields through a wildcard syntax or something similar?
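
As a sketch of the kind of wildcard mapping I have in mind (purely hypothetical, written in Python for illustration; the rule list, the analyzer names and analyzer_for are all invented here and are not an existing Solr API):

```python
from fnmatch import fnmatchcase

# Hypothetical rules, checked in order; the first matching pattern wins.
ANALYZER_RULES = [
    ("description_*", "standard"),  # free-text fields may still be split
    ("*", "keyword"),               # everything else is kept as one term
]

def analyzer_for(field_name, rules=ANALYZER_RULES):
    # Return the analyzer name of the first rule whose glob matches the field.
    for pattern, analyzer in rules:
        if fnmatchcase(field_name, pattern):
            return analyzer
    return None

print(analyzer_for("CPU"))
print(analyzer_for("description_en"))
```

The catch-all "*" rule at the end is what would cover fields that are not known in advance.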

 

Thanks

Zyan



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org