You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@solr.apache.org by "Sridhar (Jira)" <ji...@apache.org> on 2022/08/29 17:45:00 UTC
[jira] [Updated] (SOLR-16359) Reindex without delete old index data

     [ https://issues.apache.org/jira/browse/SOLR-16359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sridhar updated SOLR-16359:
---------------------------
    Description: 
In my existing Solr core index. I am trying to get relevant search results working with or without the words, hyphens. 

Here is the query example for `like` clause: `q=Address:(*{*}5-6*{*}) AND SID:(*{*}584*{*}) AND City:(*{*}brentwood*{*})`

In the configuration file `/conf/managed-schema.xml` I changed the tokenizer to `solr.KeywordTokenizerFactory` from the default tokenizer `solr.StandardTokenizerFactory` on general text filed `text_general`.

    <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100" multiValued="true">
      <analyzer type="index">
        <tokenizer class="solr.KeywordTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />        
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.KeywordTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.SynonymGraphFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>

The above configuration is working fine for new `core` index data. But it is not working on the existing `core` index. I had to delete the old index data and reindex from the scratch. *Is there any way to work reindex the data without delete the old data?*

  was:
In my existing Solr core index. I am trying to get relevant search results working with or without the words, hyphens. 

Here is the query example for `like` clause: `q=Address:(*5-6*) AND SID:(*584*) AND City:(*brentwood*)`

In the configuration file `/conf/managed-schema.xml` I changed the tokenizer to `solr.KeywordTokenizerFactory` from the default tokenizer `solr.StandardTokenizerFactory` on general text filed `text_general`.

    <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100" multiValued="true">
      <analyzer type="index">
        <tokenizer class="solr.KeywordTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />        
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.KeywordTokenizerFactory"/>
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
        <filter class="solr.SynonymGraphFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>

The above configuration is working fine for new `core` index data. But it is not working on the existing `core` index. I had to delete the old index data and reindex from the scratch. *Is there any way to work reindex the data without delete the old data?*


> Reindex without delete old index data
> -------------------------------------
>
>                 Key: SOLR-16359
>                 URL: https://issues.apache.org/jira/browse/SOLR-16359
>             Project: Solr
>          Issue Type: Task
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Sridhar
>            Priority: Major
>
> In my existing Solr core index. I am trying to get relevant search results working with or without the words, hyphens. 
> Here is the query example for `like` clause: `q=Address:(*{*}5-6*{*}) AND SID:(*{*}584*{*}) AND City:(*{*}brentwood*{*})`
> In the configuration file `/conf/managed-schema.xml` I changed the tokenizer to `solr.KeywordTokenizerFactory` from the default tokenizer `solr.StandardTokenizerFactory` on general text filed `text_general`.
>     <fieldType name="text_general" class="solr.TextField" positionIncrementGap="100" multiValued="true">
>       <analyzer type="index">
>         <tokenizer class="solr.KeywordTokenizerFactory"/>
>         <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />        
>         <filter class="solr.LowerCaseFilterFactory"/>
>       </analyzer>
>       <analyzer type="query">
>         <tokenizer class="solr.KeywordTokenizerFactory"/>
>         <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
>         <filter class="solr.SynonymGraphFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
>         <filter class="solr.LowerCaseFilterFactory"/>
>       </analyzer>
>     </fieldType>
> The above configuration is working fine for new `core` index data. But it is not working on the existing `core` index. I had to delete the old index data and reindex from the scratch. *Is there any way to work reindex the data without delete the old data?*



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@solr.apache.org
For additional commands, e-mail: issues-help@solr.apache.org