Posted to solr-user@lucene.apache.org by "Husain, Yavar" <yh...@firstam.com> on 2012/03/15 16:49:29 UTC

Regarding Indexing Multiple Columns Best Practise

Say I have around 30-40 fields (SQL table columns) indexed from the database using Solr. I concatenate those fields into one field using Solr's copyField directive and then make it the default search field, which is the field I search.

If I instead concatenate all those fields into one field at the database level and index that field directly (which avoids Solr copying each field into the concatenated field), will that improve indexing performance? I am sure it will, but will it make a big difference in indexing run time?

Thanks!!!!


RE: Regarding Indexing Multiple Columns Best Practise

Posted by "Husain, Yavar" <yh...@firstam.com>.
Thanks Erick!!

-----Original Message-----
From: Erick Erickson [mailto:erickerickson@gmail.com] 
Sent: Friday, March 16, 2012 6:58 PM
To: solr-user@lucene.apache.org
Subject: Re: Regarding Indexing Multiple Columns Best Practise

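I would *guess* you won't notice much, if any, difference. Note that if you use a fieldType with a positionIncrementGap > 1 (the default is often set to 100), phrase queries (and slop) will behave differently depending on which option you choose: copyField adds each source value as a separate value of the destination field, with the gap between values, while a string concatenated in the database is indexed as a single value with no gaps.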

Best
Erick


Re: Regarding Indexing Multiple Columns Best Practise

Posted by Erick Erickson <er...@gmail.com>.
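I would *guess* you won't notice much, if any, difference. Note that if you
use a fieldType with a positionIncrementGap > 1 (the default is often set to
100), phrase queries (and slop) will behave differently depending on which
option you choose: copyField adds each source value as a separate value of
the destination field, with the gap between values, while a string
concatenated in the database is indexed as a single value with no gaps.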

Best
Erick
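
To make the point about the gap concrete, here is a minimal Python sketch. It is not Solr code, only a simplified model of token positions under two assumptions: whitespace tokenization, and a fixed gap added between the values of a multi-valued field. The function names (positions, phrase_matches) and the sample column values are made up for illustration, and the exact position numbers Lucene assigns may differ.

```python
def positions(values, gap):
    """Give every whitespace token a position; add `gap` between values.

    Simplified model (assumption): not Lucene's exact bookkeeping.
    """
    out = []
    pos = -1
    for i, value in enumerate(values):
        if i > 0:
            pos += gap  # extra distance between two values of the field
        for token in value.lower().split():
            pos += 1
            out.append((token, pos))
    return out


def phrase_matches(indexed, phrase):
    """Exact phrase match (slop 0): words must sit at consecutive positions."""
    words = phrase.lower().split()
    where = {}
    for token, pos in indexed:
        where.setdefault(token, set()).add(pos)
    return any(
        all(start + k in where.get(word, ()) for k, word in enumerate(words))
        for start in where.get(words[0], ())
    )


# Hypothetical column values from one database row.
columns = ["John Smith", "New York"]

# copyField-style: each column is a separate value, separated by the gap.
copied = positions(columns, gap=100)

# Database-side concatenation: one single value, so no gap is ever applied.
joined = positions([" ".join(columns)], gap=100)

print(phrase_matches(copied, "smith new"))  # False: the gap separates the columns
print(phrase_matches(joined, "smith new"))  # True: the tokens are adjacent
print(phrase_matches(copied, "new york"))   # True: both words come from one column
```

In this model a phrase that straddles two columns matches only when the columns were concatenated before indexing. With the copyField approach, such a phrase would need a slop large enough to bridge the gap, which is why a gap such as 100 is typically chosen.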
