You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Otis Gospodnetic <ot...@gmail.com> on 2013/10/24 03:03:46 UTC

Global IDF vs. Routing

Hi,

Seeing so much work being put in routing and seeing the recent
questions about the status of global IDF support made me realize, for
the first time really, that with people using routing more and more we
should be seeing more and more issues caused by the lack of global IDF
because routing by definition doesn't randomly and evenly spread data
across shards.

Is this correct or am I missing something and this is in fact not
(such a big) problem?

Thanks,
Otis
--
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/

Re: Global IDF vs. Routing

Posted by Otis Gospodnetic <ot...@gmail.com>.
Duh, right, right, sorry for the noise.

Otis
--
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/


On Wed, Oct 23, 2013 at 9:13 PM, Yonik Seeley <ys...@gmail.com> wrote:
> On Wed, Oct 23, 2013 at 9:03 PM, Otis Gospodnetic
> <ot...@gmail.com> wrote:
>> Seeing so much work being put in routing and seeing the recent
>> questions about the status of global IDF support made me realize, for
>> the first time really, that with people using routing more and more we
>> should be seeing more and more issues caused by the lack of global IDF
>> because routing by definition doesn't randomly and evenly spread data
>> across shards.
>
> Many people are using routing to partition users data - in this case,
> global IDF would normally not be what you want anyway.
>
> -Yonik

Re: Global IDF vs. Routing

Posted by Yonik Seeley <ys...@gmail.com>.
On Wed, Oct 23, 2013 at 9:03 PM, Otis Gospodnetic
<ot...@gmail.com> wrote:
> Seeing so much work being put in routing and seeing the recent
> questions about the status of global IDF support made me realize, for
> the first time really, that with people using routing more and more we
> should be seeing more and more issues caused by the lack of global IDF
> because routing by definition doesn't randomly and evenly spread data
> across shards.

Many people are using routing to partition users data - in this case,
global IDF would normally not be what you want anyway.

-Yonik