Posted to solr-user@lucene.apache.org by da...@correo.aeat.es on 2013/11/07 14:09:24 UTC

Problem with size of segments

Hi,

I have a very big index, about 337 G. I am using Solr 4.2.
The problem we have is related to the size of the segments. These are the
sizes of the biggest ones:
324 G, 3.7 G, 3.6 G, 1.6 G, 1.6 G, 465 M ...  We use
LogByteSizeMergePolicy with a MergeFactor of 10 in our solrconfig.

This is not really a problem, but I would at least like to know why
my segments have these sizes. According to what I have read, with a
MergeFactor of 10 each level within the index should be one order
of magnitude bigger than the previous one. So I can't understand why I have a
segment of 324 G while the others are only around 3 G; that is 2 orders of
magnitude bigger.
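
As a rough check of the arithmetic above, the following Python sketch buckets the reported segment sizes by order of magnitude in base MergeFactor. It is only an illustration: the `level` helper and the 1 G = 1024 M conversion are assumptions made here, and the bucketing is a simplification, not Lucene's actual level computation.

```python
import math

MERGE_FACTOR = 10  # value reported in the post

# Segment sizes from the post, converted to megabytes.
# Assumption: 1 G = 1024 M.
segments_mb = [
    ("324 G", 324 * 1024),
    ("3.7 G", 3.7 * 1024),
    ("3.6 G", 3.6 * 1024),
    ("1.6 G", 1.6 * 1024),
    ("1.6 G", 1.6 * 1024),
    ("465 M", 465),
]


def level(size_mb, merge_factor=MERGE_FACTOR):
    """Hypothetical helper: order of magnitude of a size in base merge_factor."""
    return math.floor(math.log(size_mb, merge_factor))


levels = [level(size) for _, size in segments_mb]

# Levels between the smallest and largest that contain no segment at all.
missing = sorted(set(range(min(levels), max(levels) + 1)) - set(levels))

for (label, _), lvl in zip(segments_mb, levels):
    print(f"{label:>6} -> level {lvl}")
print("empty levels:", missing)
```

Under these assumptions the 324 G segment sits two levels above the 3 G group, with nothing in the level in between, which is the gap described above.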

Is this correct, or is there a problem with my index?
Where can I read a good explanation of the merge policy?

Thank you very much,

Regards,

David Dávila
AEAT

Re: Problem with size of segments

Posted by Jason Hellman <jh...@innoventsolutions.com>.
David,

I find Mike McCandless’ blog article to be very informative.  Give it a go and let us know if you are still seeking clarification:

http://blog.mikemccandless.com/2011/02/visualizing-lucenes-segment-merges.html

Jason
