You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by Michael McCandless <lu...@mikemccandless.com> on 2010/05/18 16:22:57 UTC

Re: (LUCENE-2257) relax the per-segment max unique term limit

Duh, sorry, that should have been "but on stable (3x) the limit is
across all fields".

On trunk (= flex) the limit is per-field.

Mike

On Tue, May 18, 2010 at 10:12 AM, Koji Sekiguchi <ko...@r.email.ne.jp> wrote:
>  > but in trunk, the limit is across all fields
>
> Got it. Thanks, Mike!
>
> Koji
>
> --
> http://www.rondhuit.com/en/
>
>
> (10/05/18 18:21), Michael McCandless (JIRA) wrote:
>>      [ https://issues.apache.org/jira/browse/LUCENE-2257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12868569#action_12868569 ]
>>
>> Michael McCandless commented on LUCENE-2257:
>> --------------------------------------------
>>
>> Yes, the limit is number of unique terms per-segment.
>>
>> Flex actually increases the limit (the limit is per-field, per-segment; but in trunk, the limit is across all fields).
>>
>>
>>> relax the per-segment max unique term limit
>>> -------------------------------------------
>>>
>>>                  Key: LUCENE-2257
>>>                  URL: https://issues.apache.org/jira/browse/LUCENE-2257
>>>              Project: Lucene - Java
>>>           Issue Type: Improvement
>>>             Reporter: Michael McCandless
>>>             Assignee: Michael McCandless
>>>             Priority: Minor
>>>              Fix For: 2.9.2, 3.0.1, 4.0
>>>
>>>          Attachments: LUCENE-2257.patch, LUCENE-2257.patch
>>>
>>>
>>> Lucene can't handle more than 2.1B (limit of signed 32 bit int) unique terms in a single segment.
>>> But I think we can improve this to termIndexInterval (default 128) * 2.1B.  There is one place (internal API only) where Lucene uses an int but should use a long.
>>>
>>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
> For additional commands, e-mail: dev-help@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org