You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oak-issues@jackrabbit.apache.org by "Chetan Mehrotra (JIRA)" <ji...@apache.org> on 2015/02/02 10:27:34 UTC

[jira] [Updated] (OAK-2469) Restrict the maximum number of terms that will be indexed for a single field

     [ https://issues.apache.org/jira/browse/OAK-2469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chetan Mehrotra updated OAK-2469:
---------------------------------
    Attachment: OAK-2469.patch

Patch for the same.

So far with Lucene 4.x my understanding was there was no actual limit on number of token for a given field. While for 3.x it was probably 10000 and with LUCENE-2295 this change moved to analyzer and by default it was again unlimited

However JR2 used to configure a {{LimitTokenCountAnalyzer}} with 10000 as default.

So with this change we would be changing the default from unlimited -> 10000. We can change the default to unlimited however I would prefer we specify a default of 10000

[~alex.parvulescu] [~teofili]  [~mreutegg] Thoughts?

> Restrict the maximum number of terms that will be indexed for a single field
> ----------------------------------------------------------------------------
>
>                 Key: OAK-2469
>                 URL: https://issues.apache.org/jira/browse/OAK-2469
>             Project: Jackrabbit Oak
>          Issue Type: Improvement
>          Components: oak-lucene
>            Reporter: Chetan Mehrotra
>            Assignee: Chetan Mehrotra
>            Priority: Minor
>             Fix For: 1.2
>
>         Attachments: OAK-2469.patch
>
>
> JR2 used support a {{maxFieldLength}} [1] to limit the numbers of terms indexed per field with default value of 10000. Similar support should be provided in Oak Lucene
> [1] https://github.com/apache/jackrabbit/blob/trunk/jackrabbit-core/src/main/java/org/apache/jackrabbit/core/query/lucene/SearchIndex.java#L299



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)