You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Dawid Weiss (Jira)" <ji...@apache.org> on 2022/06/22 20:25:00 UTC

[jira] [Resolved] (LUCENE-10607) NRTSuggesterBuilder扩展input时溢出

     [ https://issues.apache.org/jira/browse/LUCENE-10607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dawid Weiss resolved LUCENE-10607.
----------------------------------
    Fix Version/s: 9.3
       Resolution: Fixed

> NRTSuggesterBuilder扩展input时溢出
> -----------------------------
>
>                 Key: LUCENE-10607
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10607
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: core/FSTs
>    Affects Versions: 9.2
>            Reporter: chaseny
>            Priority: Major
>             Fix For: 9.3
>
>          Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> suggest模块在创建索引时,调用NRTSuggestBuilder的finishTerm来写入suggest索引。
> 会调用maxNumArcsForDedupByte函数来扩展analyzed,向后扩展3 5 7 .... 255。
> 当entries长度过长(9000000)时,调用maxNumArcsForDedupByte扩展时
>  
> private static int maxNumArcsForDedupByte(int currentNumDedupBytes) {
> int maxArcs = 1 + (2 * currentNumDedupBytes);
> if (currentNumDedupBytes > 5)
> { maxArcs *= currentNumDedupBytes;  //当currentNumDedupBytes大于等于32768时,int相乘会大于int最大值 }
> return Math.min(maxArcs, 255);
> }
>  
> 另外在扩展时,是否可以选择固定4字节来有序扩展。代替 3 5 7 ... 255的扩展方式
>  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org