Posted to issues@lucene.apache.org by "Bruno Roustant (Jira)" <ji...@apache.org> on 2020/02/28 09:05:00 UTC

[jira] [Updated] (LUCENE-9245) Reduce AutomatonTermsEnum memory usage

     [ https://issues.apache.org/jira/browse/LUCENE-9245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bruno Roustant updated LUCENE-9245:
-----------------------------------
    Description: Reduce AutomatonTermsEnum memory usage. It uses a long[] whose size equals the number of automaton states, which may be large. Instead, we can use a short[], which is also slightly faster. (I tried a FixedBitSet, but it has to be cleared, which hurts performance.)  (was: Reduce AutomatonTermsEnum memory usage. It uses a long[] whose size equals the number of automaton states, which may be large. Instead, we can use a short[], which is also slightly faster. (I tried a FixedBitSet, but it has to be cleared, which hurts performance.)

Operations.getCommonPrefixBytesRef() is used by CompiledAutomaton to compute the common suffix for all automata with an infinite language. It can be optimized with lazy structure creation and fewer operations. This gave an improvement of a few percent in my benchmarks (LUCENE-9237).)
        Summary: Reduce AutomatonTermsEnum memory usage  (was: Optimize AutomatonTermsEnum and Operations.getCommonPrefixBytesRef)

> Reduce AutomatonTermsEnum memory usage
> --------------------------------------
>
>                 Key: LUCENE-9245
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9245
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Bruno Roustant
>            Assignee: Bruno Roustant
>            Priority: Minor
>          Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> Reduce AutomatonTermsEnum memory usage. It uses a long[] whose size equals the number of automaton states, which may be large. Instead, we can use a short[], which is also slightly faster. (I tried a FixedBitSet, but it has to be cleared, which hurts performance.)
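
The idea in the description can be illustrated with a generation-stamped "visited" array. This is only a minimal sketch under assumptions, not the actual patch: the class name GenerationVisitedSet and its methods are hypothetical, and it assumes that a state counts as visited when its stamp equals the current generation. With that scheme, starting a new pass only increments a counter, and the short[] needs a full clear only when the 16-bit counter wraps around, which is rare compared with clearing a FixedBitSet on every pass.

```java
import java.util.Arrays;

/**
 * Hypothetical sketch: tracks visited automaton states with one short per
 * state instead of one long per state (a quarter of the memory).
 */
final class GenerationVisitedSet {
  // visited[state] == curGen means "state was seen during the current pass".
  private final short[] visited;
  // Current generation stamp. 0 is reserved to mean "never visited".
  private short curGen;

  GenerationVisitedSet(int numStates) {
    visited = new short[numStates]; // all stamps start at 0
  }

  /** Starts a new pass. Amortized O(1): the array is cleared only on wrap-around. */
  void newPass() {
    curGen++;
    if (curGen == 0) {
      // The 16-bit counter wrapped: old stamps could collide with new
      // generations, so reset every stamp and skip the reserved value 0.
      Arrays.fill(visited, (short) 0);
      curGen = 1;
    }
  }

  /** Marks the state as visited; returns true if it was not yet visited in this pass. */
  boolean markVisited(int state) {
    if (visited[state] == curGen) {
      return false;
    }
    visited[state] = curGen;
    return true;
  }

  /** Returns true if the state has been visited during the current pass. */
  boolean isVisited(int state) {
    return visited[state] == curGen;
  }
}
```

A caller would invoke newPass() before each traversal and markVisited(state) while following transitions. The trade-off is one occasional Arrays.fill per 65535 passes in exchange for the smaller array.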


