You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2020/02/20 18:22:00 UTC

[jira] [Commented] (LUCENE-9228) Sort docValues updates by terms before applying

    [ https://issues.apache.org/jira/browse/LUCENE-9228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17041210#comment-17041210 ] 

ASF subversion and git services commented on LUCENE-9228:
---------------------------------------------------------

Commit a0b8f5c7c28bd897bdf48ed7af20b385a236b3a9 in lucene-solr's branch refs/heads/master from Nhat Nguyen
[ https://gitbox.apache.org/repos/asf?p=lucene-solr.git;h=a0b8f5c ]

LUCENE-9228: Sort dvUpdates by terms before apply

With this change, we sort dvUpdates in the term order before applying if
they all update a single field to the same value. This optimization can
reduce the flush time by around 20% for the docValues update user cases.


> Sort docValues updates by terms before applying
> -----------------------------------------------
>
>                 Key: LUCENE-9228
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9228
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/index
>    Affects Versions: master (9.0), 8.5
>            Reporter: Nhat Nguyen
>            Priority: Major
>          Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> If all DVUpdates update a single field to the same value, then we can apply these updates in the term order instead of the request order as both will yield the same result. This optimization allows us to iterate the term dictionary faster and de-duplicate updates.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org