You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "jianping weng (Jira)" <ji...@apache.org> on 2022/02/17 12:59:00 UTC

[jira] [Updated] (LUCENE-10425) Lucene supports bkd binary search and return current index of posting list

     [ https://issues.apache.org/jira/browse/LUCENE-10425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

jianping weng updated LUCENE-10425:
-----------------------------------
    Component/s: core/search
    Description: 
In log scenario, we usually want to know the doc count of documents between every time intervals. One possible optimized method is to sort the docuemt in ascend order according to @timestamp field in one segment. then we can use pr[[https://github.com/apache/lucene/pull/687]] to find out the min/max docId in on time interval.

If there is no other filter query, the doc count of one time interval is (max docId- min docId +1)

if there is only one another term filter query, we can use pr[[https://github.com/apache/lucene/pull/688]] to get the diff value of index, when we call advance(minId) and advance(maxId), the diff value is also the doc count of one time interval

 

> Lucene supports bkd binary search and return current index of posting list
> --------------------------------------------------------------------------
>
>                 Key: LUCENE-10425
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10425
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: core/search
>            Reporter: jianping weng
>            Priority: Major
>
> In log scenario, we usually want to know the doc count of documents between every time intervals. One possible optimized method is to sort the docuemt in ascend order according to @timestamp field in one segment. then we can use pr[[https://github.com/apache/lucene/pull/687]] to find out the min/max docId in on time interval.
> If there is no other filter query, the doc count of one time interval is (max docId- min docId +1)
> if there is only one another term filter query, we can use pr[[https://github.com/apache/lucene/pull/688]] to get the diff value of index, when we call advance(minId) and advance(maxId), the diff value is also the doc count of one time interval
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org