You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Dawid Weiss (Jira)" <ji...@apache.org> on 2021/11/04 10:36:00 UTC

[jira] [Updated] (LUCENE-10220) Add an utility method to get IntervalSource from analyzed text (or token stream)

     [ https://issues.apache.org/jira/browse/LUCENE-10220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dawid Weiss updated LUCENE-10220:
---------------------------------
    Fix Version/s: main (10.0)

> Add an utility method to get IntervalSource from analyzed text (or token stream)
> --------------------------------------------------------------------------------
>
>                 Key: LUCENE-10220
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10220
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Dawid Weiss
>            Assignee: Dawid Weiss
>            Priority: Minor
>             Fix For: main (10.0)
>
>
> The Intervals has a number of utility methods that provide an IntervalSource for tokens, phrases, etc. But it's missing an important bit: an interval source matching tokens that are a result of some string applied to a full analysis chain. This corresponds to actually resides in the index and is hard to predict from the outside.
> This is an important omission in Intervals as a utility class.
> I borrowed the implementation from the then-ASL-licensed Elasticsearch code at: 
> [https://github.com/elastic/elasticsearch/blob/7.10/server/src/main/java/org/elasticsearch/index/query/IntervalBuilder.java#L54-L106]
> I also modified it slightly to fit the static-method-based Lucene API. I also added a small test that showcases how this method can be used in practice (and why it's hard to accomplish the same result with existing methods).
> The only thing I'm not sure is how to attribute Elasticsearch properly - in the notice file, perhaps?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org