You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Julie Tibshirani (Jira)" <ji...@apache.org> on 2021/01/07 23:34:00 UTC

[jira] [Comment Edited] (LUCENE-9657) Unified Highlighter throws too_complex_to_determinize_exception with >288 filter terms

    [ https://issues.apache.org/jira/browse/LUCENE-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17260881#comment-17260881 ] 

Julie Tibshirani edited comment on LUCENE-9657 at 1/7/21, 11:33 PM:
--------------------------------------------------------------------

Hello [~idoub], I'd suggest filing an issue in the [Elasticsearch repo|https://github.com/elastic/elasticsearch] with the reproduction steps. I have some debugging ideas but it'd be best to discuss there, as the set-up is Elasticsearch-specific. If we then determine the problem is really in Lucene, we can file another issue here and dig further.


was (Author: julietibs):
Hello @Isaac Doub, I'd suggest filing an issue in the [Elasticsearch repo|https://github.com/elastic/elasticsearch] with the reproduction steps. I have some debugging ideas but it'd be best to discuss there, as the set-up is Elasticsearch-specific. If we then determine the problem is really in Lucene, we can file another issue here and dig further.

> Unified Highlighter throws too_complex_to_determinize_exception with >288 filter terms
> --------------------------------------------------------------------------------------
>
>                 Key: LUCENE-9657
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9657
>             Project: Lucene - Core
>          Issue Type: Bug
>    Affects Versions: 8.6.2
>            Reporter: Isaac Doub
>            Priority: Major
>
> There seems to be a problem with the Unified Highlighter in Lucene 8.6.2 that is affecting ElasticSearch 7.9.1. If a search is performed with >288 filter terms using the unified highlighter it throws a too_complex_to_determinize_exception error, but if you switch to the plain highlighter it works fine. Alternatively, if you filter on a "copy_to" field instead of the indexed field, it also works.
>  
> This throws the error
> {code:java}
>  {
>     "highlight": {
>         "type": "unified",
>         "fields": {
>             "title": {
>                 "require_field_match": false
>             }
>         }
>     },
>     "query": {
>         "bool": {
>             "must": [{
>                 "query_string": {
>                     "query": "*"
>                 }
>             }],
>             "filter": [{
>                 "bool": {
>                     "must": [{
>                         "terms": {
>                             "id": [ ">288 terms here" ]
>                         }
>                     }]
>                 }
>             }]
>         }
>     }
> }{code}
>  
>  
> But this works fine
> {code:java}
>  {
>     "highlight": {
>         "type": "plain",
>         "fields": {
>             "title": {
>                 "require_field_match": false
>             }
>         }
>     },
>     "query": {
>         "bool": {
>             "must": [{
>                 "query_string": {
>                     "query": "*"
>                 }
>             }],
>             "filter": [{
>                 "bool": {
>                     "must": [{
>                         "terms": {
>                             "id": [ ">288 terms here" ]
>                         }
>                     }]
>                 }
>             }]
>         }
>     }
> }{code}
>  
>  
> Or if I adjust the search to use the copy_to field it works as well (note "id" is now "_id")
> {code:java}
>  {
>     "highlight": {
>         "type": "unified",
>         "fields": {
>             "title": {
>                 "require_field_match": false
>             }
>         }
>     },
>     "query": {
>         "bool": {
>             "must": [{
>                 "query_string": {
>                     "query": "*"
>                 }
>             }],
>             "filter": [{
>                 "bool": {
>                     "must": [{
>                         "terms": {
>                             "_id": [ ">288 terms here" ]
>                         }
>                     }]
>                 }
>             }]
>         }
>     }
> }{code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org