You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2021/04/14 19:01:00 UTC

[jira] [Commented] (LUCENE-9334) Require consistency between data-structures on a per-field basis

    [ https://issues.apache.org/jira/browse/LUCENE-9334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17321255#comment-17321255 ] 

ASF subversion and git services commented on LUCENE-9334:
---------------------------------------------------------

Commit d03662c48bfc5bf2be4840a7f743f9cb64b17fee in lucene's branch refs/heads/main from Mayya Sharipova
[ https://gitbox.apache.org/repos/asf?p=lucene.git;h=d03662c ]

LUCENE-9334 Consistency of field data structures 

Require consistency between data-structures on a per-field basis

A field must be indexed with the same index options and data-structures across
all documents. Thus, for example, it is not allowed to have one document
where a certain field is indexed with doc values and points, and another document 
where the same field is indexed only with points. 
But it is allowed for a document not to have a certain field at all.

As a consequence of this, doc values updates are
only applicable for fields that are indexed with doc values only. 


> Require consistency between data-structures on a per-field basis
> ----------------------------------------------------------------
>
>                 Key: LUCENE-9334
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9334
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Blocker
>             Fix For: main (9.0)
>
>          Time Spent: 14.5h
>  Remaining Estimate: 0h
>
> Follow-up of https://lists.apache.org/thread.html/r747de568afd7502008c45783b74cc3aeb31dab8aa60fcafaf65d5431%40%3Cdev.lucene.apache.org%3E.
> We would like to start requiring consitency across data-structures on a per-field basis in order to make it easier to do the right thing by default: range queries can run faster if doc values are enabled, sorted queries can run faster if points by indexed, etc.
> This would be a big change, so it should be rolled out in a major.
> Strict validation is tricky to implement, but we should still implement best-effort validation:
>  - Documents all use the same data-structures, e.g. it is illegal for a document to only enable points and another document to only enable doc values,
>  - When possible, check whether values are consistent too.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org