You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "David Smiley (JIRA)" <ji...@apache.org> on 2015/11/17 18:40:11 UTC

[jira] [Updated] (LUCENE-6898) Avoid reading last stored field value when StoredFieldVisitor.Status.NO

     [ https://issues.apache.org/jira/browse/LUCENE-6898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David Smiley updated LUCENE-6898:
---------------------------------
    Attachment: LUCENE-6898.patch

Here's a simple patch.

I have no idea how much this optimization helps, but I imagine for it would help for medium to large docs.

> Avoid reading last stored field value when StoredFieldVisitor.Status.NO
> -----------------------------------------------------------------------
>
>                 Key: LUCENE-6898
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6898
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/codecs
>            Reporter: David Smiley
>            Assignee: David Smiley
>            Priority: Minor
>         Attachments: LUCENE-6898.patch
>
>
> CompressingStoredFieldsReader.visitDocument (line 597) loops through the fields in the input while consulting the StoredFieldVisitor on what to do.  There is a small optimization that could be done on the last loop iteration.  If the visitor returns Status.NO then it should be treated as equivalent to Status.STOP.  As it is now, it will call skipField() which reads needless bytes from the DataInput that won't be used.
> With this optimization in place, it is advisable to put the largest text field last in sequence -- something the user or search platform (e.g. ES/Solr) could do.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org