Posted to dev@lucene.apache.org by "Michael McCandless (JIRA)" <ji...@apache.org> on 2016/12/10 00:17:58 UTC
[jira] [Commented] (LUCENE-7589) Prevent outliers from raising the
number of bits of everyone with numeric doc values
[ https://issues.apache.org/jira/browse/LUCENE-7589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15736776#comment-15736776 ]
Michael McCandless commented on LUCENE-7589:
--------------------------------------------
The patch looks great, just this minor typo:
{{ values for te next block.}} --> {{ values for the next block.}}
This seems to give a ~3.7% reduction in the disk space used by doc values for sparse taxis!
> Prevent outliers from raising the number of bits of everyone with numeric doc values
> ------------------------------------------------------------------------------------
>
> Key: LUCENE-7589
> URL: https://issues.apache.org/jira/browse/LUCENE-7589
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Assignee: Adrien Grand
> Priority: Minor
> Attachments: LUCENE-7589.patch
>
>
> Today we encode entire segments with a single number of bits per value. It was done this way because it was faster, but it also means a single outlier can significantly increase the space requirements. I think we should have protection against that.
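
To illustrate the issue description above, here is a minimal sketch comparing one bit width for a whole segment against a separate bit width per block. It is not the code from LUCENE-7589.patch: the class name, the block size, and the per-block header cost (a 64-bit minimum plus an 8-bit width) are all assumptions made for illustration.

```java
// Hypothetical sketch, not Lucene code: estimates the bits needed to store
// numeric values with one bit width for everything versus one per block.
final class BlockBitsSketch {

    // Assumed per-block overhead: 64 bits for the block minimum plus
    // 8 bits for the block's bit width. Purely illustrative.
    static final int HEADER_BITS_PER_BLOCK = 72;

    // Bits needed to store (value - min) for every value in [from, to).
    // Returns 0 when all values in the range are equal.
    // Assumes max - min does not overflow a long.
    static int bitsRequired(long[] values, int from, int to) {
        long min = Long.MAX_VALUE;
        long max = Long.MIN_VALUE;
        for (int i = from; i < to; i++) {
            min = Math.min(min, values[i]);
            max = Math.max(max, values[i]);
        }
        return 64 - Long.numberOfLeadingZeros(max - min);
    }

    // One bit width shared by every value: a single outlier widens all of them.
    static long totalBitsSingleWidth(long[] values) {
        return (long) values.length * bitsRequired(values, 0, values.length);
    }

    // One bit width per block: an outlier only widens its own block.
    static long totalBitsPerBlock(long[] values, int blockSize) {
        long total = 0;
        for (int from = 0; from < values.length; from += blockSize) {
            int to = Math.min(values.length, from + blockSize);
            total += HEADER_BITS_PER_BLOCK;
            total += (long) (to - from) * bitsRequired(values, from, to);
        }
        return total;
    }

    public static void main(String[] args) {
        // 1000 small values (0..15) with a single large outlier.
        long[] values = new long[1000];
        for (int i = 0; i < values.length; i++) {
            values[i] = i % 16;
        }
        values[500] = 1_000_000L;

        System.out.println("single width: " + totalBitsSingleWidth(values) + " bits");
        System.out.println("per block:    " + totalBitsPerBlock(values, 100) + " bits");
    }
}
```

With mostly small values and one outlier, the per-block estimate comes out well below the single-width one even after paying the assumed header cost for each block. How large the saving is on real data depends on the block size and on how the outliers are distributed.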