You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@lucene.apache.org by "Robert Muir (JIRA)" <ji...@apache.org> on 2014/05/30 23:50:02 UTC

[jira] [Commented] (LUCENE-5720) Optimize on disk packed integers part 2

    [ https://issues.apache.org/jira/browse/LUCENE-5720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14014261#comment-14014261 ] 

Robert Muir commented on LUCENE-5720:
-------------------------------------

I tried to hack luceneutil up for a performance test, not sure wikipedia 'title' is the best, but i tried on 1M:

Size: 500KB increase in docvalues data (5.7MB -> 6.2MB)
Note that in context, the entire index is 385MB (no stored fields or vectors), so the 500KB docvalues increase is negligible.

20% improvement in sort performance.

> Optimize on disk packed integers part 2
> ---------------------------------------
>
>                 Key: LUCENE-5720
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5720
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Robert Muir
>             Fix For: 4.9, 5.0
>
>         Attachments: LUCENE-5720.patch
>
>
> These are heavily optimized for the in-RAM case (for example FieldCache uses PackedInts.FAST to make it even faster so), but for the docvalues case they are not: we always essentially use COMPACT, we have only one decoder that must solve all the cases, even the complicated ones, we use BlockPackedWriter for all integers (even if they are ordinals), etc.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org