You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Paul Elschot (JIRA)" <ji...@apache.org> on 2013/07/14 12:02:49 UTC

[jira] [Comment Edited] (LUCENE-2750) add Kamikaze 3.0.1 into Lucene

    [ https://issues.apache.org/jira/browse/LUCENE-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13707993#comment-13707993 ] 

Paul Elschot edited comment on LUCENE-2750 at 7/14/13 10:01 AM:
----------------------------------------------------------------

For a moment ignoring the fact Elias-Fano requires ordered input and PFor allows random input:
PFor normally has better compression than Elias-Fano when the exceptional values have a lot more bits than the normal values that fit in the low bits.
In such cases Elias-Fano has to use an upperbound that is too large for effective compression.

Also Elias-Fano could still be improved by adding block decoding and using that when most of the block is needed.

                
      was (Author: paul.elschot@xs4all.nl):
    For a moment ignoring the fact Elias-Fano requires ordered input and PForDelta allows random input:
PForDelta normally has better compression than Elias-Fano when the exceptional values have a lot more bits than the normal values that fit in the low bits.
In such cases Elias-Fano has to use an upperbound that is too large for effective compression.

Also Elias-Fano could still be improved by adding block decoding and using that when most of the block is needed.

                  
> add Kamikaze 3.0.1 into Lucene
> ------------------------------
>
>                 Key: LUCENE-2750
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2750
>             Project: Lucene - Core
>          Issue Type: Sub-task
>          Components: modules/other
>            Reporter: hao yan
>            Assignee: Adrien Grand
>         Attachments: LUCENE-2750.patch
>
>   Original Estimate: 336h
>  Remaining Estimate: 336h
>
> Kamikaze 3.0.1 is the updated version of Kamikaze 2.0.0. It can achieve significantly better performance then Kamikaze 2.0.0 in terms of both compressed size and decompression speed. The main difference between the two versions is Kamikaze 3.0.x uses the much more efficient implementation of the PForDelta compression algorithm. My goal is to integrate the highly efficient PForDelta implementation into Lucene Codec.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org