You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Robert Muir (JIRA)" <ji...@apache.org> on 2012/06/28 22:26:43 UTC

[jira] [Created] (LUCENE-4179) bulk-compress blocktree term suffix lengths

Robert Muir created LUCENE-4179:
-----------------------------------

             Summary: bulk-compress blocktree term suffix lengths
                 Key: LUCENE-4179
                 URL: https://issues.apache.org/jira/browse/LUCENE-4179
             Project: Lucene - Java
          Issue Type: Improvement
            Reporter: Robert Muir


these should be fairly tiny numbers, it seems wasteful to use a byte-per-term when they typically only need a few bits.

also if we have them in bulk, its a step towards random access.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4179) bulk-compress blocktree term suffix lengths

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir updated LUCENE-4179:
--------------------------------

    Attachment: LUCENE-4179.patch

patch (passes test, but not benchmarked).

i still dont like that we are making the suffix lengths larger by stealing bits for leaf/non-leaf, and we can maybe cleanup a few other things. 

but first we should make sure its smaller and just as fast, or faster and not larger :)
                
> bulk-compress blocktree term suffix lengths
> -------------------------------------------
>
>                 Key: LUCENE-4179
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4179
>             Project: Lucene - Java
>          Issue Type: Improvement
>            Reporter: Robert Muir
>         Attachments: LUCENE-4179.patch
>
>
> these should be fairly tiny numbers, it seems wasteful to use a byte-per-term when they typically only need a few bits.
> also if we have them in bulk, its a step towards random access.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4179) bulk-compress blocktree term suffix lengths

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir updated LUCENE-4179:
--------------------------------

    Attachment: LUCENE-4179_suffixBlock.patch

faster (at least with 1M wiki), slightly uglier version :)

will run the 10M wiki now... 
                
> bulk-compress blocktree term suffix lengths
> -------------------------------------------
>
>                 Key: LUCENE-4179
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4179
>             Project: Lucene - Java
>          Issue Type: Improvement
>            Reporter: Robert Muir
>         Attachments: LUCENE-4179.patch, LUCENE-4179_suffixBlock.patch
>
>
> these should be fairly tiny numbers, it seems wasteful to use a byte-per-term when they typically only need a few bits.
> also if we have them in bulk, its a step towards random access.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org