Posted to issues@hbase.apache.org by "Joydeep Sen Sarma (JIRA)" <ji...@apache.org> on 2011/05/07 03:07:03 UTC

[jira] [Commented] (HBASE-3763) Add Bloom Block Index Support

    [ https://issues.apache.org/jira/browse/HBASE-3763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030242#comment-13030242 ] 

Joydeep Sen Sarma commented on HBASE-3763:
------------------------------------------

Dhruba pointed me to some of these JIRAs.

One quick comment: _if_ the intention is to keep the filters pinned in memory, then instead of loading them at read time we can:
- load them at startup time, as quickly as possible
- keep the filter pinned in memory when writing out a new HFile (so we never have to read it back in)

This would also take filter reads out of the client read path.
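
To make the suggestion concrete, here is a rough sketch of the pinning idea. Everything in it is hypothetical: PinnedBloomCache, Bloom, loadAtStartup and pinOnWrite are made-up names for illustration, not HBase classes or methods, and the toy Bloom filter stands in for whatever the real filter implementation is. The point is only that the loader runs at startup, the writer hands over the filter it already built, and the read path never touches disk for a filter.

```java
import java.util.BitSet;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

/** Hypothetical sketch of pinned Bloom filters; not an HBase API. */
public class PinnedBloomCache {

    /** Toy Bloom filter (not thread-safe), used only to keep the sketch self-contained. */
    public static final class Bloom {
        private final BitSet bits;
        private final int numBits;
        private final int numHashes;

        public Bloom(int numBits, int numHashes) {
            this.bits = new BitSet(numBits);
            this.numBits = numBits;
            this.numHashes = numHashes;
        }

        public void add(String key) {
            for (int i = 0; i < numHashes; i++) {
                bits.set(index(key, i));
            }
        }

        public boolean mightContain(String key) {
            for (int i = 0; i < numHashes; i++) {
                if (!bits.get(index(key, i))) {
                    return false;
                }
            }
            return true;
        }

        // Double hashing: derive the i-th probe position from two base hashes.
        private int index(String key, int i) {
            int h1 = key.hashCode();
            int h2 = (h1 >>> 16) | 1;
            return Math.floorMod(h1 + i * h2, numBits);
        }
    }

    // File name -> filter; entries stay here for the life of the process.
    private final Map<String, Bloom> pinned = new ConcurrentHashMap<>();
    // Assumed stand-in for "read the filter for this file from disk".
    private final Function<String, Bloom> diskLoader;

    public PinnedBloomCache(Function<String, Bloom> diskLoader) {
        this.diskLoader = diskLoader;
    }

    /** Startup: eagerly load and pin the filters of all existing files. */
    public void loadAtStartup(List<String> existingFiles) {
        for (String file : existingFiles) {
            pinned.put(file, diskLoader.apply(file));
        }
    }

    /** Write path: pin the filter built while writing, so it is never read back. */
    public void pinOnWrite(String newFile, Bloom builtInMemory) {
        pinned.put(newFile, builtInMemory);
    }

    /** Read path: consults only pinned filters and never calls the disk loader. */
    public boolean mightContain(String file, String key) {
        Bloom bloom = pinned.get(file);
        // With no filter we cannot rule the file out, so answer "maybe".
        return bloom == null || bloom.mightContain(key);
    }
}
```

The trade-off is the obvious one: startup does more work and memory holds every filter, in exchange for reads that never wait on a filter load.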

> Add Bloom Block Index Support
> -----------------------------
>
>                 Key: HBASE-3763
>                 URL: https://issues.apache.org/jira/browse/HBASE-3763
>             Project: HBase
>          Issue Type: Improvement
>          Components: io, regionserver
>    Affects Versions: 0.89.20100924, 0.90.0, 0.90.1, 0.90.2
>            Reporter: Mikhail Bautin
>            Assignee: Mikhail Bautin
>            Priority: Minor
>              Labels: hbase, performance
>             Fix For: 0.89.20100924
>
>   Original Estimate: 0h
>  Remaining Estimate: 0h
>
> Add a way to save HBase Bloom filters as an array of Meta blocks instead of one big Meta block, and load only the blocks required to answer a query. This will give us faster Bloom load times for large StoreFiles and pave the way for adding Bloom filter support to HFileOutputFormat bulk load.
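
For readers who want a picture of the "array of blocks" idea in the description above, here is a minimal sketch. It makes assumptions the issue text does not state: keys arrive sorted (as they do when a store file is written), each block covers a contiguous key range, and a small in-memory index of first keys picks the one block to load. BlockedBloomSketch and all of its members are hypothetical names, and the in-memory list merely stands in for Meta blocks on disk.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch of a Bloom filter split into per-key-range blocks. */
public class BlockedBloomSketch {
    private static final int BITS_PER_BLOCK = 512;
    private static final int NUM_HASHES = 3;

    // Small index kept in memory: first key of each block -> block number.
    private final TreeMap<String, Integer> blockIndex = new TreeMap<>();
    // Stand-in for the array of Meta blocks stored in the file.
    private final List<BitSet> storedBlocks = new ArrayList<>();
    // Counts simulated block reads, to show that a query touches one block.
    private int blocksLoaded = 0;

    /** Builds the blocks from keys that are already sorted. */
    public BlockedBloomSketch(List<String> sortedKeys, int keysPerBlock) {
        if (keysPerBlock <= 0) {
            throw new IllegalArgumentException("keysPerBlock must be positive");
        }
        for (int i = 0; i < sortedKeys.size(); i++) {
            String key = sortedKeys.get(i);
            if (i % keysPerBlock == 0) {
                // Start a new block and record its first key in the index.
                blockIndex.put(key, storedBlocks.size());
                storedBlocks.add(new BitSet(BITS_PER_BLOCK));
            }
            setBits(storedBlocks.get(storedBlocks.size() - 1), key);
        }
    }

    /** Loads only the block whose key range could contain the key. */
    public boolean mightContain(String key) {
        Map.Entry<String, Integer> entry = blockIndex.floorEntry(key);
        if (entry == null) {
            return false; // key sorts before every stored key; no block needed
        }
        BitSet block = loadBlock(entry.getValue());
        for (int i = 0; i < NUM_HASHES; i++) {
            if (!block.get(index(key, i))) {
                return false;
            }
        }
        return true;
    }

    public int blocksLoaded() {
        return blocksLoaded;
    }

    // Simulated disk read of a single block.
    private BitSet loadBlock(int blockNumber) {
        blocksLoaded++;
        return storedBlocks.get(blockNumber);
    }

    private static void setBits(BitSet block, String key) {
        for (int i = 0; i < NUM_HASHES; i++) {
            block.set(index(key, i));
        }
    }

    // Double hashing: derive the i-th probe position from two base hashes.
    private static int index(String key, int i) {
        int h1 = key.hashCode();
        int h2 = (h1 >>> 16) | 1;
        return Math.floorMod(h1 + i * h2, BITS_PER_BLOCK);
    }
}
```

With 20 sorted keys and 5 keys per block this builds 4 blocks, and a single lookup reads exactly one of them rather than the whole filter.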

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira