You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2008/11/25 01:37:44 UTC

[jira] Commented: (HBASE-1024) Rearchitect regionserver I/O

    [ https://issues.apache.org/jira/browse/HBASE-1024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12650413#action_12650413 ] 

stack commented on HBASE-1024:
------------------------------

Good stuff Andrew.  I concur.  Lets take on these goals.  Shall we put up a wiki page pointing to this issue that talks of rearchitecting project?   and maybe belong in a working document of their own.  Perhaps a page up on wiki -- like http://wiki.apache.org/hadoop/Hbase/NewFileFormat -- with perhaps a link under roadmap or maybe we need an architectural goals section where we stable these targets (and X-them out as we knock them off)?

> Rearchitect regionserver I/O
> ----------------------------
>
>                 Key: HBASE-1024
>                 URL: https://issues.apache.org/jira/browse/HBASE-1024
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: io, regionserver
>            Reporter: Andrew Purtell
>
> Consider supporting:
> * 2GB store files
> * 1TB per node (500 store files)
> * Cell values up to ~100MB
> * Typical use case of RS running with 1GB of heap only
> Some ideas:
> * Drop MapFile and make a custom store file format with (competing) design goals:
>       ** heap efficiency 
>       ** fast lookups
>       ** minimize I/O operations
>       ** optimize for typical DFS blocksizes (8MB, 64MB)
> * MRU cache for filehandles and store file indexes
> * Memory mapped store file indexes -- don't hold the indexes in heap; rely on the OS blockcache for performance
> * "Zero copy" I/O from IPC to store file and vice versa, like NIO buffers
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.