
Posted to dev@hbase.apache.org by "stack (JIRA)" <ji...@apache.org> on 2008/08/07 00:44:44 UTC

[jira] Commented: (HBASE-748) Add an efficient way to batch update many rows

    [ https://issues.apache.org/jira/browse/HBASE-748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12620463#action_12620463 ] 

stack commented on HBASE-748:
-----------------------------

Do you think the HTable client should sort and organize edits into batches, or should the calling application do that?  Hypertable appears to do the former.  From reading the Hypertable user list, it looks like they have a mechanism for buffering edits in the client.  When the client update buffer is full, it flushes the edits in batches, with each batch going to the appropriate rangeserver.  There is also an explicit flush call that sends the current set of edits.
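
To make the buffering idea concrete, here is a minimal sketch of a client-side update buffer with an automatic flush when full and an explicit flush. Everything in it is an assumption for illustration: the class name BufferedEdits, the use of an edit count (rather than bytes) as the threshold, String edits, and the Consumer that stands in for whatever RPC would ship a batch. It is not the HTable API and not Hypertable's implementation.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical client-side write buffer; not part of the HTable API.
class BufferedEdits {
    // Flush threshold. Assumed here to be a count of edits, not a byte size.
    private final int maxEdits;
    // Stands in for the RPC that would ship one batch to a server.
    private final Consumer<List<String>> sender;
    private final List<String> buffer = new ArrayList<>();

    BufferedEdits(int maxEdits, Consumer<List<String>> sender) {
        if (maxEdits < 1) {
            throw new IllegalArgumentException("maxEdits must be at least 1");
        }
        this.maxEdits = maxEdits;
        this.sender = sender;
    }

    // Queue one edit; flush automatically once the buffer is full.
    void put(String edit) {
        buffer.add(edit);
        if (buffer.size() >= maxEdits) {
            flush();
        }
    }

    // Explicit flush: send a copy of whatever is buffered, then clear it.
    void flush() {
        if (buffer.isEmpty()) {
            return;
        }
        sender.accept(new ArrayList<>(buffer));
        buffer.clear();
    }
}
```

With a threshold of 2, putting three edits sends the first two automatically and leaves the third buffered until flush() is called. If the client owns this buffer, the calling application only has to remember the final flush.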

> Add an efficient way to batch update many rows
> ----------------------------------------------
>
>                 Key: HBASE-748
>                 URL: https://issues.apache.org/jira/browse/HBASE-748
>             Project: Hadoop HBase
>          Issue Type: New Feature
>          Components: client
>    Affects Versions: 0.1.3, 0.2.0
>            Reporter: Jean-Daniel Cryans
>            Assignee: Jean-Daniel Cryans
>             Fix For: 0.3.0
>
>
> HBASE-747 introduced a simple way to batch update many rows. The goal of this issue is an enhanced version that sends many rows in a single RPC to each region server. To do this, the client code will have to figure out which rows go to which server, group them accordingly, and then send them.
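
The grouping step described in the issue could look roughly like the sketch below. It assumes the client already holds a sorted map from region start key to hosting server, and it uses String row keys for readability where HBase row keys are byte arrays. RowGrouper, addRegion, serverFor and groupByServer are hypothetical names, not existing client methods.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper; names and String row keys are illustrative only.
class RowGrouper {
    // Region start key -> server hosting that region.
    // The first region of a table is assumed to start at the empty key "".
    private final TreeMap<String, String> regionStartToServer = new TreeMap<>();

    void addRegion(String startKey, String server) {
        regionStartToServer.put(startKey, server);
    }

    // The region holding a row is the one with the greatest start key <= row.
    String serverFor(String row) {
        Map.Entry<String, String> entry = regionStartToServer.floorEntry(row);
        if (entry == null) {
            throw new IllegalStateException("no region covers row " + row);
        }
        return entry.getValue();
    }

    // Bucket rows per server so that each bucket can go out in a single RPC.
    Map<String, List<String>> groupByServer(List<String> rows) {
        Map<String, List<String>> batches = new LinkedHashMap<>();
        for (String row : rows) {
            batches.computeIfAbsent(serverFor(row), s -> new ArrayList<>()).add(row);
        }
        return batches;
    }
}
```

Each entry of the returned map is one batch for one server. A real client would also have to cope with regions that split or move between the lookup and the send, which this sketch ignores.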

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.