Posted to issues@hbase.apache.org by "Lars Hofhansl (JIRA)" <ji...@apache.org> on 2012/08/09 23:00:28 UTC

[jira] [Commented] (HBASE-6550) Refactoring ReplicationSink to make it more responsive of cluster health

    [ https://issues.apache.org/jira/browse/HBASE-6550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13432144#comment-13432144 ] 

Lars Hofhansl commented on HBASE-6550:
--------------------------------------

Looks like this should work.

I had something simpler in mind:
# Have a decorated conf (like you do), set client pause/retry and also lower client rpc timeout.
# Create an unmanaged HConnectionImplementation and an Executor
# For each batch create new HTable(connection, executor)
# apply batch
# close the created HTable.
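
The steps above can be sketched roughly as follows. This is only an illustration of the pattern, not the patch and not HBase client code: `Connection` and `Table` are hypothetical stand-ins for the unmanaged connection and the per-batch HTable, the configuration is modelled as a plain `Map`, and the timeout and retry values are made-up examples. The three property names are assumed to be the usual client settings for pause, retries and rpc timeout.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Future;

public class ReplicationSinkSketch {

    /** Hypothetical stand-in for a long-lived, unmanaged cluster connection. */
    public interface Connection {
        void send(String table, List<String> mutations) throws Exception;
    }

    /** Hypothetical stand-in for a cheap per-batch table handle. */
    public static final class Table implements AutoCloseable {
        private final String name;
        private final Connection conn;
        private final ExecutorService pool;

        public Table(String name, Connection conn, ExecutorService pool) {
            this.name = name;
            this.conn = conn;
            this.pool = pool;
        }

        /** Runs the batch on the shared pool; any failure propagates to the caller. */
        public void batch(List<String> mutations) throws Exception {
            Future<Object> f = pool.submit(() -> {
                conn.send(name, mutations);
                return null;
            });
            f.get(); // rethrows a failure as ExecutionException, so the sink bails out
        }

        @Override
        public void close() {
            // Nothing to release: the connection and the pool are shared and
            // owned by the caller.
        }
    }

    /** Step 1: copy the base conf and tighten the client settings (example values). */
    public static Map<String, String> decorate(Map<String, String> base) {
        Map<String, String> conf = new HashMap<>(base);
        conf.put("hbase.client.pause", "100");        // assumed key, illustrative value
        conf.put("hbase.client.retries.number", "1"); // assumed key, illustrative value
        conf.put("hbase.rpc.timeout", "5000");        // assumed key, illustrative value
        return conf;
    }

    /** Steps 3-5: one short-lived Table per batch, apply it, close it. */
    public static int replicate(Connection conn, ExecutorService pool,
                                Map<String, List<String>> batches) throws Exception {
        int applied = 0;
        for (Map.Entry<String, List<String>> e : batches.entrySet()) {
            try (Table table = new Table(e.getKey(), conn, pool)) {
                table.batch(e.getValue());
                applied += e.getValue().size();
            }
        }
        return applied;
    }
}
```

The connection and executor (step 2) are created once by the caller and passed in, so each per-batch `Table` stays cheap to create and close.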

Seems that would be more readable...?

                
> Refactoring ReplicationSink to make it more responsive of cluster health
> ------------------------------------------------------------------------
>
>                 Key: HBASE-6550
>                 URL: https://issues.apache.org/jira/browse/HBASE-6550
>             Project: HBase
>          Issue Type: New Feature
>          Components: replication
>            Reporter: Himanshu Vashishtha
>            Assignee: Himanshu Vashishtha
>         Attachments: HBase-6550-v1.patch
>
>
> ReplicationSink replicates the WALEdits in the local cluster. It uses the native HBase client to insert the mutations. Sometimes a batch takes a while to process (possibly due to region splitting, a GC pause, etc.) and the client enters its retry phase.
> This has two repercussions:
> a) The regionserver handler serving the request (until now, a priority handler) is blocked for this period.
> b) The caller may time out and retry anyway, while the handler serving the original ReplicationSink request is still working.
> Refactor ReplicationSink to have the following features:
> a) Make it more configurable (its own retry limit, connection timeout, etc.).
> b) Add fail-fast behavior so that it bails out if the caller has timed out or any exception occurs while processing the mutation batch.
