You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@hbase.apache.org by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2011/08/29 08:08:37 UTC

[jira] [Created] (HBASE-4274) RS should periodically ping its HLog pipeline even if no writes are arriving

RS should periodically ping its HLog pipeline even if no writes are arriving
----------------------------------------------------------------------------

                 Key: HBASE-4274
                 URL: https://issues.apache.org/jira/browse/HBASE-4274
             Project: HBase
          Issue Type: Improvement
          Components: regionserver, wal
    Affects Versions: 0.92.0
            Reporter: Todd Lipcon
            Assignee: Todd Lipcon
            Priority: Critical
             Fix For: 0.92.0


If you restart HDFS underneath HBase, when HBase isn't taking any write load, the region servers won't "notice" that there's any problem until the next time they take a write, at which point they will abort (because the pipeline is gone from beneath them). It would be better if they wrote some garbage to their HLog once every few seconds as a sort of keepalive, so they will aggressively abort as soon as there's an issue.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-4274) RS should periodically ping its HLog pipeline even if no writes are arriving

Posted by "Andrew Purtell (JIRA)" <ji...@apache.org>.

    [ https://issues.apache.org/jira/browse/HBASE-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13092698#comment-13092698 ] 

Andrew Purtell commented on HBASE-4274:
---------------------------------------

Doesn't HBASE-4222 already address this? It takes a different approach, and arguably a better one. No need to abort if a new HLog pipeline can be established.

> RS should periodically ping its HLog pipeline even if no writes are arriving
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-4274
>                 URL: https://issues.apache.org/jira/browse/HBASE-4274
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver, wal
>    Affects Versions: 0.92.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Critical
>             Fix For: 0.92.0
>
>
> If you restart HDFS underneath HBase, when HBase isn't taking any write load, the region servers won't "notice" that there's any problem until the next time they take a write, at which point they will abort (because the pipeline is gone from beneath them). It would be better if they wrote some garbage to their HLog once every few seconds as a sort of keepalive, so they will aggressively abort as soon as there's an issue.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-4274) RS should periodically ping its HLog pipeline even if no writes are arriving

Posted by "stack (Updated) (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HBASE-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

stack updated HBASE-4274:
-------------------------

    Fix Version/s:     (was: 0.92.0)
                   0.94.0

Moving out of 0.92.  This does not seem to be a critical 0.92 issue any more given Gary work.
                
> RS should periodically ping its HLog pipeline even if no writes are arriving
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-4274
>                 URL: https://issues.apache.org/jira/browse/HBASE-4274
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver, wal
>    Affects Versions: 0.92.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Critical
>             Fix For: 0.94.0
>
>
> If you restart HDFS underneath HBase, when HBase isn't taking any write load, the region servers won't "notice" that there's any problem until the next time they take a write, at which point they will abort (because the pipeline is gone from beneath them). It would be better if they wrote some garbage to their HLog once every few seconds as a sort of keepalive, so they will aggressively abort as soon as there's an issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-4274) RS should periodically ping its HLog pipeline even if no writes are arriving

Posted by "Andrew Purtell (JIRA)" <ji...@apache.org>.

    [ https://issues.apache.org/jira/browse/HBASE-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13093327#comment-13093327 ] 

Andrew Purtell commented on HBASE-4274:
---------------------------------------

In general we should opt for strategies that allow the RS to ride over short DFS interruptions, such as a rolling restart of DNs, or a switch reload or failover, or similar. So I lean toward -1 changes that make the RS more aggressive about terminating in such situations as long as we also reason carefully about (avoiding) data loss.

> RS should periodically ping its HLog pipeline even if no writes are arriving
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-4274
>                 URL: https://issues.apache.org/jira/browse/HBASE-4274
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver, wal
>    Affects Versions: 0.92.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Critical
>             Fix For: 0.92.0
>
>
> If you restart HDFS underneath HBase, when HBase isn't taking any write load, the region servers won't "notice" that there's any problem until the next time they take a write, at which point they will abort (because the pipeline is gone from beneath them). It would be better if they wrote some garbage to their HLog once every few seconds as a sort of keepalive, so they will aggressively abort as soon as there's an issue.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-4274) RS should periodically ping its HLog pipeline even if no writes are arriving

Posted by "Ted Yu (Commented) (JIRA)" <ji...@apache.org>.

    [ https://issues.apache.org/jira/browse/HBASE-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13129147#comment-13129147 ] 

Ted Yu commented on HBASE-4274:
-------------------------------

Gary has addressed rolling restart of DNs.
Can we move this issue to 0.94 ?
                
> RS should periodically ping its HLog pipeline even if no writes are arriving
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-4274
>                 URL: https://issues.apache.org/jira/browse/HBASE-4274
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver, wal
>    Affects Versions: 0.92.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Critical
>             Fix For: 0.92.0
>
>
> If you restart HDFS underneath HBase, when HBase isn't taking any write load, the region servers won't "notice" that there's any problem until the next time they take a write, at which point they will abort (because the pipeline is gone from beneath them). It would be better if they wrote some garbage to their HLog once every few seconds as a sort of keepalive, so they will aggressively abort as soon as there's an issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-4274) RS should periodically ping its HLog pipeline even if no writes are arriving

Posted by "Lars Hofhansl (Updated) (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HBASE-4274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lars Hofhansl updated HBASE-4274:
---------------------------------

    Fix Version/s:     (was: 0.94.0)
                   0.96.0

No movement, removing from 0.92.
                
> RS should periodically ping its HLog pipeline even if no writes are arriving
> ----------------------------------------------------------------------------
>
>                 Key: HBASE-4274
>                 URL: https://issues.apache.org/jira/browse/HBASE-4274
>             Project: HBase
>          Issue Type: Improvement
>          Components: regionserver, wal
>    Affects Versions: 0.92.0
>            Reporter: Todd Lipcon
>            Assignee: Todd Lipcon
>            Priority: Critical
>             Fix For: 0.96.0
>
>
> If you restart HDFS underneath HBase, when HBase isn't taking any write load, the region servers won't "notice" that there's any problem until the next time they take a write, at which point they will abort (because the pipeline is gone from beneath them). It would be better if they wrote some garbage to their HLog once every few seconds as a sort of keepalive, so they will aggressively abort as soon as there's an issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira