You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Brandon Williams (JIRA)" <ji...@apache.org> on 2010/01/19 06:04:54 UTC

[jira] Created: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

HHOM goes into infinite loop, wasting cpu
-----------------------------------------

                 Key: CASSANDRA-715
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
             Project: Cassandra
          Issue Type: Bug
         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)

            Reporter: Brandon Williams


To replicate: take a host down, cause hints to it, wait for HHOM to kick in

The issue is line 201 of HHOM:
startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it

That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis resolved CASSANDRA-715.
--------------------------------------

    Resolution: Fixed

committed

> HHOM goes into infinite loop, wasting cpu
> -----------------------------------------
>
>                 Key: CASSANDRA-715
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6
>         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)
>            Reporter: Brandon Williams
>            Assignee: Jonathan Ellis
>             Fix For: 0.6
>
>         Attachments: 715.txt
>
>
> To replicate: take a host down, cause hints to it, wait for HHOM to kick in
> The issue is line 201 of HHOM:
> startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it
> That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis updated CASSANDRA-715:
-------------------------------------

          Component/s: Core
        Fix Version/s: 0.6
    Affects Version/s: 0.6

> HHOM goes into infinite loop, wasting cpu
> -----------------------------------------
>
>                 Key: CASSANDRA-715
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6
>         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)
>            Reporter: Brandon Williams
>            Assignee: Jonathan Ellis
>             Fix For: 0.6
>
>
> To replicate: take a host down, cause hints to it, wait for HHOM to kick in
> The issue is line 201 of HHOM:
> startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it
> That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

Posted by "Brandon Williams (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802390#action_12802390 ] 

Brandon Williams commented on CASSANDRA-715:
--------------------------------------------

+1, infinite loop no longer occurs.

> HHOM goes into infinite loop, wasting cpu
> -----------------------------------------
>
>                 Key: CASSANDRA-715
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6
>         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)
>            Reporter: Brandon Williams
>            Assignee: Jonathan Ellis
>             Fix For: 0.6
>
>         Attachments: 715.txt
>
>
> To replicate: take a host down, cause hints to it, wait for HHOM to kick in
> The issue is line 201 of HHOM:
> startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it
> That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis updated CASSANDRA-715:
-------------------------------------

    Attachment: 715.txt

> HHOM goes into infinite loop, wasting cpu
> -----------------------------------------
>
>                 Key: CASSANDRA-715
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6
>         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)
>            Reporter: Brandon Williams
>            Assignee: Jonathan Ellis
>             Fix For: 0.6
>
>         Attachments: 715.txt
>
>
> To replicate: take a host down, cause hints to it, wait for HHOM to kick in
> The issue is line 201 of HHOM:
> startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it
> That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802305#action_12802305 ] 

Jonathan Ellis commented on CASSANDRA-715:
------------------------------------------

until this is fixed, running nodeprobe cleanup on each live node & restarting it should fix this (by removing undelivered hints forcibly).  depending on your replication factor you may have hints that are not removed by cleanup; in that case you can remove the hint files from data/system/*Hint*.

> HHOM goes into infinite loop, wasting cpu
> -----------------------------------------
>
>                 Key: CASSANDRA-715
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6
>         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)
>            Reporter: Brandon Williams
>             Fix For: 0.6
>
>
> To replicate: take a host down, cause hints to it, wait for HHOM to kick in
> The issue is line 201 of HHOM:
> startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it
> That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CASSANDRA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803680#action_12803680 ] 

Hudson commented on CASSANDRA-715:
----------------------------------

Integrated in Cassandra #331 (See [http://hudson.zones.apache.org/hudson/job/Cassandra/331/])
    

> HHOM goes into infinite loop, wasting cpu
> -----------------------------------------
>
>                 Key: CASSANDRA-715
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6
>         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)
>            Reporter: Brandon Williams
>            Assignee: Jonathan Ellis
>             Fix For: 0.6
>
>         Attachments: 715.txt
>
>
> To replicate: take a host down, cause hints to it, wait for HHOM to kick in
> The issue is line 201 of HHOM:
> startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it
> That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Assigned: (CASSANDRA-715) HHOM goes into infinite loop, wasting cpu

Posted by "Jonathan Ellis (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CASSANDRA-715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Ellis reassigned CASSANDRA-715:
----------------------------------------

    Assignee: Jonathan Ellis

> HHOM goes into infinite loop, wasting cpu
> -----------------------------------------
>
>                 Key: CASSANDRA-715
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-715
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.6
>         Environment: debian lenny amd64 OpenJDK 64-Bit Server VM (build 1.6.0_0-b11, mixed mode)
>            Reporter: Brandon Williams
>            Assignee: Jonathan Ellis
>             Fix For: 0.6
>
>
> To replicate: take a host down, cause hints to it, wait for HHOM to kick in
> The issue is line 201 of HHOM:
> startColumn = keyColumn.name(); // repeating the last as the first is fine since we just deleted it
> That comment is false.  The column may not have been deleted, since the endpoint could still be down.  This causes HHOM to go into an infinite loop trying to deliver hints to a down host.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.