You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Sahil Takiar (Jira)" <ji...@apache.org> on 2020/01/29 18:32:00 UTC

[jira] [Updated] (IMPALA-9340) statestore_max_missed_heartbeats is off by one

     [ https://issues.apache.org/jira/browse/IMPALA-9340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sahil Takiar updated IMPALA-9340:
---------------------------------
    Summary: statestore_max_missed_heartbeats is off by one  (was: Statestore statestore_max_missed_heartbeats is off by one)

> statestore_max_missed_heartbeats is off by one
> ----------------------------------------------
>
>                 Key: IMPALA-9340
>                 URL: https://issues.apache.org/jira/browse/IMPALA-9340
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Backend
>            Reporter: Sahil Takiar
>            Priority: Minor
>              Labels: newbie, ramp-up
>
> The flag {{statestore_max_missed_heartbeats}} says:
> {quote}Maximum number of consecutiveĀ heartbeat messages an impalad can miss before being declared failed by theĀ statestore.
> {quote}
> However, the implementation actually waits for {{statestore_max_missed_heartbeats}} + 1 missed heartbeats before considering the impalad as failed.
> Example when {{statestore_max_missed_heartbeats}} is set to 10 (the default value):
> {code:java}
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.214053 29932 failure-detector.cc:90] 1 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is OK
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.267143 29937 failure-detector.cc:90] 2 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is OK
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.320443 29938 failure-detector.cc:90] 3 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is OK
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.373548 29934 failure-detector.cc:90] 4 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is OK
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.426955 29929 failure-detector.cc:90] 5 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is OK
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.479981 29933 failure-detector.cc:90] 6 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is SUSPECTED
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.533097 29930 failure-detector.cc:90] 7 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is SUSPECTED
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.586172 29934 failure-detector.cc:90] 8 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is SUSPECTED
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.639999 29936 failure-detector.cc:90] 9 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is SUSPECTED
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.692075 29929 failure-detector.cc:90] 10 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is SUSPECTED
> logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 10:58:04.745105 29931 failure-detector.cc:90] 11 consecutive heartbeats failed for 'impalad@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. State is FAILED {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org