You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ambari.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2018/02/16 13:14:04 UTC
[jira] [Commented] (AMBARI-20593) EU/RU Auto-Retry does not
reschedule task when host is not heartbeating before task is scheduled and
doesn't have a start time
[ https://issues.apache.org/jira/browse/AMBARI-20593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16366990#comment-16366990 ]
Hudson commented on AMBARI-20593:
---------------------------------
FAILURE: Integrated in Jenkins build Ambari-trunk-Commit #8744 (See [https://builds.apache.org/job/Ambari-trunk-Commit/8744/])
AMBARI-20593. EU/RU Auto-Retry does not reschedule task when host is not (aonishuk: [https://gitbox.apache.org/repos/asf?p=ambari.git&a=commit&h=7766742fee7468f31bd3d9a3e9099d0325291394])
* (edit) ambari-server/src/test/java/org/apache/ambari/server/state/services/RetryUpgradeActionServiceTest.java
* (edit) ambari-server/src/main/java/org/apache/ambari/server/actionmanager/HostRoleCommand.java
* (edit) ambari-server/src/main/java/org/apache/ambari/server/state/services/RetryUpgradeActionService.java
> EU/RU Auto-Retry does not reschedule task when host is not heartbeating before task is scheduled and doesn't have a start time
> ------------------------------------------------------------------------------------------------------------------------------
>
> Key: AMBARI-20593
> URL: https://issues.apache.org/jira/browse/AMBARI-20593
> Project: Ambari
> Issue Type: Bug
> Components: ambari-server
> Affects Versions: 2.5.0
> Environment: rolling upgrade
> Reporter: Sviatoslav Tereshchenko
> Assignee: Alejandro Fernandez
> Priority: Major
> Labels: rolling_upgrade
> Fix For: 2.5.1
>
> Attachments: AMBARI-20593.branch-2.5.patch, AMBARI-20593.trunk.patch
>
>
> STR:
> 1) Install ambari 2.5.0.1
> In the ambari.properties file, set
> stack.upgrade.auto.retry.timeout.mins=6
> stack.upgrade.auto.retry.check.interval.secs=30
> 2) Install HDP with any set of services
> 3) Add NameNode HA
> 4) Register and install new HDP stack version
> 5) Start RU
> 5) Corrupt one step from Core Masters group (e.g., stop ambari-agent on a node while the command is running)
> Ambari will restart Restarting NN Batch 1
> 6) Fix corrupted step (e.g., start ambari-agent again)
> 7) Corrupt another step from before the command is scheduled (e.g., stop ambari-agent on a node)
> 8) Fix corrupted step (e.g., start ambari-agent agent)
> The expectation is that Ambari Server should schedule the command on the 2nd node. However, because the command never got an original_start_time and start_time, the RetryUpgradeActionService was not able to retry it since it didn't have any timestamps to compare against.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)