Posted to issues@hawq.apache.org by "Radar Lei (JIRA)" <ji...@apache.org> on 2018/04/09 08:28:00 UTC
[jira] [Resolved] (HAWQ-1549) Re-syncing standby fails even when stop mode is fast
[ https://issues.apache.org/jira/browse/HAWQ-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Radar Lei resolved HAWQ-1549.
-----------------------------
Resolution: Fixed
Fix Version/s: 2.3.0.0-incubating
> Re-syncing standby fails even when stop mode is fast
> -----------------------------------------------------
>
> Key: HAWQ-1549
> URL: https://issues.apache.org/jira/browse/HAWQ-1549
> Project: Apache HAWQ
> Issue Type: Bug
> Components: Command Line Tools, Standby master
> Reporter: Shubham Sharma
> Assignee: Shubham Sharma
> Priority: Major
> Fix For: 2.3.0.0-incubating
>
>
> Re-syncing the standby from the hawq command line fails while a client connection is open, even when the fast stop mode is requested.
> Steps to reproduce:
> 1 - Open a client connection to hawq using psql
> 2 - From a different terminal run command - hawq init standby -n -v -M fast
> 3 - Standby resync fails with error
> {code}
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-There are other connections to this instance, shutdown mode smart aborted
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-Either remove connections, or use 'hawq stop master -M fast' or 'hawq stop master -M immediate'
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-See hawq stop --help for all options
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[ERROR]:-Active connections. Aborting shutdown...
> 20171113:03:49:21:158143 hawq_init:hdp3:gpadmin-[ERROR]:-Stop hawq cluster failed, exit
> {code}
> 4 - Expected: when -M fast (stop mode) is passed, the stop should terminate existing client connections.
> The source of this issue appears to be the _resync_standby method in tools/bin/hawq_ctl. When it is called, the command it builds does not include the stop mode that was passed in the arguments, so the master is stopped in the default smart mode.
> {code}
> def _resync_standby(self):
>     logger.info("Re-sync standby")
>     cmd = "%s; hawq stop master -a;" % source_hawq_env
>     check_return_code(local_ssh(cmd, logger), logger, "Stop hawq cluster failed, exit")
>     ......
>     ......
> {code}
> I can start this and submit a PR when changes are done.
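
For illustration only, here is a minimal sketch of how the stop command could carry the requested stop mode. It is not the actual hawq_ctl code or the merged fix: the helper name build_stop_master_cmd, the VALID_STOP_MODES tuple, and the environment script path are hypothetical. The -a and -M flags and the smart, fast, and immediate modes are taken from the report and log output above; passing them together on one command line is an assumption.

```python
# Hypothetical helper, not part of hawq_ctl. It only builds the command string.
VALID_STOP_MODES = ("smart", "fast", "immediate")  # modes named in the log above


def build_stop_master_cmd(source_hawq_env, stop_mode="smart"):
    """Return the shell command that stops the master with the given stop mode."""
    if stop_mode not in VALID_STOP_MODES:
        raise ValueError("unknown stop mode: %r" % (stop_mode,))
    # Assumption: -a and -M can be combined on one 'hawq stop master' call.
    return "%s; hawq stop master -a -M %s;" % (source_hawq_env, stop_mode)


if __name__ == "__main__":
    # Placeholder environment script path, for demonstration only.
    print(build_stop_master_cmd("source /path/to/greenplum_path.sh", "fast"))
```

With a helper like this, _resync_standby would pass the stop mode from the parsed arguments instead of building a command that always falls back to the default mode.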
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)