You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Klaus Ma (JIRA)" <ji...@apache.org> on 2015/09/01 08:32:46 UTC
[jira] [Commented] (MESOS-3351) nextSlaveId in master was not
updated when recover
[ https://issues.apache.org/jira/browse/MESOS-3351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14724868#comment-14724868 ]
Klaus Ma commented on MESOS-3351:
---------------------------------
[~vinodkone], I add you as shepherd because it's also a duplicated ID issue.
> nextSlaveId in master was not updated when recover
> --------------------------------------------------
>
> Key: MESOS-3351
> URL: https://issues.apache.org/jira/browse/MESOS-3351
> Project: Mesos
> Issue Type: Bug
> Components: master
> Environment: Mac OS (Darwin da-macbookair.cn.ibm.com 14.5.0 Darwin Kernel Version 14.5.0: Wed Jul 29 02:26:53 PDT 2015; root:xnu-2782.40.9~1/RELEASE_X86_64 x86_64)
> Reporter: Klaus Ma
> Assignee: Klaus Ma
> Labels: race-condition, uuid
> Attachments: test.log
>
>
> When a slave register to master, master will generate a slave ID for it by slaveInfo.id + "-S" + nextSlaveId (in master.cpp) to avoid duplicate slaveInfo.id. But if master failover, nextSlaveId was reset to 0 which may trigger duplicated slaveId between old slave & new slave.
> For now, it's only reproduced in Mac OS unstably, and can NOT reproduce in Ubuntu; not sure the other OS.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)