You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mesos.apache.org by Dick Davies <di...@hellooperator.net> on 2014/11/06 20:02:11 UTC

Re: Problems of running mesos-0.20.0 with zookeeper

The quorum flag is for the number of mesos masters, not zookeepers.

if you only have one master, it's going to have trouble reaching a
quorum of 2 :)

either set --quorum=1 or spin up more masters.

On 6 November 2014 21:01, sujinzhao <su...@gmail.com> wrote:
> Hi,all,
>
> I set up zookeeper service with three machines zoo1, zoo2, zoo3, and also installed 1 mesos master and 2 slaves on another three nodes, I tried to run master and slaves with:
> ./mesos-master.sh --ip=master-ip --zk=zk://zoo1:2181,zoo2:2181,zoo3:2181/mesos --quorum=2
>
> ./mesos-slave.sh --ip=slave-ip --master=zk://zoo1:2181,zoo2:2181,zoo3:2181/mesos
>
> I also created the /mesos znode before running the above commands, but I got the following error:
>
> Recovering from registrar
> Recovering registrar
> Recovery failed: Failed to recover registrar: Failed to perform fetch within 1mins
> *** Check failure stack trace: ***
>     @  0x7f3c1ea105cd google::LogMessage::Fail()
> ...............................
>
> after reading the master log, I found that before causing error, master has already been elected successfully, but the leader failed in recovering from registrar, so I guess this error has little relationship with zookeeper.
>
> after googleing I found that other people also encountered this problem, but with no solution, I also exclude the possible reason of ssh between master/slave and zookeeper servers with no password.
>
> So, could somebody be kindly to tell me how to solve this error? any suggestions will be appreciated.
>
> THANKS.