You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@zookeeper.apache.org by Patrick Hunt <ph...@apache.org> on 2014/06/26 02:07:54 UTC

ZooKeeper 3.5.0-alpha planning

Hey folks, we've been talking about it for a while, a few people have
mentioned on the list as well as contacted me personally that they
would like to see some progress on the first 3.5 release. Every
release is a compromise, if we wait for perfection we'll never get
anything out the door. 3.5 has tons of great new features, lots of
hard work, let's get it out in a release so that folks can use it,
test it, and give feedback.

Jenkins jobs have been pretty stable except for the known flakey test
ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
jenkins has also been verifying the code on jdk7 and jdk8.

Here's my thinking again on how we should plan our releases:

I don't think we'll be able to do a 3.5.x-stable for some time. What I
think we should do instead is similar to what we did for 3.4. (this is
also similar to what Hadoop did during their Hadoop 2 release cycle)
Start with a series of alpha releases, something people can run and
test with, once we address all the blockers and feel comfortable with
the apis & remaining jiras we then switch to beta. Once we get some
good feedback we remove the alpha/beta moniker and look at making it
"stable'. At some later point it will become the "current/stable"
release, taking over from 3.4.x.

e.g.
3.5.0-alpha (8 blockers)
3.5.1-alpha (3 blockers)
3.5.2-alpha (0 blockers)
3.5.3-beta (apis locked)
3.5.4-beta
3.5.5-beta
3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
maybe use it for production but we still expect things to shake out)
3.5.7
....
3.5.x - ready to replace 3.4 releases for production use, stable, etc...

There are 8 blockers currently, are any of these something that should
hold up 3.5.0-alpha?

I'll hold open the discussion for a couple days. If folks find this a
reasonable plan I'll start the ball rolling to cut an RC.

Patrick

RE: ZooKeeper 3.5.0-alpha planning

Posted by Rakesh R <ra...@huawei.com>.
I've just gone through the C library code checkins made in last two months (from April).

Following are some of the related issues, probably this may give hints.

Issue			Date
--------------	----------
ZOOKEEPER-1887	April - 16
ZOOKEEPER-1695	April - 30
ZOOKEEPER-1836	May   - 16
ZOOKEEPER-1938	June  - 25

-Rakesh

-----Original Message-----
From: Patrick Hunt [mailto:phunt@apache.org] 
Sent: 01 July 2014 12:00
To: DevZooKeeper
Subject: Re: ZooKeeper 3.5.0-alpha planning

fyi I see it as far back as may 28 (and could have been failing before but we don't have history much before that):

https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2316/console

definitely some changes in there around may 20/21, but no smoking gun...
https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/changes

Perhaps run with valgrind or something like that? We've run it before to good effect...

Patrick

On Mon, Jun 30, 2014 at 11:22 PM, Raúl Gutiérrez Segalés <rg...@itevenworks.net> wrote:
> On 30 June 2014 22:41, Patrick Hunt <ph...@apache.org> wrote:
>
>> I'm seeing quite a few segfault type failures in the c client on 
>> jenkins. That used to be pretty uncommon. Not sure when it started.
>>
>> Here's another example
>>
>> *** glibc detected *** ./zktest-mt: free(): invalid pointer:
>> 0x00002b0a75afd000 ***
>>
>>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk
>> /2346/console
>>
>
> So the last big chunk of code that landed in the C client was the 
> removeWatches impl (which still needs an update to adhere to the last 
> changes in the Java impl). Although, its test does pass:
>
> Zookeeper_simpleSystem::testRemoveWatchers ZooKeeper server started :
> elapsed 4634 : OK
>
> but the error is suspiciously near by.. I'll take a look.
>
>
> -rgs
>
>
>
>
>>
>> Patrick
>>
>> On Mon, Jun 30, 2014 at 10:32 PM, Raúl Gutiérrez Segalés 
>> <rg...@itevenworks.net> wrote:
>> > On 30 June 2014 22:26, Patrick Hunt <ph...@apache.org> wrote:
>> >
>> >> Also, does anyone have an idea where we stand with the c client 
>> >> and windows support? I see the build job is passing on trunk. Are 
>> >> folks able to successfully use that client?
>> >>
>> >> I see the c client on linux failing in some new ways, recent change?
>> >>
>> >>      [exec] Zookeeper_operations::testConcurrentOperations1 :
>> >> assertion : elapsed 24
>> >>      [exec] /bin/bash: line 5: 11205 Segmentation fault
>> >>
>> >>
>> ZKROOT=/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/../..
>> >> CLASSPATH=$CLASSPATH:$CLOVER_HOME/lib/clover.jar ${dir}$tst
>> >>      [exec] Zookeeper_multi::testCreateFAIL: zktest-mt
>> >>
>> >
>> > I wonder if related to:
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1933
>> >
>> >
>> > -rgs
>> >
>> >
>> >
>> >>
>> >> Patrick
>> >>
>> >> On Mon, Jun 30, 2014 at 9:53 PM, Patrick Hunt <ph...@apache.org> wrote:
>> >> > Hi Flavio, do you think those jiras can get reviewed/finalized 
>> >> > before the end of the week? I'd like to try cutting an RC soonish...
>> >> >
>> >> > Patrick
>> >> >
>> >> > On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira 
>> >> > <fp...@yahoo.com.invalid> wrote:
>> >> >> +1 for the plan of releasing alpha versions.
>> >> >>
>> >> >> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are 
>> >> >> both
>> patch
>> >> available. ZK-1870 is in trunk, but it is still open because we 
>> >> need a
>> 3.4
>> >> patch.
>> >> >>
>> >> >> -Flavio
>> >> >>
>> >> >>
>> >> >> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>> >> >>
>> >> >>> Hey folks, we've been talking about it for a while, a few 
>> >> >>> people
>> have
>> >> >>> mentioned on the list as well as contacted me personally that 
>> >> >>> they would like to see some progress on the first 3.5 release. 
>> >> >>> Every release is a compromise, if we wait for perfection we'll 
>> >> >>> never get anything out the door. 3.5 has tons of great new 
>> >> >>> features, lots of hard work, let's get it out in a release so 
>> >> >>> that folks can use it, test it, and give feedback.
>> >> >>>
>> >> >>> Jenkins jobs have been pretty stable except for the known 
>> >> >>> flakey
>> test
>> >> >>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note 
>> >> >>> that jenkins has also been verifying the code on jdk7 and jdk8.
>> >> >>>
>> >> >>> Here's my thinking again on how we should plan our releases:
>> >> >>>
>> >> >>> I don't think we'll be able to do a 3.5.x-stable for some time.
>> What I
>> >> >>> think we should do instead is similar to what we did for 3.4. 
>> >> >>> (this
>> is
>> >> >>> also similar to what Hadoop did during their Hadoop 2 release 
>> >> >>> cycle) Start with a series of alpha releases, something people 
>> >> >>> can run and test with, once we address all the blockers and 
>> >> >>> feel comfortable
>> with
>> >> >>> the apis & remaining jiras we then switch to beta. Once we get 
>> >> >>> some good feedback we remove the alpha/beta moniker and look 
>> >> >>> at making it "stable'. At some later point it will become the "current/stable"
>> >> >>> release, taking over from 3.4.x.
>> >> >>>
>> >> >>> e.g.
>> >> >>> 3.5.0-alpha (8 blockers)
>> >> >>> 3.5.1-alpha (3 blockers)
>> >> >>> 3.5.2-alpha (0 blockers)
>> >> >>> 3.5.3-beta (apis locked)
>> >> >>> 3.5.4-beta
>> >> >>> 3.5.5-beta
>> >> >>> 3.5.6 (no longer considered alpha/beta but also not "stable" 
>> >> >>> vs
>> 3.4.x,
>> >> >>> maybe use it for production but we still expect things to 
>> >> >>> shake out)
>> >> >>> 3.5.7
>> >> >>> ....
>> >> >>> 3.5.x - ready to replace 3.4 releases for production use, 
>> >> >>> stable,
>> >> etc...
>> >> >>>
>> >> >>> There are 8 blockers currently, are any of these something 
>> >> >>> that
>> should
>> >> >>> hold up 3.5.0-alpha?
>> >> >>>
>> >> >>> I'll hold open the discussion for a couple days. If folks find 
>> >> >>> this
>> a
>> >> >>> reasonable plan I'll start the ball rolling to cut an RC.
>> >> >>>
>> >> >>> Patrick
>> >> >>
>> >>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
fyi I see it as far back as may 28 (and could have been failing before
but we don't have history much before that):

https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2316/console

definitely some changes in there around may 20/21, but no smoking gun...
https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/changes

Perhaps run with valgrind or something like that? We've run it before
to good effect...

Patrick

On Mon, Jun 30, 2014 at 11:22 PM, Raúl Gutiérrez Segalés
<rg...@itevenworks.net> wrote:
> On 30 June 2014 22:41, Patrick Hunt <ph...@apache.org> wrote:
>
>> I'm seeing quite a few segfault type failures in the c client on
>> jenkins. That used to be pretty uncommon. Not sure when it started.
>>
>> Here's another example
>>
>> *** glibc detected *** ./zktest-mt: free(): invalid pointer:
>> 0x00002b0a75afd000 ***
>>
>>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2346/console
>>
>
> So the last big chunk of code that landed in the C client was the
> removeWatches impl (which still needs an update to adhere to the last
> changes in the Java impl). Although, its test does pass:
>
> Zookeeper_simpleSystem::testRemoveWatchers ZooKeeper server started :
> elapsed 4634 : OK
>
> but the error is suspiciously near by.. I'll take a look.
>
>
> -rgs
>
>
>
>
>>
>> Patrick
>>
>> On Mon, Jun 30, 2014 at 10:32 PM, Raúl Gutiérrez Segalés
>> <rg...@itevenworks.net> wrote:
>> > On 30 June 2014 22:26, Patrick Hunt <ph...@apache.org> wrote:
>> >
>> >> Also, does anyone have an idea where we stand with the c client and
>> >> windows support? I see the build job is passing on trunk. Are folks
>> >> able to successfully use that client?
>> >>
>> >> I see the c client on linux failing in some new ways, recent change?
>> >>
>> >>      [exec] Zookeeper_operations::testConcurrentOperations1 :
>> >> assertion : elapsed 24
>> >>      [exec] /bin/bash: line 5: 11205 Segmentation fault
>> >>
>> >>
>> ZKROOT=/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/../..
>> >> CLASSPATH=$CLASSPATH:$CLOVER_HOME/lib/clover.jar ${dir}$tst
>> >>      [exec] Zookeeper_multi::testCreateFAIL: zktest-mt
>> >>
>> >
>> > I wonder if related to:
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1933
>> >
>> >
>> > -rgs
>> >
>> >
>> >
>> >>
>> >> Patrick
>> >>
>> >> On Mon, Jun 30, 2014 at 9:53 PM, Patrick Hunt <ph...@apache.org> wrote:
>> >> > Hi Flavio, do you think those jiras can get reviewed/finalized before
>> >> > the end of the week? I'd like to try cutting an RC soonish...
>> >> >
>> >> > Patrick
>> >> >
>> >> > On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>> >> > <fp...@yahoo.com.invalid> wrote:
>> >> >> +1 for the plan of releasing alpha versions.
>> >> >>
>> >> >> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
>> patch
>> >> available. ZK-1870 is in trunk, but it is still open because we need a
>> 3.4
>> >> patch.
>> >> >>
>> >> >> -Flavio
>> >> >>
>> >> >>
>> >> >> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>> >> >>
>> >> >>> Hey folks, we've been talking about it for a while, a few people
>> have
>> >> >>> mentioned on the list as well as contacted me personally that they
>> >> >>> would like to see some progress on the first 3.5 release. Every
>> >> >>> release is a compromise, if we wait for perfection we'll never get
>> >> >>> anything out the door. 3.5 has tons of great new features, lots of
>> >> >>> hard work, let's get it out in a release so that folks can use it,
>> >> >>> test it, and give feedback.
>> >> >>>
>> >> >>> Jenkins jobs have been pretty stable except for the known flakey
>> test
>> >> >>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>> >> >>> jenkins has also been verifying the code on jdk7 and jdk8.
>> >> >>>
>> >> >>> Here's my thinking again on how we should plan our releases:
>> >> >>>
>> >> >>> I don't think we'll be able to do a 3.5.x-stable for some time.
>> What I
>> >> >>> think we should do instead is similar to what we did for 3.4. (this
>> is
>> >> >>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>> >> >>> Start with a series of alpha releases, something people can run and
>> >> >>> test with, once we address all the blockers and feel comfortable
>> with
>> >> >>> the apis & remaining jiras we then switch to beta. Once we get some
>> >> >>> good feedback we remove the alpha/beta moniker and look at making it
>> >> >>> "stable'. At some later point it will become the "current/stable"
>> >> >>> release, taking over from 3.4.x.
>> >> >>>
>> >> >>> e.g.
>> >> >>> 3.5.0-alpha (8 blockers)
>> >> >>> 3.5.1-alpha (3 blockers)
>> >> >>> 3.5.2-alpha (0 blockers)
>> >> >>> 3.5.3-beta (apis locked)
>> >> >>> 3.5.4-beta
>> >> >>> 3.5.5-beta
>> >> >>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
>> 3.4.x,
>> >> >>> maybe use it for production but we still expect things to shake out)
>> >> >>> 3.5.7
>> >> >>> ....
>> >> >>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>> >> etc...
>> >> >>>
>> >> >>> There are 8 blockers currently, are any of these something that
>> should
>> >> >>> hold up 3.5.0-alpha?
>> >> >>>
>> >> >>> I'll hold open the discussion for a couple days. If folks find this
>> a
>> >> >>> reasonable plan I'll start the ball rolling to cut an RC.
>> >> >>>
>> >> >>> Patrick
>> >> >>
>> >>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 30 June 2014 22:41, Patrick Hunt <ph...@apache.org> wrote:

> I'm seeing quite a few segfault type failures in the c client on
> jenkins. That used to be pretty uncommon. Not sure when it started.
>
> Here's another example
>
> *** glibc detected *** ./zktest-mt: free(): invalid pointer:
> 0x00002b0a75afd000 ***
>
>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2346/console
>

So the last big chunk of code that landed in the C client was the
removeWatches impl (which still needs an update to adhere to the last
changes in the Java impl). Although, its test does pass:

Zookeeper_simpleSystem::testRemoveWatchers ZooKeeper server started :
elapsed 4634 : OK

but the error is suspiciously near by.. I'll take a look.


-rgs




>
> Patrick
>
> On Mon, Jun 30, 2014 at 10:32 PM, Raúl Gutiérrez Segalés
> <rg...@itevenworks.net> wrote:
> > On 30 June 2014 22:26, Patrick Hunt <ph...@apache.org> wrote:
> >
> >> Also, does anyone have an idea where we stand with the c client and
> >> windows support? I see the build job is passing on trunk. Are folks
> >> able to successfully use that client?
> >>
> >> I see the c client on linux failing in some new ways, recent change?
> >>
> >>      [exec] Zookeeper_operations::testConcurrentOperations1 :
> >> assertion : elapsed 24
> >>      [exec] /bin/bash: line 5: 11205 Segmentation fault
> >>
> >>
> ZKROOT=/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/../..
> >> CLASSPATH=$CLASSPATH:$CLOVER_HOME/lib/clover.jar ${dir}$tst
> >>      [exec] Zookeeper_multi::testCreateFAIL: zktest-mt
> >>
> >
> > I wonder if related to:
> https://issues.apache.org/jira/browse/ZOOKEEPER-1933
> >
> >
> > -rgs
> >
> >
> >
> >>
> >> Patrick
> >>
> >> On Mon, Jun 30, 2014 at 9:53 PM, Patrick Hunt <ph...@apache.org> wrote:
> >> > Hi Flavio, do you think those jiras can get reviewed/finalized before
> >> > the end of the week? I'd like to try cutting an RC soonish...
> >> >
> >> > Patrick
> >> >
> >> > On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> >> > <fp...@yahoo.com.invalid> wrote:
> >> >> +1 for the plan of releasing alpha versions.
> >> >>
> >> >> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
> patch
> >> available. ZK-1870 is in trunk, but it is still open because we need a
> 3.4
> >> patch.
> >> >>
> >> >> -Flavio
> >> >>
> >> >>
> >> >> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
> >> >>
> >> >>> Hey folks, we've been talking about it for a while, a few people
> have
> >> >>> mentioned on the list as well as contacted me personally that they
> >> >>> would like to see some progress on the first 3.5 release. Every
> >> >>> release is a compromise, if we wait for perfection we'll never get
> >> >>> anything out the door. 3.5 has tons of great new features, lots of
> >> >>> hard work, let's get it out in a release so that folks can use it,
> >> >>> test it, and give feedback.
> >> >>>
> >> >>> Jenkins jobs have been pretty stable except for the known flakey
> test
> >> >>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
> >> >>> jenkins has also been verifying the code on jdk7 and jdk8.
> >> >>>
> >> >>> Here's my thinking again on how we should plan our releases:
> >> >>>
> >> >>> I don't think we'll be able to do a 3.5.x-stable for some time.
> What I
> >> >>> think we should do instead is similar to what we did for 3.4. (this
> is
> >> >>> also similar to what Hadoop did during their Hadoop 2 release cycle)
> >> >>> Start with a series of alpha releases, something people can run and
> >> >>> test with, once we address all the blockers and feel comfortable
> with
> >> >>> the apis & remaining jiras we then switch to beta. Once we get some
> >> >>> good feedback we remove the alpha/beta moniker and look at making it
> >> >>> "stable'. At some later point it will become the "current/stable"
> >> >>> release, taking over from 3.4.x.
> >> >>>
> >> >>> e.g.
> >> >>> 3.5.0-alpha (8 blockers)
> >> >>> 3.5.1-alpha (3 blockers)
> >> >>> 3.5.2-alpha (0 blockers)
> >> >>> 3.5.3-beta (apis locked)
> >> >>> 3.5.4-beta
> >> >>> 3.5.5-beta
> >> >>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
> 3.4.x,
> >> >>> maybe use it for production but we still expect things to shake out)
> >> >>> 3.5.7
> >> >>> ....
> >> >>> 3.5.x - ready to replace 3.4 releases for production use, stable,
> >> etc...
> >> >>>
> >> >>> There are 8 blockers currently, are any of these something that
> should
> >> >>> hold up 3.5.0-alpha?
> >> >>>
> >> >>> I'll hold open the discussion for a couple days. If folks find this
> a
> >> >>> reasonable plan I'll start the ball rolling to cut an RC.
> >> >>>
> >> >>> Patrick
> >> >>
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
I'm seeing quite a few segfault type failures in the c client on
jenkins. That used to be pretty uncommon. Not sure when it started.

Here's another example

*** glibc detected *** ./zktest-mt: free(): invalid pointer:
0x00002b0a75afd000 ***

https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2346/console

Patrick

On Mon, Jun 30, 2014 at 10:32 PM, Raúl Gutiérrez Segalés
<rg...@itevenworks.net> wrote:
> On 30 June 2014 22:26, Patrick Hunt <ph...@apache.org> wrote:
>
>> Also, does anyone have an idea where we stand with the c client and
>> windows support? I see the build job is passing on trunk. Are folks
>> able to successfully use that client?
>>
>> I see the c client on linux failing in some new ways, recent change?
>>
>>      [exec] Zookeeper_operations::testConcurrentOperations1 :
>> assertion : elapsed 24
>>      [exec] /bin/bash: line 5: 11205 Segmentation fault
>>
>> ZKROOT=/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/../..
>> CLASSPATH=$CLASSPATH:$CLOVER_HOME/lib/clover.jar ${dir}$tst
>>      [exec] Zookeeper_multi::testCreateFAIL: zktest-mt
>>
>
> I wonder if related to: https://issues.apache.org/jira/browse/ZOOKEEPER-1933
>
>
> -rgs
>
>
>
>>
>> Patrick
>>
>> On Mon, Jun 30, 2014 at 9:53 PM, Patrick Hunt <ph...@apache.org> wrote:
>> > Hi Flavio, do you think those jiras can get reviewed/finalized before
>> > the end of the week? I'd like to try cutting an RC soonish...
>> >
>> > Patrick
>> >
>> > On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>> > <fp...@yahoo.com.invalid> wrote:
>> >> +1 for the plan of releasing alpha versions.
>> >>
>> >> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch
>> available. ZK-1870 is in trunk, but it is still open because we need a 3.4
>> patch.
>> >>
>> >> -Flavio
>> >>
>> >>
>> >> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>> >>
>> >>> Hey folks, we've been talking about it for a while, a few people have
>> >>> mentioned on the list as well as contacted me personally that they
>> >>> would like to see some progress on the first 3.5 release. Every
>> >>> release is a compromise, if we wait for perfection we'll never get
>> >>> anything out the door. 3.5 has tons of great new features, lots of
>> >>> hard work, let's get it out in a release so that folks can use it,
>> >>> test it, and give feedback.
>> >>>
>> >>> Jenkins jobs have been pretty stable except for the known flakey test
>> >>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>> >>> jenkins has also been verifying the code on jdk7 and jdk8.
>> >>>
>> >>> Here's my thinking again on how we should plan our releases:
>> >>>
>> >>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>> >>> think we should do instead is similar to what we did for 3.4. (this is
>> >>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>> >>> Start with a series of alpha releases, something people can run and
>> >>> test with, once we address all the blockers and feel comfortable with
>> >>> the apis & remaining jiras we then switch to beta. Once we get some
>> >>> good feedback we remove the alpha/beta moniker and look at making it
>> >>> "stable'. At some later point it will become the "current/stable"
>> >>> release, taking over from 3.4.x.
>> >>>
>> >>> e.g.
>> >>> 3.5.0-alpha (8 blockers)
>> >>> 3.5.1-alpha (3 blockers)
>> >>> 3.5.2-alpha (0 blockers)
>> >>> 3.5.3-beta (apis locked)
>> >>> 3.5.4-beta
>> >>> 3.5.5-beta
>> >>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>> >>> maybe use it for production but we still expect things to shake out)
>> >>> 3.5.7
>> >>> ....
>> >>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>> etc...
>> >>>
>> >>> There are 8 blockers currently, are any of these something that should
>> >>> hold up 3.5.0-alpha?
>> >>>
>> >>> I'll hold open the discussion for a couple days. If folks find this a
>> >>> reasonable plan I'll start the ball rolling to cut an RC.
>> >>>
>> >>> Patrick
>> >>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 30 June 2014 22:26, Patrick Hunt <ph...@apache.org> wrote:

> Also, does anyone have an idea where we stand with the c client and
> windows support? I see the build job is passing on trunk. Are folks
> able to successfully use that client?
>
> I see the c client on linux failing in some new ways, recent change?
>
>      [exec] Zookeeper_operations::testConcurrentOperations1 :
> assertion : elapsed 24
>      [exec] /bin/bash: line 5: 11205 Segmentation fault
>
> ZKROOT=/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/../..
> CLASSPATH=$CLASSPATH:$CLOVER_HOME/lib/clover.jar ${dir}$tst
>      [exec] Zookeeper_multi::testCreateFAIL: zktest-mt
>

I wonder if related to: https://issues.apache.org/jira/browse/ZOOKEEPER-1933


-rgs



>
> Patrick
>
> On Mon, Jun 30, 2014 at 9:53 PM, Patrick Hunt <ph...@apache.org> wrote:
> > Hi Flavio, do you think those jiras can get reviewed/finalized before
> > the end of the week? I'd like to try cutting an RC soonish...
> >
> > Patrick
> >
> > On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> > <fp...@yahoo.com.invalid> wrote:
> >> +1 for the plan of releasing alpha versions.
> >>
> >> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch
> available. ZK-1870 is in trunk, but it is still open because we need a 3.4
> patch.
> >>
> >> -Flavio
> >>
> >>
> >> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
> >>
> >>> Hey folks, we've been talking about it for a while, a few people have
> >>> mentioned on the list as well as contacted me personally that they
> >>> would like to see some progress on the first 3.5 release. Every
> >>> release is a compromise, if we wait for perfection we'll never get
> >>> anything out the door. 3.5 has tons of great new features, lots of
> >>> hard work, let's get it out in a release so that folks can use it,
> >>> test it, and give feedback.
> >>>
> >>> Jenkins jobs have been pretty stable except for the known flakey test
> >>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
> >>> jenkins has also been verifying the code on jdk7 and jdk8.
> >>>
> >>> Here's my thinking again on how we should plan our releases:
> >>>
> >>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
> >>> think we should do instead is similar to what we did for 3.4. (this is
> >>> also similar to what Hadoop did during their Hadoop 2 release cycle)
> >>> Start with a series of alpha releases, something people can run and
> >>> test with, once we address all the blockers and feel comfortable with
> >>> the apis & remaining jiras we then switch to beta. Once we get some
> >>> good feedback we remove the alpha/beta moniker and look at making it
> >>> "stable'. At some later point it will become the "current/stable"
> >>> release, taking over from 3.4.x.
> >>>
> >>> e.g.
> >>> 3.5.0-alpha (8 blockers)
> >>> 3.5.1-alpha (3 blockers)
> >>> 3.5.2-alpha (0 blockers)
> >>> 3.5.3-beta (apis locked)
> >>> 3.5.4-beta
> >>> 3.5.5-beta
> >>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
> >>> maybe use it for production but we still expect things to shake out)
> >>> 3.5.7
> >>> ....
> >>> 3.5.x - ready to replace 3.4 releases for production use, stable,
> etc...
> >>>
> >>> There are 8 blockers currently, are any of these something that should
> >>> hold up 3.5.0-alpha?
> >>>
> >>> I'll hold open the discussion for a couple days. If folks find this a
> >>> reasonable plan I'll start the ball rolling to cut an RC.
> >>>
> >>> Patrick
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Also, does anyone have an idea where we stand with the c client and
windows support? I see the build job is passing on trunk. Are folks
able to successfully use that client?

I see the c client on linux failing in some new ways, recent change?

     [exec] Zookeeper_operations::testConcurrentOperations1 :
assertion : elapsed 24
     [exec] /bin/bash: line 5: 11205 Segmentation fault
ZKROOT=/home/jenkins/jenkins-slave/workspace/ZooKeeper-trunk/trunk/src/c/../..
CLASSPATH=$CLASSPATH:$CLOVER_HOME/lib/clover.jar ${dir}$tst
     [exec] Zookeeper_multi::testCreateFAIL: zktest-mt

Patrick

On Mon, Jun 30, 2014 at 9:53 PM, Patrick Hunt <ph...@apache.org> wrote:
> Hi Flavio, do you think those jiras can get reviewed/finalized before
> the end of the week? I'd like to try cutting an RC soonish...
>
> Patrick
>
> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> <fp...@yahoo.com.invalid> wrote:
>> +1 for the plan of releasing alpha versions.
>>
>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch available. ZK-1870 is in trunk, but it is still open because we need a 3.4 patch.
>>
>> -Flavio
>>
>>
>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> Hey folks, we've been talking about it for a while, a few people have
>>> mentioned on the list as well as contacted me personally that they
>>> would like to see some progress on the first 3.5 release. Every
>>> release is a compromise, if we wait for perfection we'll never get
>>> anything out the door. 3.5 has tons of great new features, lots of
>>> hard work, let's get it out in a release so that folks can use it,
>>> test it, and give feedback.
>>>
>>> Jenkins jobs have been pretty stable except for the known flakey test
>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>>
>>> Here's my thinking again on how we should plan our releases:
>>>
>>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>>> think we should do instead is similar to what we did for 3.4. (this is
>>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>>> Start with a series of alpha releases, something people can run and
>>> test with, once we address all the blockers and feel comfortable with
>>> the apis & remaining jiras we then switch to beta. Once we get some
>>> good feedback we remove the alpha/beta moniker and look at making it
>>> "stable'. At some later point it will become the "current/stable"
>>> release, taking over from 3.4.x.
>>>
>>> e.g.
>>> 3.5.0-alpha (8 blockers)
>>> 3.5.1-alpha (3 blockers)
>>> 3.5.2-alpha (0 blockers)
>>> 3.5.3-beta (apis locked)
>>> 3.5.4-beta
>>> 3.5.5-beta
>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>>> maybe use it for production but we still expect things to shake out)
>>> 3.5.7
>>> ....
>>> 3.5.x - ready to replace 3.4 releases for production use, stable, etc...
>>>
>>> There are 8 blockers currently, are any of these something that should
>>> hold up 3.5.0-alpha?
>>>
>>> I'll hold open the discussion for a couple days. If folks find this a
>>> reasonable plan I'll start the ball rolling to cut an RC.
>>>
>>> Patrick
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Flavio Junqueira <fp...@yahoo.com.INVALID>.
> Is it possible that multiple tests are running at the same time and
> interfere with each other ?


It wouldn't be the first time we have such problems with test cases overlapping. 

-Flavio

On 26 Jul 2014, at 01:00, Alexander Shraer <sh...@gmail.com> wrote:

> Looking at the log Patrick sent:
> https://builds.apache.org/job/ZooKeeper-trunk/2386/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
> 
> This is really interesting. I see that server 2 boots and then the
> following happens:
> 
> 2014-07-25 17:12:01,159 [myid:2] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
> (message format version), 2 (n.leader), 0x0 (n.zxid), 0x1 (n.round),
> LOOKING (n.state), 2 (n.sid), 0x0 (n.peerEPoch), LOOKING (my state)0
> (n.config version)
> 2014-07-25 17:12:01,161 [myid:1] - INFO
> [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
> (message format version), 2 (n.leader), 0x0 (n.zxid), 0x1 (n.round),
> LOOKING (n.state), 2 (n.sid), 0x0 (n.peerEPoch), LEADING (my state)0
> (n.config version)
> 2014-07-25 17:12:01,163 [myid:2] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
> - 2 Received version: 200000000 my version: 0
> 
> 
> This just doesn't make sense. First, I start it with config 100000000
> so it can't have "0 (n.config version)", second no one even
> established the first config, so who's sending it version 200000000 ?
> This is the first time it appears in he log.
> 
> 
> Is it possible that multiple tests are running at the same time and
> interfere with each other ?
> 
> 
> I'm not sure how to debug this without access to the machine and
> adding more debug messages.
> 
> 
> 
> Alex
> 
> 
> 
> 
> On Fri, Jul 25, 2014 at 11:28 AM, Alexander Shraer <sh...@gmail.com>
> wrote:
> 
>> Hi,
>> 
>> Hongchao, could you please clarify what you propose ?
>> 
>> If I understand correctly we have two main options - ZK-1989 that disables
>> reconfig and keeps the old static config and a simpler fix which is to
>> check for client ports on boot and shut down the server with an error if no
>> ports were found.
>> 
>> Patrick, I'll take a look on the log. Michi and I didn't succeed to get
>> access to the build machine, which makes it very difficult to debug...
>> 
>> Thanks,
>> Alex
>> 
>> 
>> On Fri, Jul 25, 2014 at 10:53 AM, Hongchao Deng <hd...@cloudera.com>
>> wrote:
>> 
>>> ZK-1989 gets pretty complicated if it needs to support the full backward
>>> compatibility.
>>> 
>>> My plan is divide a small task out: simply keep the old config and make it
>>> work. There could be unexpected cases when users of old config tried to
>>> use
>>> reconfig.
>>> 
>>> Is it okay for the first alpha release?
>>> 
>>> 
>>> On Fri, Jul 25, 2014 at 10:46 AM, Patrick Hunt <ph...@apache.org> wrote:
>>> 
>>>> 1974 has been committed (kudos folks!), along with a few other patches
>>>> that were ready to go.
>>>> 
>>>> Hongchao, how is 1989 coming?
>>>> 
>>>> Patrick
>>>> 
>>>> 
>>>> On Thu, Jul 24, 2014 at 4:50 PM, Patrick Hunt <ph...@apache.org> wrote:
>>>>> Can someone take a look at this issue? The windows c client build is
>>>>> failing for a while now, would be great to fix this for 3.5.0...
>>>>> 
>>>>> ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved external
>>>> symbol"
>>>>> 
>>>>> Patrick
>>>>> 
>>>>> On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
>>>>> <rg...@itevenworks.net> wrote:
>>>>>> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
>>>>>> 
>>>>>>> We've identified the issues with 1987, it would be good if folks
>>> could
>>>>>>> take a look.
>>>>>> 
>>>>>> 
>>>>>> Great - thanks Patrick. Added some comments to the patch.
>>>>>> 
>>>>>> 
>>>>>>> Nothing looks unsolvable, but we should tweak things a
>>>>>>> bit before 3.5.0, esp given the current upgrade experience. The new
>>>>>>> docs will help a lot - see
>>>>>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need
>>> to
>>>>>>> review and commit.
>>>>>>> 
>>>>>> 
>>>>>> Reading the docs, will follow-up with comments.
>>>>>> 
>>>>>> 
>>>>>> -rgs
>>>> 
>>> 
>>> 
>>> 
>>> --
>>> *- Hongchao Deng*
>>> *Software Engineer, CCE*
>>> 
>> 
>> 


Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
The 3.5.0-alpha release candidate has been posted to the dev list -
thanks everyone for the hard work!

Now the hard work starts. ;-) We need to test the RC and vote, please
do so and comment on the RC thread if you see any issues.

Note: I've also updated all the unresolved jiras, moving any
stragglers from 3.5.0 to 3.5.1. If we find something that should hold
up a 3.5.0 release please mark it as such in jira, otw mark it 3.5.1
(or otw as appropriate).

Regards,

Patrick

On Fri, Jul 25, 2014 at 5:00 PM, Alexander Shraer <sh...@gmail.com> wrote:
> Looking at the log Patrick sent:
> https://builds.apache.org/job/ZooKeeper-trunk/2386/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>
> This is really interesting. I see that server 2 boots and then the following
> happens:
>
>
> 2014-07-25 17:12:01,159 [myid:2] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2 (message
> format version), 2 (n.leader), 0x0 (n.zxid), 0x1 (n.round), LOOKING
> (n.state), 2 (n.sid), 0x0 (n.peerEPoch), LOOKING (my state)0 (n.config
> version)
> 2014-07-25 17:12:01,161 [myid:1] - INFO
> [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2 (message
> format version), 2 (n.leader), 0x0 (n.zxid), 0x1 (n.round), LOOKING
> (n.state), 2 (n.sid), 0x0 (n.peerEPoch), LEADING (my state)0 (n.config
> version)
> 2014-07-25 17:12:01,163 [myid:2] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293] - 2
> Received version: 200000000 my version: 0
>
>
> This just doesn't make sense. First, I start it with config 100000000 so it
> can't have "0 (n.config version)", second no one even established the first
> config, so who's sending it version 200000000 ? This is the first time it
> appears in he log.
>
>
>
> Is it possible that multiple tests are running at the same time and
> interfere with each other ?
>
>
> I'm not sure how to debug this without access to the machine and adding more
> debug messages.
>
>
>
>
> Alex
>
>
>
>
> On Fri, Jul 25, 2014 at 11:28 AM, Alexander Shraer <sh...@gmail.com>
> wrote:
>>
>> Hi,
>>
>> Hongchao, could you please clarify what you propose ?
>>
>> If I understand correctly we have two main options - ZK-1989 that disables
>> reconfig and keeps the old static config and a simpler fix which is to check
>> for client ports on boot and shut down the server with an error if no ports
>> were found.
>>
>> Patrick, I'll take a look on the log. Michi and I didn't succeed to get
>> access to the build machine, which makes it very difficult to debug...
>>
>> Thanks,
>> Alex
>>
>>
>> On Fri, Jul 25, 2014 at 10:53 AM, Hongchao Deng <hd...@cloudera.com>
>> wrote:
>>>
>>> ZK-1989 gets pretty complicated if it needs to support the full backward
>>> compatibility.
>>>
>>> My plan is divide a small task out: simply keep the old config and make
>>> it
>>> work. There could be unexpected cases when users of old config tried to
>>> use
>>> reconfig.
>>>
>>> Is it okay for the first alpha release?
>>>
>>>
>>> On Fri, Jul 25, 2014 at 10:46 AM, Patrick Hunt <ph...@apache.org> wrote:
>>>
>>> > 1974 has been committed (kudos folks!), along with a few other patches
>>> > that were ready to go.
>>> >
>>> > Hongchao, how is 1989 coming?
>>> >
>>> > Patrick
>>> >
>>> >
>>> > On Thu, Jul 24, 2014 at 4:50 PM, Patrick Hunt <ph...@apache.org> wrote:
>>> > > Can someone take a look at this issue? The windows c client build is
>>> > > failing for a while now, would be great to fix this for 3.5.0...
>>> > >
>>> > > ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved
>>> > > external
>>> > symbol"
>>> > >
>>> > > Patrick
>>> > >
>>> > > On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
>>> > > <rg...@itevenworks.net> wrote:
>>> > >> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
>>> > >>
>>> > >>> We've identified the issues with 1987, it would be good if folks
>>> > >>> could
>>> > >>> take a look.
>>> > >>
>>> > >>
>>> > >> Great - thanks Patrick. Added some comments to the patch.
>>> > >>
>>> > >>
>>> > >>> Nothing looks unsolvable, but we should tweak things a
>>> > >>> bit before 3.5.0, esp given the current upgrade experience. The new
>>> > >>> docs will help a lot - see
>>> > >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need
>>> > >>> to
>>> > >>> review and commit.
>>> > >>>
>>> > >>
>>> > >> Reading the docs, will follow-up with comments.
>>> > >>
>>> > >>
>>> > >> -rgs
>>> >
>>>
>>>
>>>
>>> --
>>> *- Hongchao Deng*
>>> *Software Engineer, CCE*
>>
>>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Looking at the log Patrick sent:
https://builds.apache.org/job/ZooKeeper-trunk/2386/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/

This is really interesting. I see that server 2 boots and then the
following happens:

2014-07-25 17:12:01,159 [myid:2] - INFO
[WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
(message format version), 2 (n.leader), 0x0 (n.zxid), 0x1 (n.round),
LOOKING (n.state), 2 (n.sid), 0x0 (n.peerEPoch), LOOKING (my state)0
(n.config version)
2014-07-25 17:12:01,161 [myid:1] - INFO
[WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
(message format version), 2 (n.leader), 0x0 (n.zxid), 0x1 (n.round),
LOOKING (n.state), 2 (n.sid), 0x0 (n.peerEPoch), LEADING (my state)0
(n.config version)
2014-07-25 17:12:01,163 [myid:2] - INFO
[WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
- 2 Received version: 200000000 my version: 0


This just doesn't make sense. First, I start it with config 100000000
so it can't have "0 (n.config version)", second no one even
established the first config, so who's sending it version 200000000 ?
This is the first time it appears in he log.


Is it possible that multiple tests are running at the same time and
interfere with each other ?


I'm not sure how to debug this without access to the machine and
adding more debug messages.



Alex




On Fri, Jul 25, 2014 at 11:28 AM, Alexander Shraer <sh...@gmail.com>
wrote:

> Hi,
>
> Hongchao, could you please clarify what you propose ?
>
> If I understand correctly we have two main options - ZK-1989 that disables
> reconfig and keeps the old static config and a simpler fix which is to
> check for client ports on boot and shut down the server with an error if no
> ports were found.
>
> Patrick, I'll take a look on the log. Michi and I didn't succeed to get
> access to the build machine, which makes it very difficult to debug...
>
> Thanks,
> Alex
>
>
> On Fri, Jul 25, 2014 at 10:53 AM, Hongchao Deng <hd...@cloudera.com>
> wrote:
>
>> ZK-1989 gets pretty complicated if it needs to support the full backward
>> compatibility.
>>
>> My plan is divide a small task out: simply keep the old config and make it
>> work. There could be unexpected cases when users of old config tried to
>> use
>> reconfig.
>>
>> Is it okay for the first alpha release?
>>
>>
>> On Fri, Jul 25, 2014 at 10:46 AM, Patrick Hunt <ph...@apache.org> wrote:
>>
>> > 1974 has been committed (kudos folks!), along with a few other patches
>> > that were ready to go.
>> >
>> > Hongchao, how is 1989 coming?
>> >
>> > Patrick
>> >
>> >
>> > On Thu, Jul 24, 2014 at 4:50 PM, Patrick Hunt <ph...@apache.org> wrote:
>> > > Can someone take a look at this issue? The windows c client build is
>> > > failing for a while now, would be great to fix this for 3.5.0...
>> > >
>> > > ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved external
>> > symbol"
>> > >
>> > > Patrick
>> > >
>> > > On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
>> > > <rg...@itevenworks.net> wrote:
>> > >> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
>> > >>
>> > >>> We've identified the issues with 1987, it would be good if folks
>> could
>> > >>> take a look.
>> > >>
>> > >>
>> > >> Great - thanks Patrick. Added some comments to the patch.
>> > >>
>> > >>
>> > >>> Nothing looks unsolvable, but we should tweak things a
>> > >>> bit before 3.5.0, esp given the current upgrade experience. The new
>> > >>> docs will help a lot - see
>> > >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need
>> to
>> > >>> review and commit.
>> > >>>
>> > >>
>> > >> Reading the docs, will follow-up with comments.
>> > >>
>> > >>
>> > >> -rgs
>> >
>>
>>
>>
>> --
>> *- Hongchao Deng*
>> *Software Engineer, CCE*
>>
>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Hi,

Hongchao, could you please clarify what you propose ?

If I understand correctly we have two main options - ZK-1989 that disables
reconfig and keeps the old static config and a simpler fix which is to
check for client ports on boot and shut down the server with an error if no
ports were found.

Patrick, I'll take a look on the log. Michi and I didn't succeed to get
access to the build machine, which makes it very difficult to debug...

Thanks,
Alex


On Fri, Jul 25, 2014 at 10:53 AM, Hongchao Deng <hd...@cloudera.com> wrote:

> ZK-1989 gets pretty complicated if it needs to support the full backward
> compatibility.
>
> My plan is divide a small task out: simply keep the old config and make it
> work. There could be unexpected cases when users of old config tried to use
> reconfig.
>
> Is it okay for the first alpha release?
>
>
> On Fri, Jul 25, 2014 at 10:46 AM, Patrick Hunt <ph...@apache.org> wrote:
>
> > 1974 has been committed (kudos folks!), along with a few other patches
> > that were ready to go.
> >
> > Hongchao, how is 1989 coming?
> >
> > Patrick
> >
> >
> > On Thu, Jul 24, 2014 at 4:50 PM, Patrick Hunt <ph...@apache.org> wrote:
> > > Can someone take a look at this issue? The windows c client build is
> > > failing for a while now, would be great to fix this for 3.5.0...
> > >
> > > ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved external
> > symbol"
> > >
> > > Patrick
> > >
> > > On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
> > > <rg...@itevenworks.net> wrote:
> > >> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
> > >>
> > >>> We've identified the issues with 1987, it would be good if folks
> could
> > >>> take a look.
> > >>
> > >>
> > >> Great - thanks Patrick. Added some comments to the patch.
> > >>
> > >>
> > >>> Nothing looks unsolvable, but we should tweak things a
> > >>> bit before 3.5.0, esp given the current upgrade experience. The new
> > >>> docs will help a lot - see
> > >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need
> to
> > >>> review and commit.
> > >>>
> > >>
> > >> Reading the docs, will follow-up with comments.
> > >>
> > >>
> > >> -rgs
> >
>
>
>
> --
> *- Hongchao Deng*
> *Software Engineer, CCE*
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Hongchao Deng <hd...@cloudera.com>.
ZK-1989 gets pretty complicated if it needs to support the full backward
compatibility.

My plan is divide a small task out: simply keep the old config and make it
work. There could be unexpected cases when users of old config tried to use
reconfig.

Is it okay for the first alpha release?


On Fri, Jul 25, 2014 at 10:46 AM, Patrick Hunt <ph...@apache.org> wrote:

> 1974 has been committed (kudos folks!), along with a few other patches
> that were ready to go.
>
> Hongchao, how is 1989 coming?
>
> Patrick
>
>
> On Thu, Jul 24, 2014 at 4:50 PM, Patrick Hunt <ph...@apache.org> wrote:
> > Can someone take a look at this issue? The windows c client build is
> > failing for a while now, would be great to fix this for 3.5.0...
> >
> > ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved external
> symbol"
> >
> > Patrick
> >
> > On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
> > <rg...@itevenworks.net> wrote:
> >> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
> >>
> >>> We've identified the issues with 1987, it would be good if folks could
> >>> take a look.
> >>
> >>
> >> Great - thanks Patrick. Added some comments to the patch.
> >>
> >>
> >>> Nothing looks unsolvable, but we should tweak things a
> >>> bit before 3.5.0, esp given the current upgrade experience. The new
> >>> docs will help a lot - see
> >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need to
> >>> review and commit.
> >>>
> >>
> >> Reading the docs, will follow-up with comments.
> >>
> >>
> >> -rgs
>



-- 
*- Hongchao Deng*
*Software Engineer, CCE*

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
I'm afraid that testCurrentObserverIsParticipantInNewConfig is still
failing - it seems to be more frequent now - 5 times in the last 24
hours on apache Jenkins. Sorry to bug ya Alex but could you take
another look?

https://builds.apache.org/job/ZooKeeper-trunk/2386/

Thanks,

Patrick

On Fri, Jul 25, 2014 at 10:46 AM, Patrick Hunt <ph...@apache.org> wrote:
> 1974 has been committed (kudos folks!), along with a few other patches
> that were ready to go.
>
> Hongchao, how is 1989 coming?
>
> Patrick
>
>
> On Thu, Jul 24, 2014 at 4:50 PM, Patrick Hunt <ph...@apache.org> wrote:
>> Can someone take a look at this issue? The windows c client build is
>> failing for a while now, would be great to fix this for 3.5.0...
>>
>> ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved external symbol"
>>
>> Patrick
>>
>> On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
>> <rg...@itevenworks.net> wrote:
>>> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
>>>
>>>> We've identified the issues with 1987, it would be good if folks could
>>>> take a look.
>>>
>>>
>>> Great - thanks Patrick. Added some comments to the patch.
>>>
>>>
>>>> Nothing looks unsolvable, but we should tweak things a
>>>> bit before 3.5.0, esp given the current upgrade experience. The new
>>>> docs will help a lot - see
>>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need to
>>>> review and commit.
>>>>
>>>
>>> Reading the docs, will follow-up with comments.
>>>
>>>
>>> -rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
1974 has been committed (kudos folks!), along with a few other patches
that were ready to go.

Hongchao, how is 1989 coming?

Patrick


On Thu, Jul 24, 2014 at 4:50 PM, Patrick Hunt <ph...@apache.org> wrote:
> Can someone take a look at this issue? The windows c client build is
> failing for a while now, would be great to fix this for 3.5.0...
>
> ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved external symbol"
>
> Patrick
>
> On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
> <rg...@itevenworks.net> wrote:
>> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> We've identified the issues with 1987, it would be good if folks could
>>> take a look.
>>
>>
>> Great - thanks Patrick. Added some comments to the patch.
>>
>>
>>> Nothing looks unsolvable, but we should tweak things a
>>> bit before 3.5.0, esp given the current upgrade experience. The new
>>> docs will help a lot - see
>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need to
>>> review and commit.
>>>
>>
>> Reading the docs, will follow-up with comments.
>>
>>
>> -rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Can someone take a look at this issue? The windows c client build is
failing for a while now, would be great to fix this for 3.5.0...

ZOOKEEPER-1974 winvs2008 jenkins job failing with "unresolved external symbol"

Patrick

On Thu, Jul 24, 2014 at 10:19 AM, Raúl Gutiérrez Segalés
<rg...@itevenworks.net> wrote:
> On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:
>
>> We've identified the issues with 1987, it would be good if folks could
>> take a look.
>
>
> Great - thanks Patrick. Added some comments to the patch.
>
>
>> Nothing looks unsolvable, but we should tweak things a
>> bit before 3.5.0, esp given the current upgrade experience. The new
>> docs will help a lot - see
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need to
>> review and commit.
>>
>
> Reading the docs, will follow-up with comments.
>
>
> -rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 24 July 2014 09:47, Patrick Hunt <ph...@apache.org> wrote:

> We've identified the issues with 1987, it would be good if folks could
> take a look.


Great - thanks Patrick. Added some comments to the patch.


> Nothing looks unsolvable, but we should tweak things a
> bit before 3.5.0, esp given the current upgrade experience. The new
> docs will help a lot - see
> https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need to
> review and commit.
>

Reading the docs, will follow-up with comments.


-rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
We've identified the issues with 1987, it would be good if folks could
take a look. Nothing looks unsolvable, but we should tweak things a
bit before 3.5.0, esp given the current upgrade experience. The new
docs will help a lot - see
https://issues.apache.org/jira/browse/ZOOKEEPER-1660 which we need to
review and commit.

Patrick

On Wed, Jul 23, 2014 at 5:07 PM, Patrick Hunt <ph...@apache.org> wrote:
> The units are passing and I'm able to build the release candidate
> however even a basic 3 node "start the cluster, stop it, then restart"
> is failing. We'll need to look into this before an RC can be built.
>
> See: https://issues.apache.org/jira/browse/ZOOKEEPER-1987
>
> Patrick
>
> On Wed, Jul 23, 2014 at 3:06 PM, Raúl Gutiérrez Segalés
> <rg...@itevenworks.net> wrote:
>> On 23 July 2014 14:48, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> FYI: Currently running some tests and I'm about to create the
>>> branch-3.5 branch.
>>>
>>
>> w00t :-)
>>
>>
>> -rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
The units are passing and I'm able to build the release candidate
however even a basic 3 node "start the cluster, stop it, then restart"
is failing. We'll need to look into this before an RC can be built.

See: https://issues.apache.org/jira/browse/ZOOKEEPER-1987

Patrick

On Wed, Jul 23, 2014 at 3:06 PM, Raúl Gutiérrez Segalés
<rg...@itevenworks.net> wrote:
> On 23 July 2014 14:48, Patrick Hunt <ph...@apache.org> wrote:
>
>> FYI: Currently running some tests and I'm about to create the
>> branch-3.5 branch.
>>
>
> w00t :-)
>
>
> -rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 23 July 2014 14:48, Patrick Hunt <ph...@apache.org> wrote:

> FYI: Currently running some tests and I'm about to create the
> branch-3.5 branch.
>

w00t :-)


-rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
FYI: Currently running some tests and I'm about to create the
branch-3.5 branch.

Patrick

On Wed, Jul 23, 2014 at 10:53 AM, Patrick Hunt <ph...@apache.org> wrote:
> I may try to create an RC0 today. Any objections? Something critical we need?
>
> Patrick
>
> On Tue, Jul 22, 2014 at 2:41 PM, Patrick Hunt <ph...@apache.org> wrote:
>> Thanks Alex. I've created a jira for this:
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1984 Let's discuss
>> further there.
>>
>> I will try the patch on my jenkins box later today.
>>
>> Thanks!
>>
>> Patrick
>>
>> On Tue, Jul 22, 2014 at 2:07 PM, Alexander Shraer <sh...@gmail.com> wrote:
>>> Actually if servers 1 and 3 are talking and 3 is elected and not 1, it means
>>> that 3 also saw the reconfig. So it should also complete it when it reboots.
>>> To debug this I suggest to print out the last seen config in the beginning
>>> of leader.lead().
>>>
>>> Is it possible that writing the .next file to disk fails ?
>>>
>>> Alternatively we could just remove this part of the test (attached patch) -
>>> the test's goal is to check that the leader times out when it looses a
>>> quorum of the new config, and the part of the test that fails now is not
>>> needed to check that. There are other tests in ReconfigRecoveryTest that are
>>> supposed to check recovery.
>>>
>>>
>>>
>>> On Tue, Jul 22, 2014 at 1:07 PM, Alexander Shraer <sh...@gmail.com> wrote:
>>>>
>>>> yep, I think what happens is that server 3 is becoming leader and not
>>>> server 1, so its not completing the reconfig. Let me think about how to
>>>> solve this...
>>>>
>>>>
>>>> On Tue, Jul 22, 2014 at 12:21 PM, Patrick Hunt <ph...@apache.org> wrote:
>>>>>
>>>>> Also if you want to submit a patch that provides more insight (logs)
>>>>> for that operation/test lmk and I'll be happy to review/commit it.
>>>>> Should help with debugging the issue and debugging in the field.
>>>>>
>>>>> Thanks!
>>>>>
>>>>> Patrick
>>>>>
>>>>> On Tue, Jul 22, 2014 at 12:17 PM, Patrick Hunt <ph...@apache.org> wrote:
>>>>> > Here's the logs (attached) for the test that failed. Nothing stuck out
>>>>> > at me - anything ring a bell?
>>>>> >
>>>>> > Patrick
>>>>> >
>>>>> > On Tue, Jul 22, 2014 at 12:10 PM, Alexander Shraer <sh...@gmail.com>
>>>>> > wrote:
>>>>> >> Unfortunately doesn't look like we have enough logging going on there.
>>>>> >> For example would be nice to know what's the committed config and last
>>>>> >> seen
>>>>> >> config
>>>>> >> of the leader when it comes up (leader.lead()). and what configuration
>>>>> >> is
>>>>> >> sent in the NEWLEADER message
>>>>> >> sent out in LeaderHandler:
>>>>> >>
>>>>> >>                 QuorumPacket newLeaderQP = new
>>>>> >> QuorumPacket(Leader.NEWLEADER,
>>>>> >>                         newLeaderZxid,
>>>>> >> leader.self.getLastSeenQuorumVerifier()
>>>>> >>                                 .toString().getBytes(), null);
>>>>> >>
>>>>> >>
>>>>> >> I didn't know about the option to have a separate administrative
>>>>> >> interface,
>>>>> >> and just followed the flow of other commands... I agree that it would
>>>>> >> be
>>>>> >> cleaner.
>>>>> >>
>>>>> >>
>>>>> >>
>>>>> >>
>>>>> >> On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org>
>>>>> >> wrote:
>>>>> >>
>>>>> >>> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer
>>>>> >>> <sh...@gmail.com>
>>>>> >>> wrote:
>>>>> >>> > Hmm. It doesn't really make sense to me - the reconfig should be
>>>>> >>> completed
>>>>> >>> > before
>>>>> >>> > the servers come up and process new ops. We submitted the reconfig
>>>>> >>> > to
>>>>> >>> > server 1, it timed out
>>>>> >>> > on new quorum, but when 1 becomes leader again after 2 restarts 1
>>>>> >>> > should
>>>>> >>> > complete the reconfig.
>>>>> >>> > is 1 becoming leader after 2 restarts ?
>>>>> >>> >
>>>>> >>>
>>>>> >>> What should I look for in the logs? Any specific log messages that
>>>>> >>> would help debug?
>>>>> >>>
>>>>> >>> > About admin controls - reconfig/getConfig are open to everyone,
>>>>> >>> > unless
>>>>> >>> you
>>>>> >>> > set permissions on the configuration znode being written during
>>>>> >>> > reconfig.
>>>>> >>> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
>>>>> >>> >
>>>>> >>> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
>>>>> >>> > request.authInfo);
>>>>> >>> >
>>>>> >>>
>>>>> >>> So I can turn off all access then? (read and write). Should we ship
>>>>> >>> that as the default? We should add that to the docs.
>>>>> >>>
>>>>> >>> In the past we've always tried to hide this type of information from
>>>>> >>> clients (e.g. we don't expose the zk server address to the client for
>>>>> >>> a session). This seems like a very big departure. Why didn't we move
>>>>> >>> it to a separate, administrative, interface?
>>>>> >>>
>>>>> >>> Patrick
>>>>> >>>
>>>>> >>> >
>>>>> >>> >
>>>>> >>> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com>
>>>>> >>> > wrote:
>>>>> >>> >
>>>>> >>> >> Looks like 3 hasn't been removed (unfortunately the assertion
>>>>> >>> >> doesn't
>>>>> >>> >> include any msg detail, but that's the way it looks to me like the
>>>>> >>> >> test is setup):
>>>>> >>> >>
>>>>> >>> >>         if (leavingServers != null) {
>>>>> >>> >>             for (String leaving : leavingServers)
>>>>> >>> >>
>>>>> >>> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>>>>> >>> >>         }
>>>>> >>> >>
>>>>> >>> >> which is called from:
>>>>> >>> >>
>>>>> >>> >>         qu.restart(2);
>>>>> >>> >>         // Now that 2 is back up, they'll complete the reconfig
>>>>> >>> removing 3
>>>>> >>> >> and
>>>>> >>> >>         // can process other ops.
>>>>> >>> >>         testServerHasConfig(zkArr[1], null, leavingServers);
>>>>> >>> >>
>>>>> >>> >> It seems like the problem is that testServerHasConfig is not
>>>>> >>> >> waiting
>>>>> >>> >> for the configuration to be updated? In this case 2 was just
>>>>> >>> >> restarted
>>>>> >>> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
>>>>> >>> >> which might be why you aren't seeing the issue? hence the
>>>>> >>> >> flakeyness)
>>>>> >>> >>
>>>>> >>> >> Patrick
>>>>> >>> >>
>>>>> >>> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer
>>>>> >>> >> <sh...@gmail.com>
>>>>> >>> >> wrote:
>>>>> >>> >> > Hi Patrick, I'm not sure why you're seeing this - it
>>>>> >>> >> > consistently
>>>>> >>> passes
>>>>> >>> >> on
>>>>> >>> >> > my machine. In case you'd like to take a look, the test has tons
>>>>> >>> >> > of
>>>>> >>> >> > comments explaining the scenario. Let me know how I can help.
>>>>> >>> >> >
>>>>> >>> >> >
>>>>> >>> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
>>>>> >>> wrote:
>>>>> >>> >> >
>>>>> >>> >> >> Hi Alex, I've also seen the test
>>>>> >>> >> >> "testLeaderTimesoutOnNewQuorum" fail
>>>>> >>> >> >> multiple times (not every time, but ~50%, so flakey) in the
>>>>> >>> >> >> last few
>>>>> >>> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
>>>>> >>> >> >> jenkins, I haven't see any other failures than this during the
>>>>> >>> >> >> past
>>>>> >>> >> >> few days).
>>>>> >>> >> >>
>>>>> >>> >> >> junit.framework.AssertionFailedError
>>>>> >>> >> >> at
>>>>> >>> >> >>
>>>>> >>> >>
>>>>> >>>
>>>>> >>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>>>>> >>> >> >> at
>>>>> >>> >> >>
>>>>> >>> >>
>>>>> >>>
>>>>> >>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>>>>> >>> >> >> at
>>>>> >>> >> >>
>>>>> >>> >>
>>>>> >>>
>>>>> >>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>>>> >>> >> >>
>>>>> >>> >> >> Patrick
>>>>> >>> >> >>
>>>>> >>> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer
>>>>> >>> >> >> <shralex@gmail.com
>>>>> >>> >
>>>>> >>> >> >> wrote:
>>>>> >>> >> >> > Hi Rakesh,
>>>>> >>> >> >> >
>>>>> >>> >> >> > Thanks for looking at this. In general even if we find the
>>>>> >>> >> >> > bug
>>>>> >>> since
>>>>> >>> >> we
>>>>> >>> >> >> > should test it before committing a fix, it seems better to
>>>>> >>> >> >> > remove
>>>>> >>> the
>>>>> >>> >> >> test
>>>>> >>> >> >> > for now and debug this on a build machine. I'm trying to get
>>>>> >>> access to
>>>>> >>> >> >> it.
>>>>> >>> >> >> >
>>>>> >>> >> >> > Looking at this log:
>>>>> >>> >> >> >
>>>>> >>> >> >>
>>>>> >>> >>
>>>>> >>>
>>>>> >>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>>>>> >>> >> >> >
>>>>> >>> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
>>>>> >>> version
>>>>> >>> >> >> 200000000
>>>>> >>> >> >> > is already being sent around as committed!
>>>>> >>> >> >> >
>>>>> >>> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>>>>> >>> >> >> >
>>>>> >>> >>
>>>>> >>> >> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
>>>>> >>> ]
>>>>> >>> >> >> > - 2 Received version: 200000000 my version: 0
>>>>> >>> >> >> >
>>>>> >>> >> >> >
>>>>> >>> >> >> > and also in leader election messages.
>>>>> >>> >> >> >
>>>>> >>> >> >> > Also weird is that the version of 2 is 0 as if it is a
>>>>> >>> >> >> > joiner,
>>>>> >>> >> whereas we
>>>>> >>> >> >> > explicitly started it with 100000000.
>>>>> >>> >> >> > Then it makes sense that the new config can't be committed
>>>>> >>> >> >> > since
>>>>> >>> its
>>>>> >>> >> >> > version is not high enough...
>>>>> >>> >> >> >
>>>>> >>> >> >> > I wonder if its possible that not all servers from the
>>>>> >>> >> >> > previous
>>>>> >>> test
>>>>> >>> >> are
>>>>> >>> >> >> > dead and they are interfering...
>>>>> >>> >> >> >
>>>>> >>> >> >> >
>>>>> >>> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R
>>>>> >>> >> >> > <ra...@huawei.com>
>>>>> >>> wrote:
>>>>> >>> >> >> >
>>>>> >>> >> >> >> Hi Alex,
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> Yeah it is consistently passing in my machine also.
>>>>> >>> >> >> >>
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> I have quickly gone through the
>>>>> >>> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>>>>> >>> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
>>>>> >>> version)
>>>>> >>> >> >> has
>>>>> >>> >> >> >> not taken and still leader election is seeing 100000000
>>>>> >>> >> >> >> (n.config
>>>>> >>> >> >> version).
>>>>> >>> >> >> >> Unfortunately I didn't find the reason for not considering
>>>>> >>> >> >> >> the
>>>>> >>> >> updated
>>>>> >>> >> >> >> config version.
>>>>> >>> >> >> >>
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> Reference:
>>>>> >>> >> >> >>
>>>>> >>> >> >>
>>>>> >>> >>
>>>>> >>>
>>>>> >>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>>>> >>> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922]
>>>>> >>> >> >> >> -
>>>>> >>> >> >> >> Notification time out: 51200
>>>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>>>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>>>>> >>> >> >> >> Notification:
>>>>> >>> 2
>>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>>> >>> >> >> >> (n.zxid), 0x1
>>>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>>>> >>> LOOKING
>>>>> >>> >> (my
>>>>> >>> >> >> >> state)100000000 (n.config version)
>>>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>>>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>>>>> >>> >> >> >> Notification:
>>>>> >>> 2
>>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>>> >>> >> >> >> (n.zxid), 0x1
>>>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>>>> >>> LOOKING
>>>>> >>> >> (my
>>>>> >>> >> >> >> state)100000000 (n.config version)
>>>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>>>>> >>> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922]
>>>>> >>> >> >> >> -
>>>>> >>> >> >> >> Notification time out: 51200
>>>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>>>>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>>>>> >>> >> >> >> Notification:
>>>>> >>> 2
>>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>>> >>> >> >> >> (n.zxid), 0x1
>>>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>>>> >>> LOOKING
>>>>> >>> >> (my
>>>>> >>> >> >> >> state)100000000 (n.config version)
>>>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>>>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>>>>> >>> >> >> >> Notification:
>>>>> >>> 2
>>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>>> >>> >> >> >> (n.zxid), 0x1
>>>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>>>> >>> LOOKING
>>>>> >>> >> (my
>>>>> >>> >> >> >> state)100000000 (n.config version)
>>>>> >>> >> >> >>
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>>>>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>>>>> >>> >> >> >> Notification:
>>>>> >>> 2
>>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>>> >>> >> >> >> (n.zxid), 0x1
>>>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>>>> >>> LOOKING
>>>>> >>> >> (my
>>>>> >>> >> >> >> state)100000000 (n.config version)
>>>>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>>>>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>>>>> >>> >> >> >> Notification:
>>>>> >>> 2
>>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>>> >>> >> >> >> (n.zxid), 0x1
>>>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>>>> >>> LOOKING
>>>>> >>> >> (my
>>>>> >>> >> >> >> state)100000000 (n.config version)
>>>>> >>> >> >> >>
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> -Rakesh
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> -----Original Message-----
>>>>> >>> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>>>>> >>> >> >> >> Sent: 22 July 2014 11:57
>>>>> >>> >> >> >> To: dev@zookeeper.apache.org
>>>>> >>> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> I tried to look into it, but the test consistently passes
>>>>> >>> >> >> >> locally
>>>>> >>> on
>>>>> >>> >> two
>>>>> >>> >> >> >> machines.
>>>>> >>> >> >> >> I don't currently have access to the build machine, but I
>>>>> >>> >> >> >> can try
>>>>> >>> to
>>>>> >>> >> ask
>>>>> >>> >> >> >> for access.
>>>>> >>> >> >> >> Unless anyone has a better suggestion, we could remove the
>>>>> >>> >> >> >> failing
>>>>> >>> >> test
>>>>> >>> >> >> in
>>>>> >>> >> >> >> the meanwhile and open a JIRA to add it back...
>>>>> >>> >> >> >>
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt
>>>>> >>> >> >> >> <ph...@apache.org>
>>>>> >>> >> >> wrote:
>>>>> >>> >> >> >>
>>>>> >>> >> >> >> > I'm seeing alot of test failures in
>>>>> >>> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone
>>>>> >>> >> >> >> > take a
>>>>> >>> >> look?
>>>>> >>> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
>>>>> >>> >> >> >> >
>>>>> >>> >> >> >> >
>>>>> >>> >> >> >> >
>>>>> >>> >> >>
>>>>> >>>
>>>>> >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>>>>> >>> >> >> >> >
>>>>> >>> >>
>>>>> >>> >> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>>>>> >>> >> >> >> > tabpanel#comment-14069024
>>>>> >>> >> >> >> >
>>>>> >>> >> >> >> > Patrick
>>>>> >>> >> >> >> >
>>>>> >>> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>>>>> >>> >> >> >> > <ra...@gmail.com> wrote:
>>>>> >>> >> >> >> > > lgtm +1
>>>>> >>> >> >> >> > >
>>>>> >>> >> >> >> > >
>>>>> >>> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>>>>> >>> >> >> >> > > <fp...@yahoo.com.invalid>
>>>>> >>> >> >> >> > wrote:
>>>>> >>> >> >> >> > >
>>>>> >>> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
>>>>> >>> release, I
>>>>> >>> >> >> >> > >> +think
>>>>> >>> >> >> >> > 72
>>>>> >>> >> >> >> > >> biz hours is enough for the vote.
>>>>> >>> >> >> >> > >>
>>>>> >>> >> >> >> > >> -Flavio
>>>>> >>> >> >> >> > >>
>>>>> >>> >> >> >> > >> > -----Original Message-----
>>>>> >>> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>>>>> >>> >> >> >> > >> > Sent: 21 July 2014 18:55
>>>>> >>> >> >> >> > >> > To: DevZooKeeper
>>>>> >>> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>>>>> >>> >> >> >> > >> >
>>>>> >>> >> >> >> > >> > I fixed a number of issues. I also started a few
>>>>> >>> >> >> >> > >> > threads
>>>>> >>> with
>>>>> >>> >> >> >> > >> > builds@
>>>>> >>> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and
>>>>> >>> >> >> >> > >> > I
>>>>> >>> worked
>>>>> >>> >> >> >> > through a
>>>>> >>> >> >> >> > >> > number of findbugs issues, it's not closed yet but
>>>>> >>> >> >> >> > >> > it's
>>>>> >>> pretty
>>>>> >>> >> >> >> close.
>>>>> >>> >> >> >> > >> >
>>>>> >>> >> >> >> > >> > I don't see why we can't create an RC and start
>>>>> >>> >> >> >> > >> > voting this
>>>>> >>> >> week
>>>>> >>> >> >> >> > though.
>>>>> >>> >> >> >> > >> > Anyone disagree?
>>>>> >>> >> >> >> > >> >
>>>>> >>> >> >> >> > >> > How long should we let the vote run, the std 72 biz
>>>>> >>> >> >> >> > >> > hours
>>>>> >>> or
>>>>> >>> >> >> >> > >> > should we
>>>>> >>> >> >> >> > >> plan
>>>>> >>> >> >> >> > >> > for more to allow folks more time to test?
>>>>> >>> >> >> >> > >> >
>>>>> >>> >> >> >> > >> > Patrick
>>>>> >>> >> >> >> > >> >
>>>>> >>> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez
>>>>> >>> >> >> >> > >> > Segalés
>>>>> >>> >> >> >> > >> > <rg...@itevenworks.net> wrote:
>>>>> >>> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt
>>>>> >>> >> >> >> > >> > > <ph...@apache.org>
>>>>> >>> >> wrote:
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins
>>>>> >>> >> >> >> > >> > >> ZK
>>>>> >>> jobs -
>>>>> >>> >> I'm
>>>>> >>> >> >> >> > trying
>>>>> >>> >> >> >> > >> > >> to fix some of the jobs that were broken during
>>>>> >>> >> >> >> > >> > >> the
>>>>> >>> recent
>>>>> >>> >> >> >> > >> > >> host upgrade.
>>>>> >>> >> >> >> > >> > >>
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > > How are things looking? Is it likely that we can
>>>>> >>> >> >> >> > >> > > have a
>>>>> >>> >> 3.5.0
>>>>> >>> >> >> >> > >> > > alpha release week or are we still blocked on
>>>>> >>> >> >> >> > >> > > Jenkins?
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > > -rgs
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >
>>>>> >>> >> >> >> > >> > >> Patrick
>>>>> >>> >> >> >> > >> > >>
>>>>> >>> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>>>>> >>> >> >> >> > >> > >> <mi...@cs.stanford.edu>
>>>>> >>> >> >> >> > >> > >> wrote:
>>>>> >>> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>>>>> >>> >> >> >> > >> > >> >
>>>>> >>> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander
>>>>> >>> >> >> >> > >> > >> > Shraer
>>>>> >>> >> >> >> > >> > >> > <sh...@gmail.com>
>>>>> >>> >> >> >> > >> > >> wrote:
>>>>> >>> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille
>>>>> >>> >> >> >> > >> > >> >> gave a
>>>>> >>> +1
>>>>> >>> >> and
>>>>> >>> >> >> >> > >> > >> >> all
>>>>> >>> >> >> >> > >> > >> subsequent
>>>>> >>> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>>>>> >>> >> >> >> > >> > >> >>
>>>>> >>> >> >> >> > >> > >> >>
>>>>> >>> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>>>>> >>> >> >> >> > >> > >> >> <phunt@apache.org
>>>>> >>> >> >> >> > >
>>>>> >>> >> >> >> > >> > wrote:
>>>>> >>> >> >> >> > >> > >> >>
>>>>> >>> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all
>>>>> >>> >> >> >> > >> > >> >>> failing due
>>>>> >>> to,
>>>>> >>> >> >> >> > >> > >> >>> for
>>>>> >>> >> >> >> > e.g.
>>>>> >>> >> >> >> > >> > >> >>> findbugs issues. At the very least our
>>>>> >>> >> >> >> > >> > >> >>> build/test/ci
>>>>> >>> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok
>>>>> >>> >> >> >> > >> > >> >>> (the
>>>>> >>> recent
>>>>> >>> >> >> >> > >> > >> >>> startServer fix
>>>>> >>> >> >> >> > and
>>>>> >>> >> >> >> > >> > >> >>> some other flakeys that have been addressed go
>>>>> >>> >> >> >> > >> > >> >>> a
>>>>> >>> long
>>>>> >>> >> way
>>>>> >>> >> >> >> > >> > >> >>> on
>>>>> >>> >> >> >> > that
>>>>> >>> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should
>>>>> >>> >> >> >> > >> > >> >>> be
>>>>> >>> >> cleaned
>>>>> >>> >> >> >> > >> > >> >>> up before we cut a release. I started a
>>>>> >>> >> >> >> > >> > >> >>> separate
>>>>> >>> >> thread to
>>>>> >>> >> >> >> > >> > >> >>> discuss
>>>>> >>> >> >> >> > >> the
>>>>> >>> >> >> >> > >> > findbugs issue.
>>>>> >>> >> >> >> > >> > >> >>>
>>>>> >>> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>>>>> >>> >> >> >> > >> > >> >>>
>>>>> >>> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul
>>>>> >>> >> >> >> > >> > >> >>> on
>>>>> >>> 1919?
>>>>> >>> >> >> >> > >> > >> >>>
>>>>> >>> >> >> >> > >> > >> >>> Patrick
>>>>> >>> >> >> >> > >> > >> >>>
>>>>> >>> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio
>>>>> >>> >> >> >> > >> > >> >>> Junqueira
>>>>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>>>>> >>> >> >> >> > >> > >> >>> > My take:
>>>>> >>> >> >> >> > >> > >> >>> >
>>>>> >>> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker
>>>>> >>> >> >> >> > >> > >> >>> > and
>>>>> >>> it
>>>>> >>> >> can
>>>>> >>> >> >> >> > >> > >> >>> > go
>>>>> >>> >> >> >> > in.
>>>>> >>> >> >> >> > >> > >> >>> > See
>>>>> >>> >> >> >> > >> > >> the
>>>>> >>> >> >> >> > >> > >> >>> jira for comments.
>>>>> >>> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the
>>>>> >>> >> >> >> > >> > >> >>> > first
>>>>> >>> alpha.
>>>>> >>> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha
>>>>> >>> >> >> >> > >> > >> >>> > depending on
>>>>> >>> >> >> >> > >> > >> >>> > ZK-1919
>>>>> >>> >> >> >> > and
>>>>> >>> >> >> >> > >> > >> ZK-1910,
>>>>> >>> >> >> >> > >> > >> >>> we can leave it for the second alpha.
>>>>> >>> >> >> >> > >> > >> >>> >
>>>>> >>> >> >> >> > >> > >> >>> > If you agree with this, then we should be
>>>>> >>> >> >> >> > >> > >> >>> > able to
>>>>> >>> >> cut a
>>>>> >>> >> >> >> > >> > >> >>> > candidate by
>>>>> >>> >> >> >> > >> > >> the
>>>>> >>> >> >> >> > >> > >> >>> end of this week.
>>>>> >>> >> >> >> > >> > >> >>> >
>>>>> >>> >> >> >> > >> > >> >>> > -Flavio
>>>>> >>> >> >> >> > >> > >> >>> >
>>>>> >>> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>>>>> >>> >> >> >> > >> > >> >>> > <ph...@apache.org>
>>>>> >>> >> >> >> > >> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >
>>>>> >>> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c
>>>>> >>> >> >> >> > >> > >> >>> >> client
>>>>> >>> >> test
>>>>> >>> >> >> >> > >> > >> >>> >> log output
>>>>> >>> >> >> >> > >> > >> here
>>>>> >>> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
>>>>> >>> >> >> >> > >> > >> >>> >>
>>>>> >>> >> >> >> > >> > >> >>>
>>>>> >>> >> >> >> > >> > >>
>>>>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>>>>> >>> >> >> >> > >> > >> r-
>>>>> >>> >> >> >> > >> > trunk
>>>>> >>> >> >> >> > >> > >> /2372/
>>>>> >>> >> >> >> > >> > >> >>> >>
>>>>> >>> >> >> >> > >> > >> >>> >> Patrick
>>>>> >>> >> >> >> > >> > >> >>> >>
>>>>> >>> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick
>>>>> >>> >> >> >> > >> > >> >>> >> Hunt
>>>>> >>> >> >> >> > >> > >> >>> >> <ph...@apache.org>
>>>>> >>> >> >> >> > >> > >> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0
>>>>> >>> >> >> >> > >> > >> >>> >>> (not
>>>>> >>> >> clear
>>>>> >>> >> >> >> > >> > >> >>> >>> to me which
>>>>> >>> >> >> >> > >> > >> >>> >>> one(s?) is new?)
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported
>>>>> >>> >> >> >> > >> > >> >>> >>> is
>>>>> >>> hitting
>>>>> >>> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as
>>>>> >>> >> >> >> > >> > >> >>> >>> well.
>>>>> >>> I've
>>>>> >>> >> >> >> > >> > >> >>> >>> updated the "archive" list
>>>>> >>> >> >> >> > >> > >> to
>>>>> >>> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So
>>>>> >>> >> >> >> > >> > >> >>> >>> while it
>>>>> >>> >> won't
>>>>> >>> >> >> >> > >> > >> >>> >>> go
>>>>> >>> >> >> >> > to
>>>>> >>> >> >> >> > >> > >> console
>>>>> >>> >> >> >> > >> > >> >>> >>> at least we can debug when there is a
>>>>> >>> >> >> >> > >> > >> >>> >>> failure.
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews
>>>>> >>> >> >> >> > >> > >> >>> >>> for the
>>>>> >>> >> jetty
>>>>> >>> >> >> >> > server
>>>>> >>> >> >> >> > >> > >> support
>>>>> >>> >> >> >> > >> > >> >>> >>> and it looks like that should be ready
>>>>> >>> >> >> >> > >> > >> >>> >>> soon.
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>> Raul also requested that someone
>>>>> >>> >> >> >> > >> > >> >>> >>> prioritize
>>>>> >>> >> reviewing
>>>>> >>> >> >> >> > >> > >> "ZOOKEEPER-1919
>>>>> >>> >> >> >> > >> > >> >>> >>> Update the C implementation of
>>>>> >>> >> >> >> > >> > >> >>> >>> removeWatches to
>>>>> >>> >> have
>>>>> >>> >> >> >> > >> > >> >>> >>> it
>>>>> >>> >> >> >> > >> > match
>>>>> >>> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it
>>>>> >>> >> >> >> > >> > >> >>> >>> in
>>>>> >>> 3.5.0.
>>>>> >>> >> >> >> > >> Flavio/Michi?
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the
>>>>> >>> >> >> >> > >> > >> >>> >>> flakey c
>>>>> >>> >> client
>>>>> >>> >> >> >> > >> > >> >>> >>> reconfig
>>>>> >>> >> >> >> > >> > >> test -
>>>>> >>> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test
>>>>> >>> >> >> >> > >> > >> >>> >>> infra!
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like
>>>>> >>> >> >> >> > >> > >> >>> >>> we're
>>>>> >>> >> pretty
>>>>> >>> >> >> >> > close.
>>>>> >>> >> >> >> > >> > >> >>> >>> Do
>>>>> >>> >> >> >> > >> > >> folks
>>>>> >>> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at
>>>>> >>> >> >> >> > >> > >> >>> >>> this
>>>>> >>> point?
>>>>> >>> >> >> >> > >> > >> >>> >>> (with a few
>>>>> >>> >> >> >> > >> > >> pending
>>>>> >>> >> >> >> > >> > >> >>> >>> as above)
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>> Patrick
>>>>> >>> >> >> >> > >> > >> >>> >>>
>>>>> >>> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl
>>>>> >>> >> >> >> > >> > >> >>> >>> Gutiérrez
>>>>> >>> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio
>>>>> >>> >> >> >> > >> > >> >>> >>>> Junqueira"
>>>>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>>>>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much,
>>>>> >>> >> >> >> > >> > >> >>> >>>>> what if
>>>>> >>> we
>>>>> >>> >> >> >> > >> > >> >>> >>>>> release
>>>>> >>> >> >> >> > an
>>>>> >>> >> >> >> > >> > >> >>> >>>>> alpha
>>>>> >>> >> >> >> > >> > >> >>> version
>>>>> >>> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one
>>>>> >>> >> >> >> > >> > >> >>> >>>> in
>>>>> >>> 2-3
>>>>> >>> >> >> >> > >> > >> >>> >>>> weeks
>>>>> >>> >> >> >> > time?
>>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>> +1
>>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>> -rgs
>>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>> -Flavio
>>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
>>>>> >>> Gutiérrez
>>>>> >>> >> >> >> > Segalés <
>>>>> >>> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>>>>> >>> >> >> >> > >> > wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> 5.1.0
>>>>> >>> >> (from 8
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> in
>>>>> >>> >> >> >> > the
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> last
>>>>> >>> >> >> >> > >> > >> >>> check).
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Michi, and
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Camille is
>>>>> >>> >> >> >> > >> > >> threatening
>>>>> >>> >> >> >> > >> > >> >>> to
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> in
>>>>> >>> >> general
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> on
>>>>> >>> >> >> >> > the
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> to
>>>>> >>> see.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> consider -
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> should we drop
>>>>> >>> >> >> >> > >> > >> jdk6
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> by
>>>>> >>> Oracle
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> but I suspect
>>>>> >>> >> >> >> > >> > >> some
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> gotta
>>>>> >>> move
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> forward though,
>>>>> >>> >> >> >> > >> > >> >>> can't
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> that we
>>>>> >>> are
>>>>> >>> >> >> >> > currently
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> and
>>>>> >>> 8.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
>>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>>> >>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> using
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> Observers with the
>>>>> >>> >> >> >> > >> > >> >>> upcoming
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> usage go
>>>>> >>> >> >> >> wild...).
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>> -rgs
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Patrick
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
>>>>> >>> >> Junqueira
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
>>>>> >>> already,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
>>>>> >>> >> >> >> > >> > >> >>> >>>> there. I
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> LETest
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> failed in the
>>>>> >>> >> >> >> > >> > >> last QA
>>>>> >>> >> >> >> > >> > >> >>> run
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> affect
>>>>> >>> >> LETest,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> and
>>>>> >>> >> >> >> > in
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> fact
>>>>> >>> >> >> >> > >> > >> it
>>>>> >>> >> >> >> > >> > >> >>> fails
>>>>> >>> >> >> >> > >> > >> >>> >>>> in
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> failure
>>>>> >>> >> doesn't
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> seem
>>>>> >>> >> >> >> > to
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> be
>>>>> >>> >> >> >> > >> > >> >>> related
>>>>> >>> >> >> >> > >> > >> >>> >>>> to the
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> say
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
>>>>> >>> >> >> >> > >> > >> about
>>>>> >>> >> >> >> > >> > >> >>> it.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> Patrick
>>>>> >>> >> Hunt <
>>>>> >>> >> >> >> > >> > >> phunt@apache.org>
>>>>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> can
>>>>> >>> get
>>>>> >>> >> >> >> > >> > >> reviewed/finalized
>>>>> >>> >> >> >> > >> > >> >>> before
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
>>>>> >>> cutting
>>>>> >>> >> an
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> RC
>>>>> >>> >> >> >> > >> > soonish...
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Flavio
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> <fp...@yahoo.com.invalid>
>>>>> >>> >> >> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
>>>>> >>> versions.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810)
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> and
>>>>> >>> >> ZK-1863
>>>>> >>> >> >> in.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> They are
>>>>> >>> >> >> >> > >> > >> both
>>>>> >>> >> >> >> > >> > >> >>> >>>> patch
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> is
>>>>> >>> still
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> open because we
>>>>> >>> >> >> >> > >> > >> >>> need a
>>>>> >>> >> >> >> > >> > >> >>> >>>> 3.4
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> Hunt
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>>>>> >>> >> >> >> > >> > >> >>> wrote:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> about it
>>>>> >>> for
>>>>> >>> >> a
>>>>> >>> >> >> >> > while, a
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> few
>>>>> >>> >> >> >> > >> > >> >>> people
>>>>> >>> >> >> >> > >> > >> >>> >>>> have
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
>>>>> >>> contacted
>>>>> >>> >> me
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
>>>>> >>> >> >> >> > >> > >> that
>>>>> >>> >> >> >> > >> > >> >>> they
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>>>> >>> >> first
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
>>>>> >>> >> >> >> > >> > release.
>>>>> >>> >> >> >> > >> > >> Every
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> wait for
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>>>>> >>> >> >> >> > >> > >> never
>>>>> >>> >> >> >> > >> > >> >>> get
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> tons of
>>>>> >>> >> great
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
>>>>> >>> >> >> >> > >> > >> >>> lots of
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
>>>>> >>> release so
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
>>>>> >>> >> >> >> > >> > >> use
>>>>> >>> >> >> >> > >> > >> >>> it,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> stable
>>>>> >>> except
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
>>>>> >>> >> >> >> > >> > >> >>> flakey
>>>>> >>> >> >> >> > >> > >> >>> >>>> test
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> committed
>>>>> >>> >> today to
>>>>> >>> >> >> >> > >> > trunk.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
>>>>> >>> >> >> >> > >> > >> that
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>>>> >>> code on
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
>>>>> >>> >> >> >> > and
>>>>> >>> >> >> >> > >> > jdk8.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
>>>>> >>> should
>>>>> >>> >> plan
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> our
>>>>> >>> >> >> >> > >> > >> releases:
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> a
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>>>>> >>> >> >> >> > for
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> some
>>>>> >>> >> >> >> > >> > >> time.
>>>>> >>> >> >> >> > >> > >> >>> >>>> What I
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> similar to
>>>>> >>> >> what
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> we
>>>>> >>> >> >> >> > did
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for
>>>>> >>> >> >> >> > >> > >> 3.4.
>>>>> >>> >> >> >> > >> > >> >>> >>>> (this is
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> during
>>>>> >>> >> their
>>>>> >>> >> >> >> > Hadoop 2
>>>>> >>> >> >> >> > >> > >> release
>>>>> >>> >> >> >> > >> > >> >>> >>>> cycle)
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> releases,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
>>>>> >>> >> >> >> > >> > >> can run
>>>>> >>> >> >> >> > >> > >> >>> >>>> and
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
>>>>> >>> blockers
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> and
>>>>> >>> >> >> >> > feel
>>>>> >>> >> >> >> > >> > >> >>> comfortable
>>>>> >>> >> >> >> > >> > >> >>> >>>> with
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
>>>>> >>> switch
>>>>> >>> >> to
>>>>> >>> >> >> >> beta.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
>>>>> >>> >> >> >> > >> > >> get
>>>>> >>> >> >> >> > >> > >> >>> >>>> some
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> alpha/beta
>>>>> >>> >> moniker
>>>>> >>> >> >> >> > >> > and
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
>>>>> >>> >> >> >> > >> > >> >>> making
>>>>> >>> >> >> >> > >> > >> >>> >>>> it
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> will
>>>>> >>> >> become
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>>>> >>> >> >> >> > >> > >> >>> "current/stable"
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.1-alpha (3
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
>>>>> >>> >> 3.5.3-beta
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.5-beta
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> alpha/beta
>>>>> >>> but
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
>>>>> >>> >> >> >> > >> > >> "stable" vs
>>>>> >>> >> >> >> > >> > >> >>> >>>> 3.4.x,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> still
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
>>>>> >>> >> >> >> > >> > >> shake
>>>>> >>> >> >> >> > >> > >> >>> >>>> out)
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> releases
>>>>> >>> for
>>>>> >>> >> >> >> > production
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
>>>>> >>> >> >> >> > >> > >> >>> stable,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>> etc...
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently,
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> are any
>>>>> >>> of
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
>>>>> >>> >> >> >> > >> > >> that
>>>>> >>> >> >> >> > >> > >> >>> >>>> should
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> a
>>>>> >>> couple
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>>>>> >>> >> >> >> > >> > >> find
>>>>> >>> >> >> >> > >> > >> >>> >>>> this a
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ball
>>>>> >>> >> rolling to
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
>>>>> >>> >> >> >> > an
>>>>> >>> >> >> >> > >> RC.
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>>> >>> >> >> >> > >> > >> >>> >
>>>>> >>> >> >> >> > >> > >> >>>
>>>>> >>> >> >> >> > >> > >>
>>>>> >>> >> >> >> > >>
>>>>> >>> >> >> >> > >>
>>>>> >>> >> >> >> >
>>>>> >>> >> >> >>
>>>>> >>> >> >>
>>>>> >>> >>
>>>>> >>>
>>>>
>>>>
>>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
I may try to create an RC0 today. Any objections? Something critical we need?

Patrick

On Tue, Jul 22, 2014 at 2:41 PM, Patrick Hunt <ph...@apache.org> wrote:
> Thanks Alex. I've created a jira for this:
> https://issues.apache.org/jira/browse/ZOOKEEPER-1984 Let's discuss
> further there.
>
> I will try the patch on my jenkins box later today.
>
> Thanks!
>
> Patrick
>
> On Tue, Jul 22, 2014 at 2:07 PM, Alexander Shraer <sh...@gmail.com> wrote:
>> Actually if servers 1 and 3 are talking and 3 is elected and not 1, it means
>> that 3 also saw the reconfig. So it should also complete it when it reboots.
>> To debug this I suggest to print out the last seen config in the beginning
>> of leader.lead().
>>
>> Is it possible that writing the .next file to disk fails ?
>>
>> Alternatively we could just remove this part of the test (attached patch) -
>> the test's goal is to check that the leader times out when it looses a
>> quorum of the new config, and the part of the test that fails now is not
>> needed to check that. There are other tests in ReconfigRecoveryTest that are
>> supposed to check recovery.
>>
>>
>>
>> On Tue, Jul 22, 2014 at 1:07 PM, Alexander Shraer <sh...@gmail.com> wrote:
>>>
>>> yep, I think what happens is that server 3 is becoming leader and not
>>> server 1, so its not completing the reconfig. Let me think about how to
>>> solve this...
>>>
>>>
>>> On Tue, Jul 22, 2014 at 12:21 PM, Patrick Hunt <ph...@apache.org> wrote:
>>>>
>>>> Also if you want to submit a patch that provides more insight (logs)
>>>> for that operation/test lmk and I'll be happy to review/commit it.
>>>> Should help with debugging the issue and debugging in the field.
>>>>
>>>> Thanks!
>>>>
>>>> Patrick
>>>>
>>>> On Tue, Jul 22, 2014 at 12:17 PM, Patrick Hunt <ph...@apache.org> wrote:
>>>> > Here's the logs (attached) for the test that failed. Nothing stuck out
>>>> > at me - anything ring a bell?
>>>> >
>>>> > Patrick
>>>> >
>>>> > On Tue, Jul 22, 2014 at 12:10 PM, Alexander Shraer <sh...@gmail.com>
>>>> > wrote:
>>>> >> Unfortunately doesn't look like we have enough logging going on there.
>>>> >> For example would be nice to know what's the committed config and last
>>>> >> seen
>>>> >> config
>>>> >> of the leader when it comes up (leader.lead()). and what configuration
>>>> >> is
>>>> >> sent in the NEWLEADER message
>>>> >> sent out in LeaderHandler:
>>>> >>
>>>> >>                 QuorumPacket newLeaderQP = new
>>>> >> QuorumPacket(Leader.NEWLEADER,
>>>> >>                         newLeaderZxid,
>>>> >> leader.self.getLastSeenQuorumVerifier()
>>>> >>                                 .toString().getBytes(), null);
>>>> >>
>>>> >>
>>>> >> I didn't know about the option to have a separate administrative
>>>> >> interface,
>>>> >> and just followed the flow of other commands... I agree that it would
>>>> >> be
>>>> >> cleaner.
>>>> >>
>>>> >>
>>>> >>
>>>> >>
>>>> >> On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org>
>>>> >> wrote:
>>>> >>
>>>> >>> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer
>>>> >>> <sh...@gmail.com>
>>>> >>> wrote:
>>>> >>> > Hmm. It doesn't really make sense to me - the reconfig should be
>>>> >>> completed
>>>> >>> > before
>>>> >>> > the servers come up and process new ops. We submitted the reconfig
>>>> >>> > to
>>>> >>> > server 1, it timed out
>>>> >>> > on new quorum, but when 1 becomes leader again after 2 restarts 1
>>>> >>> > should
>>>> >>> > complete the reconfig.
>>>> >>> > is 1 becoming leader after 2 restarts ?
>>>> >>> >
>>>> >>>
>>>> >>> What should I look for in the logs? Any specific log messages that
>>>> >>> would help debug?
>>>> >>>
>>>> >>> > About admin controls - reconfig/getConfig are open to everyone,
>>>> >>> > unless
>>>> >>> you
>>>> >>> > set permissions on the configuration znode being written during
>>>> >>> > reconfig.
>>>> >>> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
>>>> >>> >
>>>> >>> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
>>>> >>> > request.authInfo);
>>>> >>> >
>>>> >>>
>>>> >>> So I can turn off all access then? (read and write). Should we ship
>>>> >>> that as the default? We should add that to the docs.
>>>> >>>
>>>> >>> In the past we've always tried to hide this type of information from
>>>> >>> clients (e.g. we don't expose the zk server address to the client for
>>>> >>> a session). This seems like a very big departure. Why didn't we move
>>>> >>> it to a separate, administrative, interface?
>>>> >>>
>>>> >>> Patrick
>>>> >>>
>>>> >>> >
>>>> >>> >
>>>> >>> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com>
>>>> >>> > wrote:
>>>> >>> >
>>>> >>> >> Looks like 3 hasn't been removed (unfortunately the assertion
>>>> >>> >> doesn't
>>>> >>> >> include any msg detail, but that's the way it looks to me like the
>>>> >>> >> test is setup):
>>>> >>> >>
>>>> >>> >>         if (leavingServers != null) {
>>>> >>> >>             for (String leaving : leavingServers)
>>>> >>> >>
>>>> >>> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>>>> >>> >>         }
>>>> >>> >>
>>>> >>> >> which is called from:
>>>> >>> >>
>>>> >>> >>         qu.restart(2);
>>>> >>> >>         // Now that 2 is back up, they'll complete the reconfig
>>>> >>> removing 3
>>>> >>> >> and
>>>> >>> >>         // can process other ops.
>>>> >>> >>         testServerHasConfig(zkArr[1], null, leavingServers);
>>>> >>> >>
>>>> >>> >> It seems like the problem is that testServerHasConfig is not
>>>> >>> >> waiting
>>>> >>> >> for the configuration to be updated? In this case 2 was just
>>>> >>> >> restarted
>>>> >>> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
>>>> >>> >> which might be why you aren't seeing the issue? hence the
>>>> >>> >> flakeyness)
>>>> >>> >>
>>>> >>> >> Patrick
>>>> >>> >>
>>>> >>> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer
>>>> >>> >> <sh...@gmail.com>
>>>> >>> >> wrote:
>>>> >>> >> > Hi Patrick, I'm not sure why you're seeing this - it
>>>> >>> >> > consistently
>>>> >>> passes
>>>> >>> >> on
>>>> >>> >> > my machine. In case you'd like to take a look, the test has tons
>>>> >>> >> > of
>>>> >>> >> > comments explaining the scenario. Let me know how I can help.
>>>> >>> >> >
>>>> >>> >> >
>>>> >>> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
>>>> >>> wrote:
>>>> >>> >> >
>>>> >>> >> >> Hi Alex, I've also seen the test
>>>> >>> >> >> "testLeaderTimesoutOnNewQuorum" fail
>>>> >>> >> >> multiple times (not every time, but ~50%, so flakey) in the
>>>> >>> >> >> last few
>>>> >>> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
>>>> >>> >> >> jenkins, I haven't see any other failures than this during the
>>>> >>> >> >> past
>>>> >>> >> >> few days).
>>>> >>> >> >>
>>>> >>> >> >> junit.framework.AssertionFailedError
>>>> >>> >> >> at
>>>> >>> >> >>
>>>> >>> >>
>>>> >>>
>>>> >>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>>>> >>> >> >> at
>>>> >>> >> >>
>>>> >>> >>
>>>> >>>
>>>> >>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>>>> >>> >> >> at
>>>> >>> >> >>
>>>> >>> >>
>>>> >>>
>>>> >>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>>> >>> >> >>
>>>> >>> >> >> Patrick
>>>> >>> >> >>
>>>> >>> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer
>>>> >>> >> >> <shralex@gmail.com
>>>> >>> >
>>>> >>> >> >> wrote:
>>>> >>> >> >> > Hi Rakesh,
>>>> >>> >> >> >
>>>> >>> >> >> > Thanks for looking at this. In general even if we find the
>>>> >>> >> >> > bug
>>>> >>> since
>>>> >>> >> we
>>>> >>> >> >> > should test it before committing a fix, it seems better to
>>>> >>> >> >> > remove
>>>> >>> the
>>>> >>> >> >> test
>>>> >>> >> >> > for now and debug this on a build machine. I'm trying to get
>>>> >>> access to
>>>> >>> >> >> it.
>>>> >>> >> >> >
>>>> >>> >> >> > Looking at this log:
>>>> >>> >> >> >
>>>> >>> >> >>
>>>> >>> >>
>>>> >>>
>>>> >>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>>>> >>> >> >> >
>>>> >>> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
>>>> >>> version
>>>> >>> >> >> 200000000
>>>> >>> >> >> > is already being sent around as committed!
>>>> >>> >> >> >
>>>> >>> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>>>> >>> >> >> >
>>>> >>> >>
>>>> >>> >> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
>>>> >>> ]
>>>> >>> >> >> > - 2 Received version: 200000000 my version: 0
>>>> >>> >> >> >
>>>> >>> >> >> >
>>>> >>> >> >> > and also in leader election messages.
>>>> >>> >> >> >
>>>> >>> >> >> > Also weird is that the version of 2 is 0 as if it is a
>>>> >>> >> >> > joiner,
>>>> >>> >> whereas we
>>>> >>> >> >> > explicitly started it with 100000000.
>>>> >>> >> >> > Then it makes sense that the new config can't be committed
>>>> >>> >> >> > since
>>>> >>> its
>>>> >>> >> >> > version is not high enough...
>>>> >>> >> >> >
>>>> >>> >> >> > I wonder if its possible that not all servers from the
>>>> >>> >> >> > previous
>>>> >>> test
>>>> >>> >> are
>>>> >>> >> >> > dead and they are interfering...
>>>> >>> >> >> >
>>>> >>> >> >> >
>>>> >>> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R
>>>> >>> >> >> > <ra...@huawei.com>
>>>> >>> wrote:
>>>> >>> >> >> >
>>>> >>> >> >> >> Hi Alex,
>>>> >>> >> >> >>
>>>> >>> >> >> >> Yeah it is consistently passing in my machine also.
>>>> >>> >> >> >>
>>>> >>> >> >> >>
>>>> >>> >> >> >> I have quickly gone through the
>>>> >>> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>>>> >>> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
>>>> >>> version)
>>>> >>> >> >> has
>>>> >>> >> >> >> not taken and still leader election is seeing 100000000
>>>> >>> >> >> >> (n.config
>>>> >>> >> >> version).
>>>> >>> >> >> >> Unfortunately I didn't find the reason for not considering
>>>> >>> >> >> >> the
>>>> >>> >> updated
>>>> >>> >> >> >> config version.
>>>> >>> >> >> >>
>>>> >>> >> >> >>
>>>> >>> >> >> >> Reference:
>>>> >>> >> >> >>
>>>> >>> >> >>
>>>> >>> >>
>>>> >>>
>>>> >>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>>>> >>> >> >> >>
>>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>>> >>> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922]
>>>> >>> >> >> >> -
>>>> >>> >> >> >> Notification time out: 51200
>>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>>>> >>> >> >> >> Notification:
>>>> >>> 2
>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>> >>> >> >> >> (n.zxid), 0x1
>>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>>> >>> LOOKING
>>>> >>> >> (my
>>>> >>> >> >> >> state)100000000 (n.config version)
>>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>>>> >>> >> >> >> Notification:
>>>> >>> 2
>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>> >>> >> >> >> (n.zxid), 0x1
>>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>>> >>> LOOKING
>>>> >>> >> (my
>>>> >>> >> >> >> state)100000000 (n.config version)
>>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>>>> >>> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922]
>>>> >>> >> >> >> -
>>>> >>> >> >> >> Notification time out: 51200
>>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>>>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>>>> >>> >> >> >> Notification:
>>>> >>> 2
>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>> >>> >> >> >> (n.zxid), 0x1
>>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>>> >>> LOOKING
>>>> >>> >> (my
>>>> >>> >> >> >> state)100000000 (n.config version)
>>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>>>> >>> >> >> >> Notification:
>>>> >>> 2
>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>> >>> >> >> >> (n.zxid), 0x1
>>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>>> >>> LOOKING
>>>> >>> >> (my
>>>> >>> >> >> >> state)100000000 (n.config version)
>>>> >>> >> >> >>
>>>> >>> >> >> >>
>>>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>>>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>>>> >>> >> >> >> Notification:
>>>> >>> 2
>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>> >>> >> >> >> (n.zxid), 0x1
>>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>>> >>> LOOKING
>>>> >>> >> (my
>>>> >>> >> >> >> state)100000000 (n.config version)
>>>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>>>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>>>> >>> >> >> >> Notification:
>>>> >>> 2
>>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>>> >>> >> >> >> (n.zxid), 0x1
>>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>>> >>> LOOKING
>>>> >>> >> (my
>>>> >>> >> >> >> state)100000000 (n.config version)
>>>> >>> >> >> >>
>>>> >>> >> >> >>
>>>> >>> >> >> >> -Rakesh
>>>> >>> >> >> >>
>>>> >>> >> >> >> -----Original Message-----
>>>> >>> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>>>> >>> >> >> >> Sent: 22 July 2014 11:57
>>>> >>> >> >> >> To: dev@zookeeper.apache.org
>>>> >>> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>>>> >>> >> >> >>
>>>> >>> >> >> >> I tried to look into it, but the test consistently passes
>>>> >>> >> >> >> locally
>>>> >>> on
>>>> >>> >> two
>>>> >>> >> >> >> machines.
>>>> >>> >> >> >> I don't currently have access to the build machine, but I
>>>> >>> >> >> >> can try
>>>> >>> to
>>>> >>> >> ask
>>>> >>> >> >> >> for access.
>>>> >>> >> >> >> Unless anyone has a better suggestion, we could remove the
>>>> >>> >> >> >> failing
>>>> >>> >> test
>>>> >>> >> >> in
>>>> >>> >> >> >> the meanwhile and open a JIRA to add it back...
>>>> >>> >> >> >>
>>>> >>> >> >> >>
>>>> >>> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt
>>>> >>> >> >> >> <ph...@apache.org>
>>>> >>> >> >> wrote:
>>>> >>> >> >> >>
>>>> >>> >> >> >> > I'm seeing alot of test failures in
>>>> >>> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone
>>>> >>> >> >> >> > take a
>>>> >>> >> look?
>>>> >>> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
>>>> >>> >> >> >> >
>>>> >>> >> >> >> >
>>>> >>> >> >> >> >
>>>> >>> >> >>
>>>> >>>
>>>> >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>>>> >>> >> >> >> >
>>>> >>> >>
>>>> >>> >> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>>>> >>> >> >> >> > tabpanel#comment-14069024
>>>> >>> >> >> >> >
>>>> >>> >> >> >> > Patrick
>>>> >>> >> >> >> >
>>>> >>> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>>>> >>> >> >> >> > <ra...@gmail.com> wrote:
>>>> >>> >> >> >> > > lgtm +1
>>>> >>> >> >> >> > >
>>>> >>> >> >> >> > >
>>>> >>> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>>>> >>> >> >> >> > > <fp...@yahoo.com.invalid>
>>>> >>> >> >> >> > wrote:
>>>> >>> >> >> >> > >
>>>> >>> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
>>>> >>> release, I
>>>> >>> >> >> >> > >> +think
>>>> >>> >> >> >> > 72
>>>> >>> >> >> >> > >> biz hours is enough for the vote.
>>>> >>> >> >> >> > >>
>>>> >>> >> >> >> > >> -Flavio
>>>> >>> >> >> >> > >>
>>>> >>> >> >> >> > >> > -----Original Message-----
>>>> >>> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>>>> >>> >> >> >> > >> > Sent: 21 July 2014 18:55
>>>> >>> >> >> >> > >> > To: DevZooKeeper
>>>> >>> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>>>> >>> >> >> >> > >> >
>>>> >>> >> >> >> > >> > I fixed a number of issues. I also started a few
>>>> >>> >> >> >> > >> > threads
>>>> >>> with
>>>> >>> >> >> >> > >> > builds@
>>>> >>> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and
>>>> >>> >> >> >> > >> > I
>>>> >>> worked
>>>> >>> >> >> >> > through a
>>>> >>> >> >> >> > >> > number of findbugs issues, it's not closed yet but
>>>> >>> >> >> >> > >> > it's
>>>> >>> pretty
>>>> >>> >> >> >> close.
>>>> >>> >> >> >> > >> >
>>>> >>> >> >> >> > >> > I don't see why we can't create an RC and start
>>>> >>> >> >> >> > >> > voting this
>>>> >>> >> week
>>>> >>> >> >> >> > though.
>>>> >>> >> >> >> > >> > Anyone disagree?
>>>> >>> >> >> >> > >> >
>>>> >>> >> >> >> > >> > How long should we let the vote run, the std 72 biz
>>>> >>> >> >> >> > >> > hours
>>>> >>> or
>>>> >>> >> >> >> > >> > should we
>>>> >>> >> >> >> > >> plan
>>>> >>> >> >> >> > >> > for more to allow folks more time to test?
>>>> >>> >> >> >> > >> >
>>>> >>> >> >> >> > >> > Patrick
>>>> >>> >> >> >> > >> >
>>>> >>> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez
>>>> >>> >> >> >> > >> > Segalés
>>>> >>> >> >> >> > >> > <rg...@itevenworks.net> wrote:
>>>> >>> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt
>>>> >>> >> >> >> > >> > > <ph...@apache.org>
>>>> >>> >> wrote:
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins
>>>> >>> >> >> >> > >> > >> ZK
>>>> >>> jobs -
>>>> >>> >> I'm
>>>> >>> >> >> >> > trying
>>>> >>> >> >> >> > >> > >> to fix some of the jobs that were broken during
>>>> >>> >> >> >> > >> > >> the
>>>> >>> recent
>>>> >>> >> >> >> > >> > >> host upgrade.
>>>> >>> >> >> >> > >> > >>
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > > How are things looking? Is it likely that we can
>>>> >>> >> >> >> > >> > > have a
>>>> >>> >> 3.5.0
>>>> >>> >> >> >> > >> > > alpha release week or are we still blocked on
>>>> >>> >> >> >> > >> > > Jenkins?
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > > -rgs
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >
>>>> >>> >> >> >> > >> > >> Patrick
>>>> >>> >> >> >> > >> > >>
>>>> >>> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>>>> >>> >> >> >> > >> > >> <mi...@cs.stanford.edu>
>>>> >>> >> >> >> > >> > >> wrote:
>>>> >>> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>>>> >>> >> >> >> > >> > >> >
>>>> >>> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander
>>>> >>> >> >> >> > >> > >> > Shraer
>>>> >>> >> >> >> > >> > >> > <sh...@gmail.com>
>>>> >>> >> >> >> > >> > >> wrote:
>>>> >>> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille
>>>> >>> >> >> >> > >> > >> >> gave a
>>>> >>> +1
>>>> >>> >> and
>>>> >>> >> >> >> > >> > >> >> all
>>>> >>> >> >> >> > >> > >> subsequent
>>>> >>> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>>>> >>> >> >> >> > >> > >> >>
>>>> >>> >> >> >> > >> > >> >>
>>>> >>> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>>>> >>> >> >> >> > >> > >> >> <phunt@apache.org
>>>> >>> >> >> >> > >
>>>> >>> >> >> >> > >> > wrote:
>>>> >>> >> >> >> > >> > >> >>
>>>> >>> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all
>>>> >>> >> >> >> > >> > >> >>> failing due
>>>> >>> to,
>>>> >>> >> >> >> > >> > >> >>> for
>>>> >>> >> >> >> > e.g.
>>>> >>> >> >> >> > >> > >> >>> findbugs issues. At the very least our
>>>> >>> >> >> >> > >> > >> >>> build/test/ci
>>>> >>> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok
>>>> >>> >> >> >> > >> > >> >>> (the
>>>> >>> recent
>>>> >>> >> >> >> > >> > >> >>> startServer fix
>>>> >>> >> >> >> > and
>>>> >>> >> >> >> > >> > >> >>> some other flakeys that have been addressed go
>>>> >>> >> >> >> > >> > >> >>> a
>>>> >>> long
>>>> >>> >> way
>>>> >>> >> >> >> > >> > >> >>> on
>>>> >>> >> >> >> > that
>>>> >>> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should
>>>> >>> >> >> >> > >> > >> >>> be
>>>> >>> >> cleaned
>>>> >>> >> >> >> > >> > >> >>> up before we cut a release. I started a
>>>> >>> >> >> >> > >> > >> >>> separate
>>>> >>> >> thread to
>>>> >>> >> >> >> > >> > >> >>> discuss
>>>> >>> >> >> >> > >> the
>>>> >>> >> >> >> > >> > findbugs issue.
>>>> >>> >> >> >> > >> > >> >>>
>>>> >>> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>>>> >>> >> >> >> > >> > >> >>>
>>>> >>> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul
>>>> >>> >> >> >> > >> > >> >>> on
>>>> >>> 1919?
>>>> >>> >> >> >> > >> > >> >>>
>>>> >>> >> >> >> > >> > >> >>> Patrick
>>>> >>> >> >> >> > >> > >> >>>
>>>> >>> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio
>>>> >>> >> >> >> > >> > >> >>> Junqueira
>>>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>>>> >>> >> >> >> > >> > >> >>> > My take:
>>>> >>> >> >> >> > >> > >> >>> >
>>>> >>> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker
>>>> >>> >> >> >> > >> > >> >>> > and
>>>> >>> it
>>>> >>> >> can
>>>> >>> >> >> >> > >> > >> >>> > go
>>>> >>> >> >> >> > in.
>>>> >>> >> >> >> > >> > >> >>> > See
>>>> >>> >> >> >> > >> > >> the
>>>> >>> >> >> >> > >> > >> >>> jira for comments.
>>>> >>> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the
>>>> >>> >> >> >> > >> > >> >>> > first
>>>> >>> alpha.
>>>> >>> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha
>>>> >>> >> >> >> > >> > >> >>> > depending on
>>>> >>> >> >> >> > >> > >> >>> > ZK-1919
>>>> >>> >> >> >> > and
>>>> >>> >> >> >> > >> > >> ZK-1910,
>>>> >>> >> >> >> > >> > >> >>> we can leave it for the second alpha.
>>>> >>> >> >> >> > >> > >> >>> >
>>>> >>> >> >> >> > >> > >> >>> > If you agree with this, then we should be
>>>> >>> >> >> >> > >> > >> >>> > able to
>>>> >>> >> cut a
>>>> >>> >> >> >> > >> > >> >>> > candidate by
>>>> >>> >> >> >> > >> > >> the
>>>> >>> >> >> >> > >> > >> >>> end of this week.
>>>> >>> >> >> >> > >> > >> >>> >
>>>> >>> >> >> >> > >> > >> >>> > -Flavio
>>>> >>> >> >> >> > >> > >> >>> >
>>>> >>> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>>>> >>> >> >> >> > >> > >> >>> > <ph...@apache.org>
>>>> >>> >> >> >> > >> wrote:
>>>> >>> >> >> >> > >> > >> >>> >
>>>> >>> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c
>>>> >>> >> >> >> > >> > >> >>> >> client
>>>> >>> >> test
>>>> >>> >> >> >> > >> > >> >>> >> log output
>>>> >>> >> >> >> > >> > >> here
>>>> >>> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
>>>> >>> >> >> >> > >> > >> >>> >>
>>>> >>> >> >> >> > >> > >> >>>
>>>> >>> >> >> >> > >> > >>
>>>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>>>> >>> >> >> >> > >> > >> r-
>>>> >>> >> >> >> > >> > trunk
>>>> >>> >> >> >> > >> > >> /2372/
>>>> >>> >> >> >> > >> > >> >>> >>
>>>> >>> >> >> >> > >> > >> >>> >> Patrick
>>>> >>> >> >> >> > >> > >> >>> >>
>>>> >>> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick
>>>> >>> >> >> >> > >> > >> >>> >> Hunt
>>>> >>> >> >> >> > >> > >> >>> >> <ph...@apache.org>
>>>> >>> >> >> >> > >> > >> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0
>>>> >>> >> >> >> > >> > >> >>> >>> (not
>>>> >>> >> clear
>>>> >>> >> >> >> > >> > >> >>> >>> to me which
>>>> >>> >> >> >> > >> > >> >>> >>> one(s?) is new?)
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported
>>>> >>> >> >> >> > >> > >> >>> >>> is
>>>> >>> hitting
>>>> >>> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as
>>>> >>> >> >> >> > >> > >> >>> >>> well.
>>>> >>> I've
>>>> >>> >> >> >> > >> > >> >>> >>> updated the "archive" list
>>>> >>> >> >> >> > >> > >> to
>>>> >>> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So
>>>> >>> >> >> >> > >> > >> >>> >>> while it
>>>> >>> >> won't
>>>> >>> >> >> >> > >> > >> >>> >>> go
>>>> >>> >> >> >> > to
>>>> >>> >> >> >> > >> > >> console
>>>> >>> >> >> >> > >> > >> >>> >>> at least we can debug when there is a
>>>> >>> >> >> >> > >> > >> >>> >>> failure.
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews
>>>> >>> >> >> >> > >> > >> >>> >>> for the
>>>> >>> >> jetty
>>>> >>> >> >> >> > server
>>>> >>> >> >> >> > >> > >> support
>>>> >>> >> >> >> > >> > >> >>> >>> and it looks like that should be ready
>>>> >>> >> >> >> > >> > >> >>> >>> soon.
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>> Raul also requested that someone
>>>> >>> >> >> >> > >> > >> >>> >>> prioritize
>>>> >>> >> reviewing
>>>> >>> >> >> >> > >> > >> "ZOOKEEPER-1919
>>>> >>> >> >> >> > >> > >> >>> >>> Update the C implementation of
>>>> >>> >> >> >> > >> > >> >>> >>> removeWatches to
>>>> >>> >> have
>>>> >>> >> >> >> > >> > >> >>> >>> it
>>>> >>> >> >> >> > >> > match
>>>> >>> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it
>>>> >>> >> >> >> > >> > >> >>> >>> in
>>>> >>> 3.5.0.
>>>> >>> >> >> >> > >> Flavio/Michi?
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the
>>>> >>> >> >> >> > >> > >> >>> >>> flakey c
>>>> >>> >> client
>>>> >>> >> >> >> > >> > >> >>> >>> reconfig
>>>> >>> >> >> >> > >> > >> test -
>>>> >>> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test
>>>> >>> >> >> >> > >> > >> >>> >>> infra!
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like
>>>> >>> >> >> >> > >> > >> >>> >>> we're
>>>> >>> >> pretty
>>>> >>> >> >> >> > close.
>>>> >>> >> >> >> > >> > >> >>> >>> Do
>>>> >>> >> >> >> > >> > >> folks
>>>> >>> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at
>>>> >>> >> >> >> > >> > >> >>> >>> this
>>>> >>> point?
>>>> >>> >> >> >> > >> > >> >>> >>> (with a few
>>>> >>> >> >> >> > >> > >> pending
>>>> >>> >> >> >> > >> > >> >>> >>> as above)
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>> Patrick
>>>> >>> >> >> >> > >> > >> >>> >>>
>>>> >>> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl
>>>> >>> >> >> >> > >> > >> >>> >>> Gutiérrez
>>>> >>> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio
>>>> >>> >> >> >> > >> > >> >>> >>>> Junqueira"
>>>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>>>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much,
>>>> >>> >> >> >> > >> > >> >>> >>>>> what if
>>>> >>> we
>>>> >>> >> >> >> > >> > >> >>> >>>>> release
>>>> >>> >> >> >> > an
>>>> >>> >> >> >> > >> > >> >>> >>>>> alpha
>>>> >>> >> >> >> > >> > >> >>> version
>>>> >>> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one
>>>> >>> >> >> >> > >> > >> >>> >>>> in
>>>> >>> 2-3
>>>> >>> >> >> >> > >> > >> >>> >>>> weeks
>>>> >>> >> >> >> > time?
>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>> >>> >> >> >> > >> > >> >>> >>>> +1
>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>> >>> >> >> >> > >> > >> >>> >>>> -rgs
>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>> -Flavio
>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
>>>> >>> Gutiérrez
>>>> >>> >> >> >> > Segalés <
>>>> >>> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>>>> >>> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>>>> >>> >> >> >> > >> > wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> 5.1.0
>>>> >>> >> (from 8
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> in
>>>> >>> >> >> >> > the
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> last
>>>> >>> >> >> >> > >> > >> >>> check).
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Michi, and
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Camille is
>>>> >>> >> >> >> > >> > >> threatening
>>>> >>> >> >> >> > >> > >> >>> to
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> in
>>>> >>> >> general
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> on
>>>> >>> >> >> >> > the
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> to
>>>> >>> see.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> consider -
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> should we drop
>>>> >>> >> >> >> > >> > >> jdk6
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> by
>>>> >>> Oracle
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> but I suspect
>>>> >>> >> >> >> > >> > >> some
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> gotta
>>>> >>> move
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> forward though,
>>>> >>> >> >> >> > >> > >> >>> can't
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> that we
>>>> >>> are
>>>> >>> >> >> >> > currently
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> and
>>>> >>> 8.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
>>>> >>> >> >> >> > >> > >> >>> >>>>
>>>> >>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>>> >>> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone
>>>> >>> >> >> >> > >> > >> >>> >>>>>> using
>>>> >>> >> >> >> > >> > >> >>> >>>>>> Observers with the
>>>> >>> >> >> >> > >> > >> >>> upcoming
>>>> >>> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network
>>>> >>> >> >> >> > >> > >> >>> >>>>>> usage go
>>>> >>> >> >> >> wild...).
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>> -rgs
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> Patrick
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
>>>> >>> >> Junqueira
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
>>>> >>> already,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
>>>> >>> >> >> >> > >> > >> >>> >>>> there. I
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> LETest
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> failed in the
>>>> >>> >> >> >> > >> > >> last QA
>>>> >>> >> >> >> > >> > >> >>> run
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> affect
>>>> >>> >> LETest,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> and
>>>> >>> >> >> >> > in
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> fact
>>>> >>> >> >> >> > >> > >> it
>>>> >>> >> >> >> > >> > >> >>> fails
>>>> >>> >> >> >> > >> > >> >>> >>>> in
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> failure
>>>> >>> >> doesn't
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> seem
>>>> >>> >> >> >> > to
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> be
>>>> >>> >> >> >> > >> > >> >>> related
>>>> >>> >> >> >> > >> > >> >>> >>>> to the
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> say
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
>>>> >>> >> >> >> > >> > >> about
>>>> >>> >> >> >> > >> > >> >>> it.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>> Patrick
>>>> >>> >> Hunt <
>>>> >>> >> >> >> > >> > >> phunt@apache.org>
>>>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> can
>>>> >>> get
>>>> >>> >> >> >> > >> > >> reviewed/finalized
>>>> >>> >> >> >> > >> > >> >>> before
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
>>>> >>> cutting
>>>> >>> >> an
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> RC
>>>> >>> >> >> >> > >> > soonish...
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Flavio
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> <fp...@yahoo.com.invalid>
>>>> >>> >> >> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
>>>> >>> versions.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810)
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> and
>>>> >>> >> ZK-1863
>>>> >>> >> >> in.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> They are
>>>> >>> >> >> >> > >> > >> both
>>>> >>> >> >> >> > >> > >> >>> >>>> patch
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> is
>>>> >>> still
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> open because we
>>>> >>> >> >> >> > >> > >> >>> need a
>>>> >>> >> >> >> > >> > >> >>> >>>> 3.4
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> Hunt
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>>>> >>> >> >> >> > >> > >> >>> wrote:
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> about it
>>>> >>> for
>>>> >>> >> a
>>>> >>> >> >> >> > while, a
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> few
>>>> >>> >> >> >> > >> > >> >>> people
>>>> >>> >> >> >> > >> > >> >>> >>>> have
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
>>>> >>> contacted
>>>> >>> >> me
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
>>>> >>> >> >> >> > >> > >> that
>>>> >>> >> >> >> > >> > >> >>> they
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>>> >>> >> first
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
>>>> >>> >> >> >> > >> > release.
>>>> >>> >> >> >> > >> > >> Every
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> wait for
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>>>> >>> >> >> >> > >> > >> never
>>>> >>> >> >> >> > >> > >> >>> get
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> tons of
>>>> >>> >> great
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
>>>> >>> >> >> >> > >> > >> >>> lots of
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
>>>> >>> release so
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
>>>> >>> >> >> >> > >> > >> use
>>>> >>> >> >> >> > >> > >> >>> it,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> stable
>>>> >>> except
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
>>>> >>> >> >> >> > >> > >> >>> flakey
>>>> >>> >> >> >> > >> > >> >>> >>>> test
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> committed
>>>> >>> >> today to
>>>> >>> >> >> >> > >> > trunk.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
>>>> >>> >> >> >> > >> > >> that
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>>> >>> code on
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
>>>> >>> >> >> >> > and
>>>> >>> >> >> >> > >> > jdk8.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
>>>> >>> should
>>>> >>> >> plan
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> our
>>>> >>> >> >> >> > >> > >> releases:
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> a
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>>>> >>> >> >> >> > for
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> some
>>>> >>> >> >> >> > >> > >> time.
>>>> >>> >> >> >> > >> > >> >>> >>>> What I
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> similar to
>>>> >>> >> what
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> we
>>>> >>> >> >> >> > did
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for
>>>> >>> >> >> >> > >> > >> 3.4.
>>>> >>> >> >> >> > >> > >> >>> >>>> (this is
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> during
>>>> >>> >> their
>>>> >>> >> >> >> > Hadoop 2
>>>> >>> >> >> >> > >> > >> release
>>>> >>> >> >> >> > >> > >> >>> >>>> cycle)
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> releases,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
>>>> >>> >> >> >> > >> > >> can run
>>>> >>> >> >> >> > >> > >> >>> >>>> and
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
>>>> >>> blockers
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> and
>>>> >>> >> >> >> > feel
>>>> >>> >> >> >> > >> > >> >>> comfortable
>>>> >>> >> >> >> > >> > >> >>> >>>> with
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
>>>> >>> switch
>>>> >>> >> to
>>>> >>> >> >> >> beta.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
>>>> >>> >> >> >> > >> > >> get
>>>> >>> >> >> >> > >> > >> >>> >>>> some
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> alpha/beta
>>>> >>> >> moniker
>>>> >>> >> >> >> > >> > and
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
>>>> >>> >> >> >> > >> > >> >>> making
>>>> >>> >> >> >> > >> > >> >>> >>>> it
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> will
>>>> >>> >> become
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>>> >>> >> >> >> > >> > >> >>> "current/stable"
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.1-alpha (3
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
>>>> >>> >> 3.5.3-beta
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.5-beta
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> alpha/beta
>>>> >>> but
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
>>>> >>> >> >> >> > >> > >> "stable" vs
>>>> >>> >> >> >> > >> > >> >>> >>>> 3.4.x,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> still
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
>>>> >>> >> >> >> > >> > >> shake
>>>> >>> >> >> >> > >> > >> >>> >>>> out)
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> releases
>>>> >>> for
>>>> >>> >> >> >> > production
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
>>>> >>> >> >> >> > >> > >> >>> stable,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>> etc...
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently,
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> are any
>>>> >>> of
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
>>>> >>> >> >> >> > >> > >> that
>>>> >>> >> >> >> > >> > >> >>> >>>> should
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> a
>>>> >>> couple
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>>>> >>> >> >> >> > >> > >> find
>>>> >>> >> >> >> > >> > >> >>> >>>> this a
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ball
>>>> >>> >> rolling to
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
>>>> >>> >> >> >> > an
>>>> >>> >> >> >> > >> RC.
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>>> >>> >> >> >> > >> > >> >>> >
>>>> >>> >> >> >> > >> > >> >>>
>>>> >>> >> >> >> > >> > >>
>>>> >>> >> >> >> > >>
>>>> >>> >> >> >> > >>
>>>> >>> >> >> >> >
>>>> >>> >> >> >>
>>>> >>> >> >>
>>>> >>> >>
>>>> >>>
>>>
>>>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Thanks Alex. I've created a jira for this:
https://issues.apache.org/jira/browse/ZOOKEEPER-1984 Let's discuss
further there.

I will try the patch on my jenkins box later today.

Thanks!

Patrick

On Tue, Jul 22, 2014 at 2:07 PM, Alexander Shraer <sh...@gmail.com> wrote:
> Actually if servers 1 and 3 are talking and 3 is elected and not 1, it means
> that 3 also saw the reconfig. So it should also complete it when it reboots.
> To debug this I suggest to print out the last seen config in the beginning
> of leader.lead().
>
> Is it possible that writing the .next file to disk fails ?
>
> Alternatively we could just remove this part of the test (attached patch) -
> the test's goal is to check that the leader times out when it looses a
> quorum of the new config, and the part of the test that fails now is not
> needed to check that. There are other tests in ReconfigRecoveryTest that are
> supposed to check recovery.
>
>
>
> On Tue, Jul 22, 2014 at 1:07 PM, Alexander Shraer <sh...@gmail.com> wrote:
>>
>> yep, I think what happens is that server 3 is becoming leader and not
>> server 1, so its not completing the reconfig. Let me think about how to
>> solve this...
>>
>>
>> On Tue, Jul 22, 2014 at 12:21 PM, Patrick Hunt <ph...@apache.org> wrote:
>>>
>>> Also if you want to submit a patch that provides more insight (logs)
>>> for that operation/test lmk and I'll be happy to review/commit it.
>>> Should help with debugging the issue and debugging in the field.
>>>
>>> Thanks!
>>>
>>> Patrick
>>>
>>> On Tue, Jul 22, 2014 at 12:17 PM, Patrick Hunt <ph...@apache.org> wrote:
>>> > Here's the logs (attached) for the test that failed. Nothing stuck out
>>> > at me - anything ring a bell?
>>> >
>>> > Patrick
>>> >
>>> > On Tue, Jul 22, 2014 at 12:10 PM, Alexander Shraer <sh...@gmail.com>
>>> > wrote:
>>> >> Unfortunately doesn't look like we have enough logging going on there.
>>> >> For example would be nice to know what's the committed config and last
>>> >> seen
>>> >> config
>>> >> of the leader when it comes up (leader.lead()). and what configuration
>>> >> is
>>> >> sent in the NEWLEADER message
>>> >> sent out in LeaderHandler:
>>> >>
>>> >>                 QuorumPacket newLeaderQP = new
>>> >> QuorumPacket(Leader.NEWLEADER,
>>> >>                         newLeaderZxid,
>>> >> leader.self.getLastSeenQuorumVerifier()
>>> >>                                 .toString().getBytes(), null);
>>> >>
>>> >>
>>> >> I didn't know about the option to have a separate administrative
>>> >> interface,
>>> >> and just followed the flow of other commands... I agree that it would
>>> >> be
>>> >> cleaner.
>>> >>
>>> >>
>>> >>
>>> >>
>>> >> On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org>
>>> >> wrote:
>>> >>
>>> >>> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer
>>> >>> <sh...@gmail.com>
>>> >>> wrote:
>>> >>> > Hmm. It doesn't really make sense to me - the reconfig should be
>>> >>> completed
>>> >>> > before
>>> >>> > the servers come up and process new ops. We submitted the reconfig
>>> >>> > to
>>> >>> > server 1, it timed out
>>> >>> > on new quorum, but when 1 becomes leader again after 2 restarts 1
>>> >>> > should
>>> >>> > complete the reconfig.
>>> >>> > is 1 becoming leader after 2 restarts ?
>>> >>> >
>>> >>>
>>> >>> What should I look for in the logs? Any specific log messages that
>>> >>> would help debug?
>>> >>>
>>> >>> > About admin controls - reconfig/getConfig are open to everyone,
>>> >>> > unless
>>> >>> you
>>> >>> > set permissions on the configuration znode being written during
>>> >>> > reconfig.
>>> >>> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
>>> >>> >
>>> >>> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
>>> >>> > request.authInfo);
>>> >>> >
>>> >>>
>>> >>> So I can turn off all access then? (read and write). Should we ship
>>> >>> that as the default? We should add that to the docs.
>>> >>>
>>> >>> In the past we've always tried to hide this type of information from
>>> >>> clients (e.g. we don't expose the zk server address to the client for
>>> >>> a session). This seems like a very big departure. Why didn't we move
>>> >>> it to a separate, administrative, interface?
>>> >>>
>>> >>> Patrick
>>> >>>
>>> >>> >
>>> >>> >
>>> >>> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com>
>>> >>> > wrote:
>>> >>> >
>>> >>> >> Looks like 3 hasn't been removed (unfortunately the assertion
>>> >>> >> doesn't
>>> >>> >> include any msg detail, but that's the way it looks to me like the
>>> >>> >> test is setup):
>>> >>> >>
>>> >>> >>         if (leavingServers != null) {
>>> >>> >>             for (String leaving : leavingServers)
>>> >>> >>
>>> >>> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>>> >>> >>         }
>>> >>> >>
>>> >>> >> which is called from:
>>> >>> >>
>>> >>> >>         qu.restart(2);
>>> >>> >>         // Now that 2 is back up, they'll complete the reconfig
>>> >>> removing 3
>>> >>> >> and
>>> >>> >>         // can process other ops.
>>> >>> >>         testServerHasConfig(zkArr[1], null, leavingServers);
>>> >>> >>
>>> >>> >> It seems like the problem is that testServerHasConfig is not
>>> >>> >> waiting
>>> >>> >> for the configuration to be updated? In this case 2 was just
>>> >>> >> restarted
>>> >>> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
>>> >>> >> which might be why you aren't seeing the issue? hence the
>>> >>> >> flakeyness)
>>> >>> >>
>>> >>> >> Patrick
>>> >>> >>
>>> >>> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer
>>> >>> >> <sh...@gmail.com>
>>> >>> >> wrote:
>>> >>> >> > Hi Patrick, I'm not sure why you're seeing this - it
>>> >>> >> > consistently
>>> >>> passes
>>> >>> >> on
>>> >>> >> > my machine. In case you'd like to take a look, the test has tons
>>> >>> >> > of
>>> >>> >> > comments explaining the scenario. Let me know how I can help.
>>> >>> >> >
>>> >>> >> >
>>> >>> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
>>> >>> wrote:
>>> >>> >> >
>>> >>> >> >> Hi Alex, I've also seen the test
>>> >>> >> >> "testLeaderTimesoutOnNewQuorum" fail
>>> >>> >> >> multiple times (not every time, but ~50%, so flakey) in the
>>> >>> >> >> last few
>>> >>> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
>>> >>> >> >> jenkins, I haven't see any other failures than this during the
>>> >>> >> >> past
>>> >>> >> >> few days).
>>> >>> >> >>
>>> >>> >> >> junit.framework.AssertionFailedError
>>> >>> >> >> at
>>> >>> >> >>
>>> >>> >>
>>> >>>
>>> >>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>>> >>> >> >> at
>>> >>> >> >>
>>> >>> >>
>>> >>>
>>> >>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>>> >>> >> >> at
>>> >>> >> >>
>>> >>> >>
>>> >>>
>>> >>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>> >>> >> >>
>>> >>> >> >> Patrick
>>> >>> >> >>
>>> >>> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer
>>> >>> >> >> <shralex@gmail.com
>>> >>> >
>>> >>> >> >> wrote:
>>> >>> >> >> > Hi Rakesh,
>>> >>> >> >> >
>>> >>> >> >> > Thanks for looking at this. In general even if we find the
>>> >>> >> >> > bug
>>> >>> since
>>> >>> >> we
>>> >>> >> >> > should test it before committing a fix, it seems better to
>>> >>> >> >> > remove
>>> >>> the
>>> >>> >> >> test
>>> >>> >> >> > for now and debug this on a build machine. I'm trying to get
>>> >>> access to
>>> >>> >> >> it.
>>> >>> >> >> >
>>> >>> >> >> > Looking at this log:
>>> >>> >> >> >
>>> >>> >> >>
>>> >>> >>
>>> >>>
>>> >>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>>> >>> >> >> >
>>> >>> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
>>> >>> version
>>> >>> >> >> 200000000
>>> >>> >> >> > is already being sent around as committed!
>>> >>> >> >> >
>>> >>> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>>> >>> >> >> >
>>> >>> >>
>>> >>> >> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
>>> >>> ]
>>> >>> >> >> > - 2 Received version: 200000000 my version: 0
>>> >>> >> >> >
>>> >>> >> >> >
>>> >>> >> >> > and also in leader election messages.
>>> >>> >> >> >
>>> >>> >> >> > Also weird is that the version of 2 is 0 as if it is a
>>> >>> >> >> > joiner,
>>> >>> >> whereas we
>>> >>> >> >> > explicitly started it with 100000000.
>>> >>> >> >> > Then it makes sense that the new config can't be committed
>>> >>> >> >> > since
>>> >>> its
>>> >>> >> >> > version is not high enough...
>>> >>> >> >> >
>>> >>> >> >> > I wonder if its possible that not all servers from the
>>> >>> >> >> > previous
>>> >>> test
>>> >>> >> are
>>> >>> >> >> > dead and they are interfering...
>>> >>> >> >> >
>>> >>> >> >> >
>>> >>> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R
>>> >>> >> >> > <ra...@huawei.com>
>>> >>> wrote:
>>> >>> >> >> >
>>> >>> >> >> >> Hi Alex,
>>> >>> >> >> >>
>>> >>> >> >> >> Yeah it is consistently passing in my machine also.
>>> >>> >> >> >>
>>> >>> >> >> >>
>>> >>> >> >> >> I have quickly gone through the
>>> >>> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>>> >>> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
>>> >>> version)
>>> >>> >> >> has
>>> >>> >> >> >> not taken and still leader election is seeing 100000000
>>> >>> >> >> >> (n.config
>>> >>> >> >> version).
>>> >>> >> >> >> Unfortunately I didn't find the reason for not considering
>>> >>> >> >> >> the
>>> >>> >> updated
>>> >>> >> >> >> config version.
>>> >>> >> >> >>
>>> >>> >> >> >>
>>> >>> >> >> >> Reference:
>>> >>> >> >> >>
>>> >>> >> >>
>>> >>> >>
>>> >>>
>>> >>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>>> >>> >> >> >>
>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>> >>> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922]
>>> >>> >> >> >> -
>>> >>> >> >> >> Notification time out: 51200
>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>>> >>> >> >> >> Notification:
>>> >>> 2
>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>> >>> >> >> >> (n.zxid), 0x1
>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>> >>> LOOKING
>>> >>> >> (my
>>> >>> >> >> >> state)100000000 (n.config version)
>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>>> >>> >> >> >> Notification:
>>> >>> 2
>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>> >>> >> >> >> (n.zxid), 0x1
>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>> >>> LOOKING
>>> >>> >> (my
>>> >>> >> >> >> state)100000000 (n.config version)
>>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>>> >>> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922]
>>> >>> >> >> >> -
>>> >>> >> >> >> Notification time out: 51200
>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>>> >>> >> >> >> Notification:
>>> >>> 2
>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>> >>> >> >> >> (n.zxid), 0x1
>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>> >>> LOOKING
>>> >>> >> (my
>>> >>> >> >> >> state)100000000 (n.config version)
>>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>>> >>> >> >> >> Notification:
>>> >>> 2
>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>> >>> >> >> >> (n.zxid), 0x1
>>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>> >>> LOOKING
>>> >>> >> (my
>>> >>> >> >> >> state)100000000 (n.config version)
>>> >>> >> >> >>
>>> >>> >> >> >>
>>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>>> >>> >> >> >> Notification:
>>> >>> 2
>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>> >>> >> >> >> (n.zxid), 0x1
>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>> >>> LOOKING
>>> >>> >> (my
>>> >>> >> >> >> state)100000000 (n.config version)
>>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>>> >>> >> >> >> Notification:
>>> >>> 2
>>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>>> >>> >> >> >> (n.zxid), 0x1
>>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>> >>> LOOKING
>>> >>> >> (my
>>> >>> >> >> >> state)100000000 (n.config version)
>>> >>> >> >> >>
>>> >>> >> >> >>
>>> >>> >> >> >> -Rakesh
>>> >>> >> >> >>
>>> >>> >> >> >> -----Original Message-----
>>> >>> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>>> >>> >> >> >> Sent: 22 July 2014 11:57
>>> >>> >> >> >> To: dev@zookeeper.apache.org
>>> >>> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>>> >>> >> >> >>
>>> >>> >> >> >> I tried to look into it, but the test consistently passes
>>> >>> >> >> >> locally
>>> >>> on
>>> >>> >> two
>>> >>> >> >> >> machines.
>>> >>> >> >> >> I don't currently have access to the build machine, but I
>>> >>> >> >> >> can try
>>> >>> to
>>> >>> >> ask
>>> >>> >> >> >> for access.
>>> >>> >> >> >> Unless anyone has a better suggestion, we could remove the
>>> >>> >> >> >> failing
>>> >>> >> test
>>> >>> >> >> in
>>> >>> >> >> >> the meanwhile and open a JIRA to add it back...
>>> >>> >> >> >>
>>> >>> >> >> >>
>>> >>> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt
>>> >>> >> >> >> <ph...@apache.org>
>>> >>> >> >> wrote:
>>> >>> >> >> >>
>>> >>> >> >> >> > I'm seeing alot of test failures in
>>> >>> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone
>>> >>> >> >> >> > take a
>>> >>> >> look?
>>> >>> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
>>> >>> >> >> >> >
>>> >>> >> >> >> >
>>> >>> >> >> >> >
>>> >>> >> >>
>>> >>>
>>> >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>>> >>> >> >> >> >
>>> >>> >>
>>> >>> >> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>>> >>> >> >> >> > tabpanel#comment-14069024
>>> >>> >> >> >> >
>>> >>> >> >> >> > Patrick
>>> >>> >> >> >> >
>>> >>> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>>> >>> >> >> >> > <ra...@gmail.com> wrote:
>>> >>> >> >> >> > > lgtm +1
>>> >>> >> >> >> > >
>>> >>> >> >> >> > >
>>> >>> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>>> >>> >> >> >> > > <fp...@yahoo.com.invalid>
>>> >>> >> >> >> > wrote:
>>> >>> >> >> >> > >
>>> >>> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
>>> >>> release, I
>>> >>> >> >> >> > >> +think
>>> >>> >> >> >> > 72
>>> >>> >> >> >> > >> biz hours is enough for the vote.
>>> >>> >> >> >> > >>
>>> >>> >> >> >> > >> -Flavio
>>> >>> >> >> >> > >>
>>> >>> >> >> >> > >> > -----Original Message-----
>>> >>> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>>> >>> >> >> >> > >> > Sent: 21 July 2014 18:55
>>> >>> >> >> >> > >> > To: DevZooKeeper
>>> >>> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>>> >>> >> >> >> > >> >
>>> >>> >> >> >> > >> > I fixed a number of issues. I also started a few
>>> >>> >> >> >> > >> > threads
>>> >>> with
>>> >>> >> >> >> > >> > builds@
>>> >>> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and
>>> >>> >> >> >> > >> > I
>>> >>> worked
>>> >>> >> >> >> > through a
>>> >>> >> >> >> > >> > number of findbugs issues, it's not closed yet but
>>> >>> >> >> >> > >> > it's
>>> >>> pretty
>>> >>> >> >> >> close.
>>> >>> >> >> >> > >> >
>>> >>> >> >> >> > >> > I don't see why we can't create an RC and start
>>> >>> >> >> >> > >> > voting this
>>> >>> >> week
>>> >>> >> >> >> > though.
>>> >>> >> >> >> > >> > Anyone disagree?
>>> >>> >> >> >> > >> >
>>> >>> >> >> >> > >> > How long should we let the vote run, the std 72 biz
>>> >>> >> >> >> > >> > hours
>>> >>> or
>>> >>> >> >> >> > >> > should we
>>> >>> >> >> >> > >> plan
>>> >>> >> >> >> > >> > for more to allow folks more time to test?
>>> >>> >> >> >> > >> >
>>> >>> >> >> >> > >> > Patrick
>>> >>> >> >> >> > >> >
>>> >>> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez
>>> >>> >> >> >> > >> > Segalés
>>> >>> >> >> >> > >> > <rg...@itevenworks.net> wrote:
>>> >>> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt
>>> >>> >> >> >> > >> > > <ph...@apache.org>
>>> >>> >> wrote:
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins
>>> >>> >> >> >> > >> > >> ZK
>>> >>> jobs -
>>> >>> >> I'm
>>> >>> >> >> >> > trying
>>> >>> >> >> >> > >> > >> to fix some of the jobs that were broken during
>>> >>> >> >> >> > >> > >> the
>>> >>> recent
>>> >>> >> >> >> > >> > >> host upgrade.
>>> >>> >> >> >> > >> > >>
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > > How are things looking? Is it likely that we can
>>> >>> >> >> >> > >> > > have a
>>> >>> >> 3.5.0
>>> >>> >> >> >> > >> > > alpha release week or are we still blocked on
>>> >>> >> >> >> > >> > > Jenkins?
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > > -rgs
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >
>>> >>> >> >> >> > >> > >> Patrick
>>> >>> >> >> >> > >> > >>
>>> >>> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>>> >>> >> >> >> > >> > >> <mi...@cs.stanford.edu>
>>> >>> >> >> >> > >> > >> wrote:
>>> >>> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>>> >>> >> >> >> > >> > >> >
>>> >>> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander
>>> >>> >> >> >> > >> > >> > Shraer
>>> >>> >> >> >> > >> > >> > <sh...@gmail.com>
>>> >>> >> >> >> > >> > >> wrote:
>>> >>> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille
>>> >>> >> >> >> > >> > >> >> gave a
>>> >>> +1
>>> >>> >> and
>>> >>> >> >> >> > >> > >> >> all
>>> >>> >> >> >> > >> > >> subsequent
>>> >>> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>>> >>> >> >> >> > >> > >> >>
>>> >>> >> >> >> > >> > >> >>
>>> >>> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>>> >>> >> >> >> > >> > >> >> <phunt@apache.org
>>> >>> >> >> >> > >
>>> >>> >> >> >> > >> > wrote:
>>> >>> >> >> >> > >> > >> >>
>>> >>> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all
>>> >>> >> >> >> > >> > >> >>> failing due
>>> >>> to,
>>> >>> >> >> >> > >> > >> >>> for
>>> >>> >> >> >> > e.g.
>>> >>> >> >> >> > >> > >> >>> findbugs issues. At the very least our
>>> >>> >> >> >> > >> > >> >>> build/test/ci
>>> >>> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok
>>> >>> >> >> >> > >> > >> >>> (the
>>> >>> recent
>>> >>> >> >> >> > >> > >> >>> startServer fix
>>> >>> >> >> >> > and
>>> >>> >> >> >> > >> > >> >>> some other flakeys that have been addressed go
>>> >>> >> >> >> > >> > >> >>> a
>>> >>> long
>>> >>> >> way
>>> >>> >> >> >> > >> > >> >>> on
>>> >>> >> >> >> > that
>>> >>> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should
>>> >>> >> >> >> > >> > >> >>> be
>>> >>> >> cleaned
>>> >>> >> >> >> > >> > >> >>> up before we cut a release. I started a
>>> >>> >> >> >> > >> > >> >>> separate
>>> >>> >> thread to
>>> >>> >> >> >> > >> > >> >>> discuss
>>> >>> >> >> >> > >> the
>>> >>> >> >> >> > >> > findbugs issue.
>>> >>> >> >> >> > >> > >> >>>
>>> >>> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>>> >>> >> >> >> > >> > >> >>>
>>> >>> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul
>>> >>> >> >> >> > >> > >> >>> on
>>> >>> 1919?
>>> >>> >> >> >> > >> > >> >>>
>>> >>> >> >> >> > >> > >> >>> Patrick
>>> >>> >> >> >> > >> > >> >>>
>>> >>> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio
>>> >>> >> >> >> > >> > >> >>> Junqueira
>>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>>> >>> >> >> >> > >> > >> >>> > My take:
>>> >>> >> >> >> > >> > >> >>> >
>>> >>> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker
>>> >>> >> >> >> > >> > >> >>> > and
>>> >>> it
>>> >>> >> can
>>> >>> >> >> >> > >> > >> >>> > go
>>> >>> >> >> >> > in.
>>> >>> >> >> >> > >> > >> >>> > See
>>> >>> >> >> >> > >> > >> the
>>> >>> >> >> >> > >> > >> >>> jira for comments.
>>> >>> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the
>>> >>> >> >> >> > >> > >> >>> > first
>>> >>> alpha.
>>> >>> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha
>>> >>> >> >> >> > >> > >> >>> > depending on
>>> >>> >> >> >> > >> > >> >>> > ZK-1919
>>> >>> >> >> >> > and
>>> >>> >> >> >> > >> > >> ZK-1910,
>>> >>> >> >> >> > >> > >> >>> we can leave it for the second alpha.
>>> >>> >> >> >> > >> > >> >>> >
>>> >>> >> >> >> > >> > >> >>> > If you agree with this, then we should be
>>> >>> >> >> >> > >> > >> >>> > able to
>>> >>> >> cut a
>>> >>> >> >> >> > >> > >> >>> > candidate by
>>> >>> >> >> >> > >> > >> the
>>> >>> >> >> >> > >> > >> >>> end of this week.
>>> >>> >> >> >> > >> > >> >>> >
>>> >>> >> >> >> > >> > >> >>> > -Flavio
>>> >>> >> >> >> > >> > >> >>> >
>>> >>> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>>> >>> >> >> >> > >> > >> >>> > <ph...@apache.org>
>>> >>> >> >> >> > >> wrote:
>>> >>> >> >> >> > >> > >> >>> >
>>> >>> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c
>>> >>> >> >> >> > >> > >> >>> >> client
>>> >>> >> test
>>> >>> >> >> >> > >> > >> >>> >> log output
>>> >>> >> >> >> > >> > >> here
>>> >>> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
>>> >>> >> >> >> > >> > >> >>> >>
>>> >>> >> >> >> > >> > >> >>>
>>> >>> >> >> >> > >> > >>
>>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>>> >>> >> >> >> > >> > >> r-
>>> >>> >> >> >> > >> > trunk
>>> >>> >> >> >> > >> > >> /2372/
>>> >>> >> >> >> > >> > >> >>> >>
>>> >>> >> >> >> > >> > >> >>> >> Patrick
>>> >>> >> >> >> > >> > >> >>> >>
>>> >>> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick
>>> >>> >> >> >> > >> > >> >>> >> Hunt
>>> >>> >> >> >> > >> > >> >>> >> <ph...@apache.org>
>>> >>> >> >> >> > >> > >> wrote:
>>> >>> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0
>>> >>> >> >> >> > >> > >> >>> >>> (not
>>> >>> >> clear
>>> >>> >> >> >> > >> > >> >>> >>> to me which
>>> >>> >> >> >> > >> > >> >>> >>> one(s?) is new?)
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported
>>> >>> >> >> >> > >> > >> >>> >>> is
>>> >>> hitting
>>> >>> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as
>>> >>> >> >> >> > >> > >> >>> >>> well.
>>> >>> I've
>>> >>> >> >> >> > >> > >> >>> >>> updated the "archive" list
>>> >>> >> >> >> > >> > >> to
>>> >>> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So
>>> >>> >> >> >> > >> > >> >>> >>> while it
>>> >>> >> won't
>>> >>> >> >> >> > >> > >> >>> >>> go
>>> >>> >> >> >> > to
>>> >>> >> >> >> > >> > >> console
>>> >>> >> >> >> > >> > >> >>> >>> at least we can debug when there is a
>>> >>> >> >> >> > >> > >> >>> >>> failure.
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews
>>> >>> >> >> >> > >> > >> >>> >>> for the
>>> >>> >> jetty
>>> >>> >> >> >> > server
>>> >>> >> >> >> > >> > >> support
>>> >>> >> >> >> > >> > >> >>> >>> and it looks like that should be ready
>>> >>> >> >> >> > >> > >> >>> >>> soon.
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>> Raul also requested that someone
>>> >>> >> >> >> > >> > >> >>> >>> prioritize
>>> >>> >> reviewing
>>> >>> >> >> >> > >> > >> "ZOOKEEPER-1919
>>> >>> >> >> >> > >> > >> >>> >>> Update the C implementation of
>>> >>> >> >> >> > >> > >> >>> >>> removeWatches to
>>> >>> >> have
>>> >>> >> >> >> > >> > >> >>> >>> it
>>> >>> >> >> >> > >> > match
>>> >>> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it
>>> >>> >> >> >> > >> > >> >>> >>> in
>>> >>> 3.5.0.
>>> >>> >> >> >> > >> Flavio/Michi?
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the
>>> >>> >> >> >> > >> > >> >>> >>> flakey c
>>> >>> >> client
>>> >>> >> >> >> > >> > >> >>> >>> reconfig
>>> >>> >> >> >> > >> > >> test -
>>> >>> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test
>>> >>> >> >> >> > >> > >> >>> >>> infra!
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like
>>> >>> >> >> >> > >> > >> >>> >>> we're
>>> >>> >> pretty
>>> >>> >> >> >> > close.
>>> >>> >> >> >> > >> > >> >>> >>> Do
>>> >>> >> >> >> > >> > >> folks
>>> >>> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at
>>> >>> >> >> >> > >> > >> >>> >>> this
>>> >>> point?
>>> >>> >> >> >> > >> > >> >>> >>> (with a few
>>> >>> >> >> >> > >> > >> pending
>>> >>> >> >> >> > >> > >> >>> >>> as above)
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>> Patrick
>>> >>> >> >> >> > >> > >> >>> >>>
>>> >>> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl
>>> >>> >> >> >> > >> > >> >>> >>> Gutiérrez
>>> >>> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>>> >>> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio
>>> >>> >> >> >> > >> > >> >>> >>>> Junqueira"
>>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>>> >>> >> >> >> > >> > >> >>> >>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much,
>>> >>> >> >> >> > >> > >> >>> >>>>> what if
>>> >>> we
>>> >>> >> >> >> > >> > >> >>> >>>>> release
>>> >>> >> >> >> > an
>>> >>> >> >> >> > >> > >> >>> >>>>> alpha
>>> >>> >> >> >> > >> > >> >>> version
>>> >>> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one
>>> >>> >> >> >> > >> > >> >>> >>>> in
>>> >>> 2-3
>>> >>> >> >> >> > >> > >> >>> >>>> weeks
>>> >>> >> >> >> > time?
>>> >>> >> >> >> > >> > >> >>> >>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>
>>> >>> >> >> >> > >> > >> >>> >>>> +1
>>> >>> >> >> >> > >> > >> >>> >>>>
>>> >>> >> >> >> > >> > >> >>> >>>> -rgs
>>> >>> >> >> >> > >> > >> >>> >>>>
>>> >>> >> >> >> > >> > >> >>> >>>>> -Flavio
>>> >>> >> >> >> > >> > >> >>> >>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
>>> >>> Gutiérrez
>>> >>> >> >> >> > Segalés <
>>> >>> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>>> >>> >> >> >> > >> > >> >>> >>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>>> >>> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>>> >>> >> >> >> > >> > wrote:
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on
>>> >>> >> >> >> > >> > >> >>> >>>>>>> 5.1.0
>>> >>> >> (from 8
>>> >>> >> >> >> > >> > >> >>> >>>>>>> in
>>> >>> >> >> >> > the
>>> >>> >> >> >> > >> > >> >>> >>>>>>> last
>>> >>> >> >> >> > >> > >> >>> check).
>>> >>> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from
>>> >>> >> >> >> > >> > >> >>> >>>>>>> Michi, and
>>> >>> >> >> >> > >> > >> >>> >>>>>>> Camille is
>>> >>> >> >> >> > >> > >> threatening
>>> >>> >> >> >> > >> > >> >>> to
>>> >>> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress
>>> >>> >> >> >> > >> > >> >>> >>>>>>> in
>>> >>> >> general
>>> >>> >> >> >> > >> > >> >>> >>>>>>> on
>>> >>> >> >> >> > the
>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great
>>> >>> >> >> >> > >> > >> >>> >>>>>>> to
>>> >>> see.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might
>>> >>> >> >> >> > >> > >> >>> >>>>>>> consider -
>>> >>> >> >> >> > >> > >> >>> >>>>>>> should we drop
>>> >>> >> >> >> > >> > >> jdk6
>>> >>> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL
>>> >>> >> >> >> > >> > >> >>> >>>>>>> by
>>> >>> Oracle
>>> >>> >> >> >> > >> > >> >>> >>>>>>> but I suspect
>>> >>> >> >> >> > >> > >> some
>>> >>> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We
>>> >>> >> >> >> > >> > >> >>> >>>>>>> gotta
>>> >>> move
>>> >>> >> >> >> > >> > >> >>> >>>>>>> forward though,
>>> >>> >> >> >> > >> > >> >>> can't
>>> >>> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note
>>> >>> >> >> >> > >> > >> >>> >>>>>>> that we
>>> >>> are
>>> >>> >> >> >> > currently
>>> >>> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7
>>> >>> >> >> >> > >> > >> >>> >>>>>>> and
>>> >>> 8.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
>>> >>> >> >> >> > >> > >> >>> >>>>
>>> >>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>> >>> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone
>>> >>> >> >> >> > >> > >> >>> >>>>>> using
>>> >>> >> >> >> > >> > >> >>> >>>>>> Observers with the
>>> >>> >> >> >> > >> > >> >>> upcoming
>>> >>> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network
>>> >>> >> >> >> > >> > >> >>> >>>>>> usage go
>>> >>> >> >> >> wild...).
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>> -rgs
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>> Patrick
>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
>>> >>> >> Junqueira
>>> >>> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
>>> >>> already,
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
>>> >>> >> >> >> > >> > >> >>> >>>> there. I
>>> >>> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because
>>> >>> >> >> >> > >> > >> >>> >>>>>>> LETest
>>> >>> >> >> >> > >> > >> >>> >>>>>>> failed in the
>>> >>> >> >> >> > >> > >> last QA
>>> >>> >> >> >> > >> > >> >>> run
>>> >>> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't
>>> >>> >> >> >> > >> > >> >>> >>>>>>> affect
>>> >>> >> LETest,
>>> >>> >> >> >> > >> > >> >>> >>>>>>> and
>>> >>> >> >> >> > in
>>> >>> >> >> >> > >> > >> >>> >>>>>>> fact
>>> >>> >> >> >> > >> > >> it
>>> >>> >> >> >> > >> > >> >>> fails
>>> >>> >> >> >> > >> > >> >>> >>>> in
>>> >>> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test
>>> >>> >> >> >> > >> > >> >>> >>>>>>> failure
>>> >>> >> doesn't
>>> >>> >> >> >> > >> > >> >>> >>>>>>> seem
>>> >>> >> >> >> > to
>>> >>> >> >> >> > >> > >> >>> >>>>>>> be
>>> >>> >> >> >> > >> > >> >>> related
>>> >>> >> >> >> > >> > >> >>> >>>> to the
>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> say
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
>>> >>> >> >> >> > >> > >> about
>>> >>> >> >> >> > >> > >> >>> it.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM,
>>> >>> >> >> >> > >> > >> >>> >>>>>>>> Patrick
>>> >>> >> Hunt <
>>> >>> >> >> >> > >> > >> phunt@apache.org>
>>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> can
>>> >>> get
>>> >>> >> >> >> > >> > >> reviewed/finalized
>>> >>> >> >> >> > >> > >> >>> before
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
>>> >>> cutting
>>> >>> >> an
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> RC
>>> >>> >> >> >> > >> > soonish...
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM,
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Flavio
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>> <fp...@yahoo.com.invalid>
>>> >>> >> >> wrote:
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
>>> >>> versions.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810)
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> and
>>> >>> >> ZK-1863
>>> >>> >> >> in.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> They are
>>> >>> >> >> >> > >> > >> both
>>> >>> >> >> >> > >> > >> >>> >>>> patch
>>> >>> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it
>>> >>> >> >> >> > >> > >> >>> >>>>>>> is
>>> >>> still
>>> >>> >> >> >> > >> > >> >>> >>>>>>> open because we
>>> >>> >> >> >> > >> > >> >>> need a
>>> >>> >> >> >> > >> > >> >>> >>>> 3.4
>>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> Hunt
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>>> >>> >> >> >> > >> > >> >>> wrote:
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> about it
>>> >>> for
>>> >>> >> a
>>> >>> >> >> >> > while, a
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> few
>>> >>> >> >> >> > >> > >> >>> people
>>> >>> >> >> >> > >> > >> >>> >>>> have
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
>>> >>> contacted
>>> >>> >> me
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
>>> >>> >> >> >> > >> > >> that
>>> >>> >> >> >> > >> > >> >>> they
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>> >>> >> first
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
>>> >>> >> >> >> > >> > release.
>>> >>> >> >> >> > >> > >> Every
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> wait for
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>>> >>> >> >> >> > >> > >> never
>>> >>> >> >> >> > >> > >> >>> get
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> tons of
>>> >>> >> great
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
>>> >>> >> >> >> > >> > >> >>> lots of
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
>>> >>> release so
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
>>> >>> >> >> >> > >> > >> use
>>> >>> >> >> >> > >> > >> >>> it,
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> stable
>>> >>> except
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
>>> >>> >> >> >> > >> > >> >>> flakey
>>> >>> >> >> >> > >> > >> >>> >>>> test
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> committed
>>> >>> >> today to
>>> >>> >> >> >> > >> > trunk.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
>>> >>> >> >> >> > >> > >> that
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>> >>> code on
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
>>> >>> >> >> >> > and
>>> >>> >> >> >> > >> > jdk8.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
>>> >>> should
>>> >>> >> plan
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> our
>>> >>> >> >> >> > >> > >> releases:
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> a
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>>> >>> >> >> >> > for
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> some
>>> >>> >> >> >> > >> > >> time.
>>> >>> >> >> >> > >> > >> >>> >>>> What I
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> similar to
>>> >>> >> what
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> we
>>> >>> >> >> >> > did
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for
>>> >>> >> >> >> > >> > >> 3.4.
>>> >>> >> >> >> > >> > >> >>> >>>> (this is
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> during
>>> >>> >> their
>>> >>> >> >> >> > Hadoop 2
>>> >>> >> >> >> > >> > >> release
>>> >>> >> >> >> > >> > >> >>> >>>> cycle)
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> releases,
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
>>> >>> >> >> >> > >> > >> can run
>>> >>> >> >> >> > >> > >> >>> >>>> and
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
>>> >>> blockers
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> and
>>> >>> >> >> >> > feel
>>> >>> >> >> >> > >> > >> >>> comfortable
>>> >>> >> >> >> > >> > >> >>> >>>> with
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
>>> >>> switch
>>> >>> >> to
>>> >>> >> >> >> beta.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
>>> >>> >> >> >> > >> > >> get
>>> >>> >> >> >> > >> > >> >>> >>>> some
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> alpha/beta
>>> >>> >> moniker
>>> >>> >> >> >> > >> > and
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
>>> >>> >> >> >> > >> > >> >>> making
>>> >>> >> >> >> > >> > >> >>> >>>> it
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> will
>>> >>> >> become
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>> >>> >> >> >> > >> > >> >>> "current/stable"
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.1-alpha (3
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
>>> >>> >> 3.5.3-beta
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.5-beta
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> alpha/beta
>>> >>> but
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
>>> >>> >> >> >> > >> > >> "stable" vs
>>> >>> >> >> >> > >> > >> >>> >>>> 3.4.x,
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> still
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
>>> >>> >> >> >> > >> > >> shake
>>> >>> >> >> >> > >> > >> >>> >>>> out)
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> releases
>>> >>> for
>>> >>> >> >> >> > production
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
>>> >>> >> >> >> > >> > >> >>> stable,
>>> >>> >> >> >> > >> > >> >>> >>>>>>> etc...
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently,
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> are any
>>> >>> of
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
>>> >>> >> >> >> > >> > >> that
>>> >>> >> >> >> > >> > >> >>> >>>> should
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> a
>>> >>> couple
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>>> >>> >> >> >> > >> > >> find
>>> >>> >> >> >> > >> > >> >>> >>>> this a
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ball
>>> >>> >> rolling to
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
>>> >>> >> >> >> > an
>>> >>> >> >> >> > >> RC.
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >>>>>>
>>> >>> >> >> >> > >> > >> >>> >
>>> >>> >> >> >> > >> > >> >>>
>>> >>> >> >> >> > >> > >>
>>> >>> >> >> >> > >>
>>> >>> >> >> >> > >>
>>> >>> >> >> >> >
>>> >>> >> >> >>
>>> >>> >> >>
>>> >>> >>
>>> >>>
>>
>>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Actually if servers 1 and 3 are talking and 3 is elected and not 1, it
means that 3 also saw the reconfig. So it should also complete it when it
reboots. To debug this I suggest to print out the last seen config in the
beginning of leader.lead().

Is it possible that writing the .next file to disk fails ?

Alternatively we could just remove this part of the test (attached patch) -
the test's goal is to check that the leader times out when it looses a
quorum of the new config, and the part of the test that fails now is not
needed to check that. There are other tests in ReconfigRecoveryTest that
are supposed to check recovery.



On Tue, Jul 22, 2014 at 1:07 PM, Alexander Shraer <sh...@gmail.com> wrote:

> yep, I think what happens is that server 3 is becoming leader and not
> server 1, so its not completing the reconfig. Let me think about how to
> solve this...
>
>
> On Tue, Jul 22, 2014 at 12:21 PM, Patrick Hunt <ph...@apache.org> wrote:
>
>> Also if you want to submit a patch that provides more insight (logs)
>> for that operation/test lmk and I'll be happy to review/commit it.
>> Should help with debugging the issue and debugging in the field.
>>
>> Thanks!
>>
>> Patrick
>>
>> On Tue, Jul 22, 2014 at 12:17 PM, Patrick Hunt <ph...@apache.org> wrote:
>> > Here's the logs (attached) for the test that failed. Nothing stuck out
>> > at me - anything ring a bell?
>> >
>> > Patrick
>> >
>> > On Tue, Jul 22, 2014 at 12:10 PM, Alexander Shraer <sh...@gmail.com>
>> wrote:
>> >> Unfortunately doesn't look like we have enough logging going on there.
>> >> For example would be nice to know what's the committed config and last
>> seen
>> >> config
>> >> of the leader when it comes up (leader.lead()). and what configuration
>> is
>> >> sent in the NEWLEADER message
>> >> sent out in LeaderHandler:
>> >>
>> >>                 QuorumPacket newLeaderQP = new
>> >> QuorumPacket(Leader.NEWLEADER,
>> >>                         newLeaderZxid,
>> >> leader.self.getLastSeenQuorumVerifier()
>> >>                                 .toString().getBytes(), null);
>> >>
>> >>
>> >> I didn't know about the option to have a separate administrative
>> interface,
>> >> and just followed the flow of other commands... I agree that it would
>> be
>> >> cleaner.
>> >>
>> >>
>> >>
>> >>
>> >> On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org>
>> wrote:
>> >>
>> >>> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer <shralex@gmail.com
>> >
>> >>> wrote:
>> >>> > Hmm. It doesn't really make sense to me - the reconfig should be
>> >>> completed
>> >>> > before
>> >>> > the servers come up and process new ops. We submitted the reconfig
>> to
>> >>> > server 1, it timed out
>> >>> > on new quorum, but when 1 becomes leader again after 2 restarts 1
>> should
>> >>> > complete the reconfig.
>> >>> > is 1 becoming leader after 2 restarts ?
>> >>> >
>> >>>
>> >>> What should I look for in the logs? Any specific log messages that
>> >>> would help debug?
>> >>>
>> >>> > About admin controls - reconfig/getConfig are open to everyone,
>> unless
>> >>> you
>> >>> > set permissions on the configuration znode being written during
>> reconfig.
>> >>> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
>> >>> >
>> >>> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
>> >>> > request.authInfo);
>> >>> >
>> >>>
>> >>> So I can turn off all access then? (read and write). Should we ship
>> >>> that as the default? We should add that to the docs.
>> >>>
>> >>> In the past we've always tried to hide this type of information from
>> >>> clients (e.g. we don't expose the zk server address to the client for
>> >>> a session). This seems like a very big departure. Why didn't we move
>> >>> it to a separate, administrative, interface?
>> >>>
>> >>> Patrick
>> >>>
>> >>> >
>> >>> >
>> >>> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com>
>> wrote:
>> >>> >
>> >>> >> Looks like 3 hasn't been removed (unfortunately the assertion
>> doesn't
>> >>> >> include any msg detail, but that's the way it looks to me like the
>> >>> >> test is setup):
>> >>> >>
>> >>> >>         if (leavingServers != null) {
>> >>> >>             for (String leaving : leavingServers)
>> >>> >>
>> >>> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>> >>> >>         }
>> >>> >>
>> >>> >> which is called from:
>> >>> >>
>> >>> >>         qu.restart(2);
>> >>> >>         // Now that 2 is back up, they'll complete the reconfig
>> >>> removing 3
>> >>> >> and
>> >>> >>         // can process other ops.
>> >>> >>         testServerHasConfig(zkArr[1], null, leavingServers);
>> >>> >>
>> >>> >> It seems like the problem is that testServerHasConfig is not
>> waiting
>> >>> >> for the configuration to be updated? In this case 2 was just
>> restarted
>> >>> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
>> >>> >> which might be why you aren't seeing the issue? hence the
>> flakeyness)
>> >>> >>
>> >>> >> Patrick
>> >>> >>
>> >>> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <
>> shralex@gmail.com>
>> >>> >> wrote:
>> >>> >> > Hi Patrick, I'm not sure why you're seeing this - it consistently
>> >>> passes
>> >>> >> on
>> >>> >> > my machine. In case you'd like to take a look, the test has tons
>> of
>> >>> >> > comments explaining the scenario. Let me know how I can help.
>> >>> >> >
>> >>> >> >
>> >>> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
>> >>> wrote:
>> >>> >> >
>> >>> >> >> Hi Alex, I've also seen the test
>> "testLeaderTimesoutOnNewQuorum" fail
>> >>> >> >> multiple times (not every time, but ~50%, so flakey) in the
>> last few
>> >>> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
>> >>> >> >> jenkins, I haven't see any other failures than this during the
>> past
>> >>> >> >> few days).
>> >>> >> >>
>> >>> >> >> junit.framework.AssertionFailedError
>> >>> >> >> at
>> >>> >> >>
>> >>> >>
>> >>>
>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>> >>> >> >> at
>> >>> >> >>
>> >>> >>
>> >>>
>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>> >>> >> >> at
>> >>> >> >>
>> >>> >>
>> >>>
>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>> >>> >> >>
>> >>> >> >> Patrick
>> >>> >> >>
>> >>> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <
>> shralex@gmail.com
>> >>> >
>> >>> >> >> wrote:
>> >>> >> >> > Hi Rakesh,
>> >>> >> >> >
>> >>> >> >> > Thanks for looking at this. In general even if we find the bug
>> >>> since
>> >>> >> we
>> >>> >> >> > should test it before committing a fix, it seems better to
>> remove
>> >>> the
>> >>> >> >> test
>> >>> >> >> > for now and debug this on a build machine. I'm trying to get
>> >>> access to
>> >>> >> >> it.
>> >>> >> >> >
>> >>> >> >> > Looking at this log:
>> >>> >> >> >
>> >>> >> >>
>> >>> >>
>> >>>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>> >>> >> >> >
>> >>> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
>> >>> version
>> >>> >> >> 200000000
>> >>> >> >> > is already being sent around as committed!
>> >>> >> >> >
>> >>> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>> >>> >> >> >
>> >>> >>
>> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
>> >>> ]
>> >>> >> >> > - 2 Received version: 200000000 my version: 0
>> >>> >> >> >
>> >>> >> >> >
>> >>> >> >> > and also in leader election messages.
>> >>> >> >> >
>> >>> >> >> > Also weird is that the version of 2 is 0 as if it is a joiner,
>> >>> >> whereas we
>> >>> >> >> > explicitly started it with 100000000.
>> >>> >> >> > Then it makes sense that the new config can't be committed
>> since
>> >>> its
>> >>> >> >> > version is not high enough...
>> >>> >> >> >
>> >>> >> >> > I wonder if its possible that not all servers from the
>> previous
>> >>> test
>> >>> >> are
>> >>> >> >> > dead and they are interfering...
>> >>> >> >> >
>> >>> >> >> >
>> >>> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <rakeshr@huawei.com
>> >
>> >>> wrote:
>> >>> >> >> >
>> >>> >> >> >> Hi Alex,
>> >>> >> >> >>
>> >>> >> >> >> Yeah it is consistently passing in my machine also.
>> >>> >> >> >>
>> >>> >> >> >>
>> >>> >> >> >> I have quickly gone through the
>> >>> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>> >>> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
>> >>> version)
>> >>> >> >> has
>> >>> >> >> >> not taken and still leader election is seeing 100000000
>> (n.config
>> >>> >> >> version).
>> >>> >> >> >> Unfortunately I didn't find the reason for not considering
>> the
>> >>> >> updated
>> >>> >> >> >> config version.
>> >>> >> >> >>
>> >>> >> >> >>
>> >>> >> >> >> Reference:
>> >>> >> >> >>
>> >>> >> >>
>> >>> >>
>> >>>
>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>> >>> >> >> >>
>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >>> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922]
>> -
>> >>> >> >> >> Notification time out: 51200
>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>> Notification:
>> >>> 2
>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>> (n.zxid), 0x1
>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>> >>> LOOKING
>> >>> >> (my
>> >>> >> >> >> state)100000000 (n.config version)
>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>> Notification:
>> >>> 2
>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>> (n.zxid), 0x1
>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>> >>> LOOKING
>> >>> >> (my
>> >>> >> >> >> state)100000000 (n.config version)
>> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>> >>> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922]
>> -
>> >>> >> >> >> Notification time out: 51200
>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>> Notification:
>> >>> 2
>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>> (n.zxid), 0x1
>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>> >>> LOOKING
>> >>> >> (my
>> >>> >> >> >> state)100000000 (n.config version)
>> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
>> Notification:
>> >>> 2
>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>> (n.zxid), 0x1
>> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>> >>> LOOKING
>> >>> >> (my
>> >>> >> >> >> state)100000000 (n.config version)
>> >>> >> >> >>
>> >>> >> >> >>
>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
>> Notification:
>> >>> 2
>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>> (n.zxid), 0x1
>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>> >>> LOOKING
>> >>> >> (my
>> >>> >> >> >> state)100000000 (n.config version)
>> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
>> Notification:
>> >>> 2
>> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005
>> (n.zxid), 0x1
>> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>> >>> LOOKING
>> >>> >> (my
>> >>> >> >> >> state)100000000 (n.config version)
>> >>> >> >> >>
>> >>> >> >> >>
>> >>> >> >> >> -Rakesh
>> >>> >> >> >>
>> >>> >> >> >> -----Original Message-----
>> >>> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>> >>> >> >> >> Sent: 22 July 2014 11:57
>> >>> >> >> >> To: dev@zookeeper.apache.org
>> >>> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >>> >> >> >>
>> >>> >> >> >> I tried to look into it, but the test consistently passes
>> locally
>> >>> on
>> >>> >> two
>> >>> >> >> >> machines.
>> >>> >> >> >> I don't currently have access to the build machine, but I
>> can try
>> >>> to
>> >>> >> ask
>> >>> >> >> >> for access.
>> >>> >> >> >> Unless anyone has a better suggestion, we could remove the
>> failing
>> >>> >> test
>> >>> >> >> in
>> >>> >> >> >> the meanwhile and open a JIRA to add it back...
>> >>> >> >> >>
>> >>> >> >> >>
>> >>> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <
>> phunt@apache.org>
>> >>> >> >> wrote:
>> >>> >> >> >>
>> >>> >> >> >> > I'm seeing alot of test failures in
>> >>> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone
>> take a
>> >>> >> look?
>> >>> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
>> >>> >> >> >> >
>> >>> >> >> >> >
>> >>> >> >> >> >
>> >>> >> >>
>> >>>
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>> >>> >> >> >> >
>> >>> >>
>> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>> >>> >> >> >> > tabpanel#comment-14069024
>> >>> >> >> >> >
>> >>> >> >> >> > Patrick
>> >>> >> >> >> >
>> >>> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>> >>> >> >> >> > <ra...@gmail.com> wrote:
>> >>> >> >> >> > > lgtm +1
>> >>> >> >> >> > >
>> >>> >> >> >> > >
>> >>> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>> >>> >> >> >> > > <fp...@yahoo.com.invalid>
>> >>> >> >> >> > wrote:
>> >>> >> >> >> > >
>> >>> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
>> >>> release, I
>> >>> >> >> >> > >> +think
>> >>> >> >> >> > 72
>> >>> >> >> >> > >> biz hours is enough for the vote.
>> >>> >> >> >> > >>
>> >>> >> >> >> > >> -Flavio
>> >>> >> >> >> > >>
>> >>> >> >> >> > >> > -----Original Message-----
>> >>> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>> >>> >> >> >> > >> > Sent: 21 July 2014 18:55
>> >>> >> >> >> > >> > To: DevZooKeeper
>> >>> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >>> >> >> >> > >> >
>> >>> >> >> >> > >> > I fixed a number of issues. I also started a few
>> threads
>> >>> with
>> >>> >> >> >> > >> > builds@
>> >>> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and
>> I
>> >>> worked
>> >>> >> >> >> > through a
>> >>> >> >> >> > >> > number of findbugs issues, it's not closed yet but
>> it's
>> >>> pretty
>> >>> >> >> >> close.
>> >>> >> >> >> > >> >
>> >>> >> >> >> > >> > I don't see why we can't create an RC and start
>> voting this
>> >>> >> week
>> >>> >> >> >> > though.
>> >>> >> >> >> > >> > Anyone disagree?
>> >>> >> >> >> > >> >
>> >>> >> >> >> > >> > How long should we let the vote run, the std 72 biz
>> hours
>> >>> or
>> >>> >> >> >> > >> > should we
>> >>> >> >> >> > >> plan
>> >>> >> >> >> > >> > for more to allow folks more time to test?
>> >>> >> >> >> > >> >
>> >>> >> >> >> > >> > Patrick
>> >>> >> >> >> > >> >
>> >>> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez
>> Segalés
>> >>> >> >> >> > >> > <rg...@itevenworks.net> wrote:
>> >>> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <
>> phunt@apache.org>
>> >>> >> wrote:
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK
>> >>> jobs -
>> >>> >> I'm
>> >>> >> >> >> > trying
>> >>> >> >> >> > >> > >> to fix some of the jobs that were broken during the
>> >>> recent
>> >>> >> >> >> > >> > >> host upgrade.
>> >>> >> >> >> > >> > >>
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > > How are things looking? Is it likely that we can
>> have a
>> >>> >> 3.5.0
>> >>> >> >> >> > >> > > alpha release week or are we still blocked on
>> Jenkins?
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > > -rgs
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >
>> >>> >> >> >> > >> > >> Patrick
>> >>> >> >> >> > >> > >>
>> >>> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>> >>> >> >> >> > >> > >> <mi...@cs.stanford.edu>
>> >>> >> >> >> > >> > >> wrote:
>> >>> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>> >>> >> >> >> > >> > >> >
>> >>> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander
>> Shraer
>> >>> >> >> >> > >> > >> > <sh...@gmail.com>
>> >>> >> >> >> > >> > >> wrote:
>> >>> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille
>> gave a
>> >>> +1
>> >>> >> and
>> >>> >> >> >> > >> > >> >> all
>> >>> >> >> >> > >> > >> subsequent
>> >>> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>> >>> >> >> >> > >> > >> >>
>> >>> >> >> >> > >> > >> >>
>> >>> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>> >>> >> >> >> > >> > >> >> <phunt@apache.org
>> >>> >> >> >> > >
>> >>> >> >> >> > >> > wrote:
>> >>> >> >> >> > >> > >> >>
>> >>> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all
>> failing due
>> >>> to,
>> >>> >> >> >> > >> > >> >>> for
>> >>> >> >> >> > e.g.
>> >>> >> >> >> > >> > >> >>> findbugs issues. At the very least our
>> build/test/ci
>> >>> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok
>> (the
>> >>> recent
>> >>> >> >> >> > >> > >> >>> startServer fix
>> >>> >> >> >> > and
>> >>> >> >> >> > >> > >> >>> some other flakeys that have been addressed go
>> a
>> >>> long
>> >>> >> way
>> >>> >> >> >> > >> > >> >>> on
>> >>> >> >> >> > that
>> >>> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should
>> be
>> >>> >> cleaned
>> >>> >> >> >> > >> > >> >>> up before we cut a release. I started a
>> separate
>> >>> >> thread to
>> >>> >> >> >> > >> > >> >>> discuss
>> >>> >> >> >> > >> the
>> >>> >> >> >> > >> > findbugs issue.
>> >>> >> >> >> > >> > >> >>>
>> >>> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>> >>> >> >> >> > >> > >> >>>
>> >>> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul
>> on
>> >>> 1919?
>> >>> >> >> >> > >> > >> >>>
>> >>> >> >> >> > >> > >> >>> Patrick
>> >>> >> >> >> > >> > >> >>>
>> >>> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio
>> Junqueira
>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>> >>> >> >> >> > >> > >> >>> > My take:
>> >>> >> >> >> > >> > >> >>> >
>> >>> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker
>> and
>> >>> it
>> >>> >> can
>> >>> >> >> >> > >> > >> >>> > go
>> >>> >> >> >> > in.
>> >>> >> >> >> > >> > >> >>> > See
>> >>> >> >> >> > >> > >> the
>> >>> >> >> >> > >> > >> >>> jira for comments.
>> >>> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first
>> >>> alpha.
>> >>> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha
>> depending on
>> >>> >> >> >> > >> > >> >>> > ZK-1919
>> >>> >> >> >> > and
>> >>> >> >> >> > >> > >> ZK-1910,
>> >>> >> >> >> > >> > >> >>> we can leave it for the second alpha.
>> >>> >> >> >> > >> > >> >>> >
>> >>> >> >> >> > >> > >> >>> > If you agree with this, then we should be
>> able to
>> >>> >> cut a
>> >>> >> >> >> > >> > >> >>> > candidate by
>> >>> >> >> >> > >> > >> the
>> >>> >> >> >> > >> > >> >>> end of this week.
>> >>> >> >> >> > >> > >> >>> >
>> >>> >> >> >> > >> > >> >>> > -Flavio
>> >>> >> >> >> > >> > >> >>> >
>> >>> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>> >>> >> >> >> > >> > >> >>> > <ph...@apache.org>
>> >>> >> >> >> > >> wrote:
>> >>> >> >> >> > >> > >> >>> >
>> >>> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c
>> client
>> >>> >> test
>> >>> >> >> >> > >> > >> >>> >> log output
>> >>> >> >> >> > >> > >> here
>> >>> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
>> >>> >> >> >> > >> > >> >>> >>
>> >>> >> >> >> > >> > >> >>>
>> >>> >> >> >> > >> > >>
>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>> >>> >> >> >> > >> > >> r-
>> >>> >> >> >> > >> > trunk
>> >>> >> >> >> > >> > >> /2372/
>> >>> >> >> >> > >> > >> >>> >>
>> >>> >> >> >> > >> > >> >>> >> Patrick
>> >>> >> >> >> > >> > >> >>> >>
>> >>> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick
>> Hunt
>> >>> >> >> >> > >> > >> >>> >> <ph...@apache.org>
>> >>> >> >> >> > >> > >> wrote:
>> >>> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0
>> (not
>> >>> >> clear
>> >>> >> >> >> > >> > >> >>> >>> to me which
>> >>> >> >> >> > >> > >> >>> >>> one(s?) is new?)
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is
>> >>> hitting
>> >>> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as
>> well.
>> >>> I've
>> >>> >> >> >> > >> > >> >>> >>> updated the "archive" list
>> >>> >> >> >> > >> > >> to
>> >>> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So
>> while it
>> >>> >> won't
>> >>> >> >> >> > >> > >> >>> >>> go
>> >>> >> >> >> > to
>> >>> >> >> >> > >> > >> console
>> >>> >> >> >> > >> > >> >>> >>> at least we can debug when there is a
>> failure.
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews
>> for the
>> >>> >> jetty
>> >>> >> >> >> > server
>> >>> >> >> >> > >> > >> support
>> >>> >> >> >> > >> > >> >>> >>> and it looks like that should be ready
>> soon.
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>> Raul also requested that someone prioritize
>> >>> >> reviewing
>> >>> >> >> >> > >> > >> "ZOOKEEPER-1919
>> >>> >> >> >> > >> > >> >>> >>> Update the C implementation of
>> removeWatches to
>> >>> >> have
>> >>> >> >> >> > >> > >> >>> >>> it
>> >>> >> >> >> > >> > match
>> >>> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it
>> in
>> >>> 3.5.0.
>> >>> >> >> >> > >> Flavio/Michi?
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the
>> flakey c
>> >>> >> client
>> >>> >> >> >> > >> > >> >>> >>> reconfig
>> >>> >> >> >> > >> > >> test -
>> >>> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test
>> infra!
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like
>> we're
>> >>> >> pretty
>> >>> >> >> >> > close.
>> >>> >> >> >> > >> > >> >>> >>> Do
>> >>> >> >> >> > >> > >> folks
>> >>> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this
>> >>> point?
>> >>> >> >> >> > >> > >> >>> >>> (with a few
>> >>> >> >> >> > >> > >> pending
>> >>> >> >> >> > >> > >> >>> >>> as above)
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>> Patrick
>> >>> >> >> >> > >> > >> >>> >>>
>> >>> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl
>> Gutiérrez
>> >>> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>> >>> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio
>> Junqueira"
>> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>> >>> >> >> >> > >> > >> >>> >>>>>
>> >>> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much,
>> what if
>> >>> we
>> >>> >> >> >> > >> > >> >>> >>>>> release
>> >>> >> >> >> > an
>> >>> >> >> >> > >> > >> >>> >>>>> alpha
>> >>> >> >> >> > >> > >> >>> version
>> >>> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one
>> in
>> >>> 2-3
>> >>> >> >> >> > >> > >> >>> >>>> weeks
>> >>> >> >> >> > time?
>> >>> >> >> >> > >> > >> >>> >>>>>
>> >>> >> >> >> > >> > >> >>> >>>>
>> >>> >> >> >> > >> > >> >>> >>>> +1
>> >>> >> >> >> > >> > >> >>> >>>>
>> >>> >> >> >> > >> > >> >>> >>>> -rgs
>> >>> >> >> >> > >> > >> >>> >>>>
>> >>> >> >> >> > >> > >> >>> >>>>> -Flavio
>> >>> >> >> >> > >> > >> >>> >>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>
>> >>> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
>> >>> Gutiérrez
>> >>> >> >> >> > Segalés <
>> >>> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>> >>> >> >> >> > >> > >> >>> >>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>> >>> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>> >>> >> >> >> > >> > wrote:
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on
>> 5.1.0
>> >>> >> (from 8
>> >>> >> >> >> > >> > >> >>> >>>>>>> in
>> >>> >> >> >> > the
>> >>> >> >> >> > >> > >> >>> >>>>>>> last
>> >>> >> >> >> > >> > >> >>> check).
>> >>> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from
>> Michi, and
>> >>> >> >> >> > >> > >> >>> >>>>>>> Camille is
>> >>> >> >> >> > >> > >> threatening
>> >>> >> >> >> > >> > >> >>> to
>> >>> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress
>> in
>> >>> >> general
>> >>> >> >> >> > >> > >> >>> >>>>>>> on
>> >>> >> >> >> > the
>> >>> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great
>> to
>> >>> see.
>> >>> >> >> >> > >> > >> >>> >>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might
>> consider -
>> >>> >> >> >> > >> > >> >>> >>>>>>> should we drop
>> >>> >> >> >> > >> > >> jdk6
>> >>> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL
>> by
>> >>> Oracle
>> >>> >> >> >> > >> > >> >>> >>>>>>> but I suspect
>> >>> >> >> >> > >> > >> some
>> >>> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We
>> gotta
>> >>> move
>> >>> >> >> >> > >> > >> >>> >>>>>>> forward though,
>> >>> >> >> >> > >> > >> >>> can't
>> >>> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note
>> that we
>> >>> are
>> >>> >> >> >> > currently
>> >>> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7
>> and
>> >>> 8.
>> >>> >> >> >> > >> > >> >>> >>>>>>>
>> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >>> >> >> >> > >> > >> >>> >>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
>> >>> >> >> >> > >> > >> >>> >>>>
>> >>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >>> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone
>> using
>> >>> >> >> >> > >> > >> >>> >>>>>> Observers with the
>> >>> >> >> >> > >> > >> >>> upcoming
>> >>> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network
>> usage go
>> >>> >> >> >> wild...).
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>> -rgs
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>> Patrick
>> >>> >> >> >> > >> > >> >>> >>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
>> >>> >> Junqueira
>> >>> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >>> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
>> >>> already,
>> >>> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
>> >>> >> >> >> > >> > >> >>> >>>> there. I
>> >>> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because
>> LETest
>> >>> >> >> >> > >> > >> >>> >>>>>>> failed in the
>> >>> >> >> >> > >> > >> last QA
>> >>> >> >> >> > >> > >> >>> run
>> >>> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't
>> affect
>> >>> >> LETest,
>> >>> >> >> >> > >> > >> >>> >>>>>>> and
>> >>> >> >> >> > in
>> >>> >> >> >> > >> > >> >>> >>>>>>> fact
>> >>> >> >> >> > >> > >> it
>> >>> >> >> >> > >> > >> >>> fails
>> >>> >> >> >> > >> > >> >>> >>>> in
>> >>> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test
>> failure
>> >>> >> doesn't
>> >>> >> >> >> > >> > >> >>> >>>>>>> seem
>> >>> >> >> >> > to
>> >>> >> >> >> > >> > >> >>> >>>>>>> be
>> >>> >> >> >> > >> > >> >>> related
>> >>> >> >> >> > >> > >> >>> >>>> to the
>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't
>> say
>> >>> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
>> >>> >> >> >> > >> > >> about
>> >>> >> >> >> > >> > >> >>> it.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM,
>> Patrick
>> >>> >> Hunt <
>> >>> >> >> >> > >> > >> phunt@apache.org>
>> >>> >> >> >> > >> > >> >>> >>>> wrote:
>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras
>> can
>> >>> get
>> >>> >> >> >> > >> > >> reviewed/finalized
>> >>> >> >> >> > >> > >> >>> before
>> >>> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
>> >>> cutting
>> >>> >> an
>> >>> >> >> >> > >> > >> >>> >>>>>>>>> RC
>> >>> >> >> >> > >> > soonish...
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM,
>> Flavio
>> >>> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira
>> <fp...@yahoo.com.invalid>
>> >>> >> >> wrote:
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
>> >>> versions.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810)
>> and
>> >>> >> ZK-1863
>> >>> >> >> in.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> They are
>> >>> >> >> >> > >> > >> both
>> >>> >> >> >> > >> > >> >>> >>>> patch
>> >>> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it
>> is
>> >>> still
>> >>> >> >> >> > >> > >> >>> >>>>>>> open because we
>> >>> >> >> >> > >> > >> >>> need a
>> >>> >> >> >> > >> > >> >>> >>>> 3.4
>> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick
>> Hunt
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>> >>> >> >> >> > >> > >> >>> wrote:
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking
>> about it
>> >>> for
>> >>> >> a
>> >>> >> >> >> > while, a
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> few
>> >>> >> >> >> > >> > >> >>> people
>> >>> >> >> >> > >> > >> >>> >>>> have
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
>> >>> contacted
>> >>> >> me
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
>> >>> >> >> >> > >> > >> that
>> >>> >> >> >> > >> > >> >>> they
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on
>> the
>> >>> >> first
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
>> >>> >> >> >> > >> > release.
>> >>> >> >> >> > >> > >> Every
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we
>> wait for
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>> >>> >> >> >> > >> > >> never
>> >>> >> >> >> > >> > >> >>> get
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has
>> tons of
>> >>> >> great
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
>> >>> >> >> >> > >> > >> >>> lots of
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
>> >>> release so
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
>> >>> >> >> >> > >> > >> use
>> >>> >> >> >> > >> > >> >>> it,
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty
>> stable
>> >>> except
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
>> >>> >> >> >> > >> > >> >>> flakey
>> >>> >> >> >> > >> > >> >>> >>>> test
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio
>> committed
>> >>> >> today to
>> >>> >> >> >> > >> > trunk.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
>> >>> >> >> >> > >> > >> that
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the
>> >>> code on
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
>> >>> >> >> >> > and
>> >>> >> >> >> > >> > jdk8.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
>> >>> should
>> >>> >> plan
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> our
>> >>> >> >> >> > >> > >> releases:
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>> >>> >> >> >> > for
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> some
>> >>> >> >> >> > >> > >> time.
>> >>> >> >> >> > >> > >> >>> >>>> What I
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is
>> similar to
>> >>> >> what
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> we
>> >>> >> >> >> > did
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for
>> >>> >> >> >> > >> > >> 3.4.
>> >>> >> >> >> > >> > >> >>> >>>> (this is
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did
>> during
>> >>> >> their
>> >>> >> >> >> > Hadoop 2
>> >>> >> >> >> > >> > >> release
>> >>> >> >> >> > >> > >> >>> >>>> cycle)
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha
>> releases,
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
>> >>> >> >> >> > >> > >> can run
>> >>> >> >> >> > >> > >> >>> >>>> and
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
>> >>> blockers
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> and
>> >>> >> >> >> > feel
>> >>> >> >> >> > >> > >> >>> comfortable
>> >>> >> >> >> > >> > >> >>> >>>> with
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
>> >>> switch
>> >>> >> to
>> >>> >> >> >> beta.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
>> >>> >> >> >> > >> > >> get
>> >>> >> >> >> > >> > >> >>> >>>> some
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the
>> alpha/beta
>> >>> >> moniker
>> >>> >> >> >> > >> > and
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
>> >>> >> >> >> > >> > >> >>> making
>> >>> >> >> >> > >> > >> >>> >>>> it
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it
>> will
>> >>> >> become
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>> >>> >> >> >> > >> > >> >>> "current/stable"
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
>> 3.5.1-alpha (3
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
>> >>> >> 3.5.3-beta
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered
>> alpha/beta
>> >>> but
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
>> >>> >> >> >> > >> > >> "stable" vs
>> >>> >> >> >> > >> > >> >>> >>>> 3.4.x,
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we
>> still
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
>> >>> >> >> >> > >> > >> shake
>> >>> >> >> >> > >> > >> >>> >>>> out)
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4
>> releases
>> >>> for
>> >>> >> >> >> > production
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
>> >>> >> >> >> > >> > >> >>> stable,
>> >>> >> >> >> > >> > >> >>> >>>>>>> etc...
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently,
>> are any
>> >>> of
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
>> >>> >> >> >> > >> > >> that
>> >>> >> >> >> > >> > >> >>> >>>> should
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a
>> >>> couple
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>> >>> >> >> >> > >> > >> find
>> >>> >> >> >> > >> > >> >>> >>>> this a
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball
>> >>> >> rolling to
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
>> >>> >> >> >> > an
>> >>> >> >> >> > >> RC.
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >>>>>>
>> >>> >> >> >> > >> > >> >>> >
>> >>> >> >> >> > >> > >> >>>
>> >>> >> >> >> > >> > >>
>> >>> >> >> >> > >>
>> >>> >> >> >> > >>
>> >>> >> >> >> >
>> >>> >> >> >>
>> >>> >> >>
>> >>> >>
>> >>>
>>
>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
yep, I think what happens is that server 3 is becoming leader and not
server 1, so its not completing the reconfig. Let me think about how to
solve this...


On Tue, Jul 22, 2014 at 12:21 PM, Patrick Hunt <ph...@apache.org> wrote:

> Also if you want to submit a patch that provides more insight (logs)
> for that operation/test lmk and I'll be happy to review/commit it.
> Should help with debugging the issue and debugging in the field.
>
> Thanks!
>
> Patrick
>
> On Tue, Jul 22, 2014 at 12:17 PM, Patrick Hunt <ph...@apache.org> wrote:
> > Here's the logs (attached) for the test that failed. Nothing stuck out
> > at me - anything ring a bell?
> >
> > Patrick
> >
> > On Tue, Jul 22, 2014 at 12:10 PM, Alexander Shraer <sh...@gmail.com>
> wrote:
> >> Unfortunately doesn't look like we have enough logging going on there.
> >> For example would be nice to know what's the committed config and last
> seen
> >> config
> >> of the leader when it comes up (leader.lead()). and what configuration
> is
> >> sent in the NEWLEADER message
> >> sent out in LeaderHandler:
> >>
> >>                 QuorumPacket newLeaderQP = new
> >> QuorumPacket(Leader.NEWLEADER,
> >>                         newLeaderZxid,
> >> leader.self.getLastSeenQuorumVerifier()
> >>                                 .toString().getBytes(), null);
> >>
> >>
> >> I didn't know about the option to have a separate administrative
> interface,
> >> and just followed the flow of other commands... I agree that it would be
> >> cleaner.
> >>
> >>
> >>
> >>
> >> On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org>
> wrote:
> >>
> >>> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer <sh...@gmail.com>
> >>> wrote:
> >>> > Hmm. It doesn't really make sense to me - the reconfig should be
> >>> completed
> >>> > before
> >>> > the servers come up and process new ops. We submitted the reconfig to
> >>> > server 1, it timed out
> >>> > on new quorum, but when 1 becomes leader again after 2 restarts 1
> should
> >>> > complete the reconfig.
> >>> > is 1 becoming leader after 2 restarts ?
> >>> >
> >>>
> >>> What should I look for in the logs? Any specific log messages that
> >>> would help debug?
> >>>
> >>> > About admin controls - reconfig/getConfig are open to everyone,
> unless
> >>> you
> >>> > set permissions on the configuration znode being written during
> reconfig.
> >>> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
> >>> >
> >>> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
> >>> > request.authInfo);
> >>> >
> >>>
> >>> So I can turn off all access then? (read and write). Should we ship
> >>> that as the default? We should add that to the docs.
> >>>
> >>> In the past we've always tried to hide this type of information from
> >>> clients (e.g. we don't expose the zk server address to the client for
> >>> a session). This seems like a very big departure. Why didn't we move
> >>> it to a separate, administrative, interface?
> >>>
> >>> Patrick
> >>>
> >>> >
> >>> >
> >>> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com>
> wrote:
> >>> >
> >>> >> Looks like 3 hasn't been removed (unfortunately the assertion
> doesn't
> >>> >> include any msg detail, but that's the way it looks to me like the
> >>> >> test is setup):
> >>> >>
> >>> >>         if (leavingServers != null) {
> >>> >>             for (String leaving : leavingServers)
> >>> >>
> >>> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
> >>> >>         }
> >>> >>
> >>> >> which is called from:
> >>> >>
> >>> >>         qu.restart(2);
> >>> >>         // Now that 2 is back up, they'll complete the reconfig
> >>> removing 3
> >>> >> and
> >>> >>         // can process other ops.
> >>> >>         testServerHasConfig(zkArr[1], null, leavingServers);
> >>> >>
> >>> >> It seems like the problem is that testServerHasConfig is not waiting
> >>> >> for the configuration to be updated? In this case 2 was just
> restarted
> >>> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
> >>> >> which might be why you aren't seeing the issue? hence the
> flakeyness)
> >>> >>
> >>> >> Patrick
> >>> >>
> >>> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <
> shralex@gmail.com>
> >>> >> wrote:
> >>> >> > Hi Patrick, I'm not sure why you're seeing this - it consistently
> >>> passes
> >>> >> on
> >>> >> > my machine. In case you'd like to take a look, the test has tons
> of
> >>> >> > comments explaining the scenario. Let me know how I can help.
> >>> >> >
> >>> >> >
> >>> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
> >>> wrote:
> >>> >> >
> >>> >> >> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum"
> fail
> >>> >> >> multiple times (not every time, but ~50%, so flakey) in the last
> few
> >>> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
> >>> >> >> jenkins, I haven't see any other failures than this during the
> past
> >>> >> >> few days).
> >>> >> >>
> >>> >> >> junit.framework.AssertionFailedError
> >>> >> >> at
> >>> >> >>
> >>> >>
> >>>
> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
> >>> >> >> at
> >>> >> >>
> >>> >>
> >>>
> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
> >>> >> >> at
> >>> >> >>
> >>> >>
> >>>
> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
> >>> >> >>
> >>> >> >> Patrick
> >>> >> >>
> >>> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <
> shralex@gmail.com
> >>> >
> >>> >> >> wrote:
> >>> >> >> > Hi Rakesh,
> >>> >> >> >
> >>> >> >> > Thanks for looking at this. In general even if we find the bug
> >>> since
> >>> >> we
> >>> >> >> > should test it before committing a fix, it seems better to
> remove
> >>> the
> >>> >> >> test
> >>> >> >> > for now and debug this on a build machine. I'm trying to get
> >>> access to
> >>> >> >> it.
> >>> >> >> >
> >>> >> >> > Looking at this log:
> >>> >> >> >
> >>> >> >>
> >>> >>
> >>>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
> >>> >> >> >
> >>> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
> >>> version
> >>> >> >> 200000000
> >>> >> >> > is already being sent around as committed!
> >>> >> >> >
> >>> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
> >>> >> >> >
> >>> >>
> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
> >>> ]
> >>> >> >> > - 2 Received version: 200000000 my version: 0
> >>> >> >> >
> >>> >> >> >
> >>> >> >> > and also in leader election messages.
> >>> >> >> >
> >>> >> >> > Also weird is that the version of 2 is 0 as if it is a joiner,
> >>> >> whereas we
> >>> >> >> > explicitly started it with 100000000.
> >>> >> >> > Then it makes sense that the new config can't be committed
> since
> >>> its
> >>> >> >> > version is not high enough...
> >>> >> >> >
> >>> >> >> > I wonder if its possible that not all servers from the previous
> >>> test
> >>> >> are
> >>> >> >> > dead and they are interfering...
> >>> >> >> >
> >>> >> >> >
> >>> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com>
> >>> wrote:
> >>> >> >> >
> >>> >> >> >> Hi Alex,
> >>> >> >> >>
> >>> >> >> >> Yeah it is consistently passing in my machine also.
> >>> >> >> >>
> >>> >> >> >>
> >>> >> >> >> I have quickly gone through the
> >>> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
> >>> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
> >>> version)
> >>> >> >> has
> >>> >> >> >> not taken and still leader election is seeing 100000000
> (n.config
> >>> >> >> version).
> >>> >> >> >> Unfortunately I didn't find the reason for not considering the
> >>> >> updated
> >>> >> >> >> config version.
> >>> >> >> >>
> >>> >> >> >>
> >>> >> >> >> Reference:
> >>> >> >> >>
> >>> >> >>
> >>> >>
> >>>
> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
> >>> >> >> >>
> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >>> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922]
> -
> >>> >> >> >> Notification time out: 51200
> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
> Notification:
> >>> 2
> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid),
> 0x1
> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
> >>> LOOKING
> >>> >> (my
> >>> >> >> >> state)100000000 (n.config version)
> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
> Notification:
> >>> 2
> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid),
> 0x1
> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
> >>> LOOKING
> >>> >> (my
> >>> >> >> >> state)100000000 (n.config version)
> >>> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
> >>> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922]
> -
> >>> >> >> >> Notification time out: 51200
> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
> Notification:
> >>> 2
> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid),
> 0x1
> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
> >>> LOOKING
> >>> >> (my
> >>> >> >> >> state)100000000 (n.config version)
> >>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] -
> Notification:
> >>> 2
> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid),
> 0x1
> >>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
> >>> LOOKING
> >>> >> (my
> >>> >> >> >> state)100000000 (n.config version)
> >>> >> >> >>
> >>> >> >> >>
> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
> >>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] -
> Notification:
> >>> 2
> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid),
> 0x1
> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
> >>> LOOKING
> >>> >> (my
> >>> >> >> >> state)100000000 (n.config version)
> >>> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
> >>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] -
> Notification:
> >>> 2
> >>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid),
> 0x1
> >>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
> >>> LOOKING
> >>> >> (my
> >>> >> >> >> state)100000000 (n.config version)
> >>> >> >> >>
> >>> >> >> >>
> >>> >> >> >> -Rakesh
> >>> >> >> >>
> >>> >> >> >> -----Original Message-----
> >>> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
> >>> >> >> >> Sent: 22 July 2014 11:57
> >>> >> >> >> To: dev@zookeeper.apache.org
> >>> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
> >>> >> >> >>
> >>> >> >> >> I tried to look into it, but the test consistently passes
> locally
> >>> on
> >>> >> two
> >>> >> >> >> machines.
> >>> >> >> >> I don't currently have access to the build machine, but I can
> try
> >>> to
> >>> >> ask
> >>> >> >> >> for access.
> >>> >> >> >> Unless anyone has a better suggestion, we could remove the
> failing
> >>> >> test
> >>> >> >> in
> >>> >> >> >> the meanwhile and open a JIRA to add it back...
> >>> >> >> >>
> >>> >> >> >>
> >>> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <
> phunt@apache.org>
> >>> >> >> wrote:
> >>> >> >> >>
> >>> >> >> >> > I'm seeing alot of test failures in
> >>> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone
> take a
> >>> >> look?
> >>> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
> >>> >> >> >> >
> >>> >> >> >> >
> >>> >> >> >> >
> >>> >> >>
> >>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
> >>> >> >> >> >
> >>> >>
> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
> >>> >> >> >> > tabpanel#comment-14069024
> >>> >> >> >> >
> >>> >> >> >> > Patrick
> >>> >> >> >> >
> >>> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
> >>> >> >> >> > <ra...@gmail.com> wrote:
> >>> >> >> >> > > lgtm +1
> >>> >> >> >> > >
> >>> >> >> >> > >
> >>> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
> >>> >> >> >> > > <fp...@yahoo.com.invalid>
> >>> >> >> >> > wrote:
> >>> >> >> >> > >
> >>> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
> >>> release, I
> >>> >> >> >> > >> +think
> >>> >> >> >> > 72
> >>> >> >> >> > >> biz hours is enough for the vote.
> >>> >> >> >> > >>
> >>> >> >> >> > >> -Flavio
> >>> >> >> >> > >>
> >>> >> >> >> > >> > -----Original Message-----
> >>> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
> >>> >> >> >> > >> > Sent: 21 July 2014 18:55
> >>> >> >> >> > >> > To: DevZooKeeper
> >>> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> >>> >> >> >> > >> >
> >>> >> >> >> > >> > I fixed a number of issues. I also started a few
> threads
> >>> with
> >>> >> >> >> > >> > builds@
> >>> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and I
> >>> worked
> >>> >> >> >> > through a
> >>> >> >> >> > >> > number of findbugs issues, it's not closed yet but it's
> >>> pretty
> >>> >> >> >> close.
> >>> >> >> >> > >> >
> >>> >> >> >> > >> > I don't see why we can't create an RC and start voting
> this
> >>> >> week
> >>> >> >> >> > though.
> >>> >> >> >> > >> > Anyone disagree?
> >>> >> >> >> > >> >
> >>> >> >> >> > >> > How long should we let the vote run, the std 72 biz
> hours
> >>> or
> >>> >> >> >> > >> > should we
> >>> >> >> >> > >> plan
> >>> >> >> >> > >> > for more to allow folks more time to test?
> >>> >> >> >> > >> >
> >>> >> >> >> > >> > Patrick
> >>> >> >> >> > >> >
> >>> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez
> Segalés
> >>> >> >> >> > >> > <rg...@itevenworks.net> wrote:
> >>> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <
> phunt@apache.org>
> >>> >> wrote:
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK
> >>> jobs -
> >>> >> I'm
> >>> >> >> >> > trying
> >>> >> >> >> > >> > >> to fix some of the jobs that were broken during the
> >>> recent
> >>> >> >> >> > >> > >> host upgrade.
> >>> >> >> >> > >> > >>
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > > How are things looking? Is it likely that we can
> have a
> >>> >> 3.5.0
> >>> >> >> >> > >> > > alpha release week or are we still blocked on
> Jenkins?
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > > -rgs
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >
> >>> >> >> >> > >> > >> Patrick
> >>> >> >> >> > >> > >>
> >>> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> >>> >> >> >> > >> > >> <mi...@cs.stanford.edu>
> >>> >> >> >> > >> > >> wrote:
> >>> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
> >>> >> >> >> > >> > >> >
> >>> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> >>> >> >> >> > >> > >> > <sh...@gmail.com>
> >>> >> >> >> > >> > >> wrote:
> >>> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille
> gave a
> >>> +1
> >>> >> and
> >>> >> >> >> > >> > >> >> all
> >>> >> >> >> > >> > >> subsequent
> >>> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
> >>> >> >> >> > >> > >> >>
> >>> >> >> >> > >> > >> >>
> >>> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
> >>> >> >> >> > >> > >> >> <phunt@apache.org
> >>> >> >> >> > >
> >>> >> >> >> > >> > wrote:
> >>> >> >> >> > >> > >> >>
> >>> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all failing
> due
> >>> to,
> >>> >> >> >> > >> > >> >>> for
> >>> >> >> >> > e.g.
> >>> >> >> >> > >> > >> >>> findbugs issues. At the very least our
> build/test/ci
> >>> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the
> >>> recent
> >>> >> >> >> > >> > >> >>> startServer fix
> >>> >> >> >> > and
> >>> >> >> >> > >> > >> >>> some other flakeys that have been addressed go a
> >>> long
> >>> >> way
> >>> >> >> >> > >> > >> >>> on
> >>> >> >> >> > that
> >>> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should
> be
> >>> >> cleaned
> >>> >> >> >> > >> > >> >>> up before we cut a release. I started a separate
> >>> >> thread to
> >>> >> >> >> > >> > >> >>> discuss
> >>> >> >> >> > >> the
> >>> >> >> >> > >> > findbugs issue.
> >>> >> >> >> > >> > >> >>>
> >>> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> >>> >> >> >> > >> > >> >>>
> >>> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on
> >>> 1919?
> >>> >> >> >> > >> > >> >>>
> >>> >> >> >> > >> > >> >>> Patrick
> >>> >> >> >> > >> > >> >>>
> >>> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio
> Junqueira
> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
> >>> >> >> >> > >> > >> >>> > My take:
> >>> >> >> >> > >> > >> >>> >
> >>> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker
> and
> >>> it
> >>> >> can
> >>> >> >> >> > >> > >> >>> > go
> >>> >> >> >> > in.
> >>> >> >> >> > >> > >> >>> > See
> >>> >> >> >> > >> > >> the
> >>> >> >> >> > >> > >> >>> jira for comments.
> >>> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first
> >>> alpha.
> >>> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha
> depending on
> >>> >> >> >> > >> > >> >>> > ZK-1919
> >>> >> >> >> > and
> >>> >> >> >> > >> > >> ZK-1910,
> >>> >> >> >> > >> > >> >>> we can leave it for the second alpha.
> >>> >> >> >> > >> > >> >>> >
> >>> >> >> >> > >> > >> >>> > If you agree with this, then we should be
> able to
> >>> >> cut a
> >>> >> >> >> > >> > >> >>> > candidate by
> >>> >> >> >> > >> > >> the
> >>> >> >> >> > >> > >> >>> end of this week.
> >>> >> >> >> > >> > >> >>> >
> >>> >> >> >> > >> > >> >>> > -Flavio
> >>> >> >> >> > >> > >> >>> >
> >>> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
> >>> >> >> >> > >> > >> >>> > <ph...@apache.org>
> >>> >> >> >> > >> wrote:
> >>> >> >> >> > >> > >> >>> >
> >>> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c
> client
> >>> >> test
> >>> >> >> >> > >> > >> >>> >> log output
> >>> >> >> >> > >> > >> here
> >>> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
> >>> >> >> >> > >> > >> >>> >>
> >>> >> >> >> > >> > >> >>>
> >>> >> >> >> > >> > >>
> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
> >>> >> >> >> > >> > >> r-
> >>> >> >> >> > >> > trunk
> >>> >> >> >> > >> > >> /2372/
> >>> >> >> >> > >> > >> >>> >>
> >>> >> >> >> > >> > >> >>> >> Patrick
> >>> >> >> >> > >> > >> >>> >>
> >>> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> >>> >> >> >> > >> > >> >>> >> <ph...@apache.org>
> >>> >> >> >> > >> > >> wrote:
> >>> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0
> (not
> >>> >> clear
> >>> >> >> >> > >> > >> >>> >>> to me which
> >>> >> >> >> > >> > >> >>> >>> one(s?) is new?)
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is
> >>> hitting
> >>> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as
> well.
> >>> I've
> >>> >> >> >> > >> > >> >>> >>> updated the "archive" list
> >>> >> >> >> > >> > >> to
> >>> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So
> while it
> >>> >> won't
> >>> >> >> >> > >> > >> >>> >>> go
> >>> >> >> >> > to
> >>> >> >> >> > >> > >> console
> >>> >> >> >> > >> > >> >>> >>> at least we can debug when there is a
> failure.
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for
> the
> >>> >> jetty
> >>> >> >> >> > server
> >>> >> >> >> > >> > >> support
> >>> >> >> >> > >> > >> >>> >>> and it looks like that should be ready soon.
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>> Raul also requested that someone prioritize
> >>> >> reviewing
> >>> >> >> >> > >> > >> "ZOOKEEPER-1919
> >>> >> >> >> > >> > >> >>> >>> Update the C implementation of
> removeWatches to
> >>> >> have
> >>> >> >> >> > >> > >> >>> >>> it
> >>> >> >> >> > >> > match
> >>> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in
> >>> 3.5.0.
> >>> >> >> >> > >> Flavio/Michi?
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the
> flakey c
> >>> >> client
> >>> >> >> >> > >> > >> >>> >>> reconfig
> >>> >> >> >> > >> > >> test -
> >>> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test
> infra!
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like
> we're
> >>> >> pretty
> >>> >> >> >> > close.
> >>> >> >> >> > >> > >> >>> >>> Do
> >>> >> >> >> > >> > >> folks
> >>> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this
> >>> point?
> >>> >> >> >> > >> > >> >>> >>> (with a few
> >>> >> >> >> > >> > >> pending
> >>> >> >> >> > >> > >> >>> >>> as above)
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>> Patrick
> >>> >> >> >> > >> > >> >>> >>>
> >>> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl
> Gutiérrez
> >>> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
> >>> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
> >>> >> >> >> > >> > >> >>> >>>> wrote:
> >>> >> >> >> > >> > >> >>> >>>>>
> >>> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much,
> what if
> >>> we
> >>> >> >> >> > >> > >> >>> >>>>> release
> >>> >> >> >> > an
> >>> >> >> >> > >> > >> >>> >>>>> alpha
> >>> >> >> >> > >> > >> >>> version
> >>> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one
> in
> >>> 2-3
> >>> >> >> >> > >> > >> >>> >>>> weeks
> >>> >> >> >> > time?
> >>> >> >> >> > >> > >> >>> >>>>>
> >>> >> >> >> > >> > >> >>> >>>>
> >>> >> >> >> > >> > >> >>> >>>> +1
> >>> >> >> >> > >> > >> >>> >>>>
> >>> >> >> >> > >> > >> >>> >>>> -rgs
> >>> >> >> >> > >> > >> >>> >>>>
> >>> >> >> >> > >> > >> >>> >>>>> -Flavio
> >>> >> >> >> > >> > >> >>> >>>>>
> >>> >> >> >> > >> > >> >>> >>>>>
> >>> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
> >>> Gutiérrez
> >>> >> >> >> > Segalés <
> >>> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
> >>> >> >> >> > >> > >> >>> >>>>>
> >>> >> >> >> > >> > >> >>> >>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
> >>> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
> >>> >> >> >> > >> > wrote:
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on
> 5.1.0
> >>> >> (from 8
> >>> >> >> >> > >> > >> >>> >>>>>>> in
> >>> >> >> >> > the
> >>> >> >> >> > >> > >> >>> >>>>>>> last
> >>> >> >> >> > >> > >> >>> check).
> >>> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi,
> and
> >>> >> >> >> > >> > >> >>> >>>>>>> Camille is
> >>> >> >> >> > >> > >> threatening
> >>> >> >> >> > >> > >> >>> to
> >>> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress
> in
> >>> >> general
> >>> >> >> >> > >> > >> >>> >>>>>>> on
> >>> >> >> >> > the
> >>> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great
> to
> >>> see.
> >>> >> >> >> > >> > >> >>> >>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might
> consider -
> >>> >> >> >> > >> > >> >>> >>>>>>> should we drop
> >>> >> >> >> > >> > >> jdk6
> >>> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by
> >>> Oracle
> >>> >> >> >> > >> > >> >>> >>>>>>> but I suspect
> >>> >> >> >> > >> > >> some
> >>> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We
> gotta
> >>> move
> >>> >> >> >> > >> > >> >>> >>>>>>> forward though,
> >>> >> >> >> > >> > >> >>> can't
> >>> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that
> we
> >>> are
> >>> >> >> >> > currently
> >>> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7
> and
> >>> 8.
> >>> >> >> >> > >> > >> >>> >>>>>>>
> >>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >>> >> >> >> > >> > >> >>> >>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
> >>> >> >> >> > >> > >> >>> >>>>
> >>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >>> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone
> using
> >>> >> >> >> > >> > >> >>> >>>>>> Observers with the
> >>> >> >> >> > >> > >> >>> upcoming
> >>> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network
> usage go
> >>> >> >> >> wild...).
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>> -rgs
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>> Patrick
> >>> >> >> >> > >> > >> >>> >>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
> >>> >> Junqueira
> >>> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >>> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
> >>> already,
> >>> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
> >>> >> >> >> > >> > >> >>> >>>> there. I
> >>> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because
> LETest
> >>> >> >> >> > >> > >> >>> >>>>>>> failed in the
> >>> >> >> >> > >> > >> last QA
> >>> >> >> >> > >> > >> >>> run
> >>> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't
> affect
> >>> >> LETest,
> >>> >> >> >> > >> > >> >>> >>>>>>> and
> >>> >> >> >> > in
> >>> >> >> >> > >> > >> >>> >>>>>>> fact
> >>> >> >> >> > >> > >> it
> >>> >> >> >> > >> > >> >>> fails
> >>> >> >> >> > >> > >> >>> >>>> in
> >>> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test
> failure
> >>> >> doesn't
> >>> >> >> >> > >> > >> >>> >>>>>>> seem
> >>> >> >> >> > to
> >>> >> >> >> > >> > >> >>> >>>>>>> be
> >>> >> >> >> > >> > >> >>> related
> >>> >> >> >> > >> > >> >>> >>>> to the
> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
> >>> >> >> >> > >> > >> >>> >>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't
> say
> >>> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
> >>> >> >> >> > >> > >> about
> >>> >> >> >> > >> > >> >>> it.
> >>> >> >> >> > >> > >> >>> >>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
> >>> >> >> >> > >> > >> >>> >>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM,
> Patrick
> >>> >> Hunt <
> >>> >> >> >> > >> > >> phunt@apache.org>
> >>> >> >> >> > >> > >> >>> >>>> wrote:
> >>> >> >> >> > >> > >> >>> >>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras
> can
> >>> get
> >>> >> >> >> > >> > >> reviewed/finalized
> >>> >> >> >> > >> > >> >>> before
> >>> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
> >>> cutting
> >>> >> an
> >>> >> >> >> > >> > >> >>> >>>>>>>>> RC
> >>> >> >> >> > >> > soonish...
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM,
> Flavio
> >>> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira
> <fp...@yahoo.com.invalid>
> >>> >> >> wrote:
> >>> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
> >>> versions.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810)
> and
> >>> >> ZK-1863
> >>> >> >> in.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>> They are
> >>> >> >> >> > >> > >> both
> >>> >> >> >> > >> > >> >>> >>>> patch
> >>> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it
> is
> >>> still
> >>> >> >> >> > >> > >> >>> >>>>>>> open because we
> >>> >> >> >> > >> > >> >>> need a
> >>> >> >> >> > >> > >> >>> >>>> 3.4
> >>> >> >> >> > >> > >> >>> >>>>>>> patch.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick
> Hunt
> >>> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
> >>> >> >> >> > >> > >> >>> wrote:
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about
> it
> >>> for
> >>> >> a
> >>> >> >> >> > while, a
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> few
> >>> >> >> >> > >> > >> >>> people
> >>> >> >> >> > >> > >> >>> >>>> have
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
> >>> contacted
> >>> >> me
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
> >>> >> >> >> > >> > >> that
> >>> >> >> >> > >> > >> >>> they
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on
> the
> >>> >> first
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
> >>> >> >> >> > >> > release.
> >>> >> >> >> > >> > >> Every
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait
> for
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
> >>> >> >> >> > >> > >> never
> >>> >> >> >> > >> > >> >>> get
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons
> of
> >>> >> great
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
> >>> >> >> >> > >> > >> >>> lots of
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
> >>> release so
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
> >>> >> >> >> > >> > >> use
> >>> >> >> >> > >> > >> >>> it,
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable
> >>> except
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
> >>> >> >> >> > >> > >> >>> flakey
> >>> >> >> >> > >> > >> >>> >>>> test
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio
> committed
> >>> >> today to
> >>> >> >> >> > >> > trunk.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
> >>> >> >> >> > >> > >> that
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the
> >>> code on
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
> >>> >> >> >> > and
> >>> >> >> >> > >> > jdk8.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
> >>> should
> >>> >> plan
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> our
> >>> >> >> >> > >> > >> releases:
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
> >>> >> >> >> > for
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> some
> >>> >> >> >> > >> > >> time.
> >>> >> >> >> > >> > >> >>> >>>> What I
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is
> similar to
> >>> >> what
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> we
> >>> >> >> >> > did
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> for
> >>> >> >> >> > >> > >> 3.4.
> >>> >> >> >> > >> > >> >>> >>>> (this is
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did
> during
> >>> >> their
> >>> >> >> >> > Hadoop 2
> >>> >> >> >> > >> > >> release
> >>> >> >> >> > >> > >> >>> >>>> cycle)
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha
> releases,
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
> >>> >> >> >> > >> > >> can run
> >>> >> >> >> > >> > >> >>> >>>> and
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
> >>> blockers
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> and
> >>> >> >> >> > feel
> >>> >> >> >> > >> > >> >>> comfortable
> >>> >> >> >> > >> > >> >>> >>>> with
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
> >>> switch
> >>> >> to
> >>> >> >> >> beta.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
> >>> >> >> >> > >> > >> get
> >>> >> >> >> > >> > >> >>> >>>> some
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the
> alpha/beta
> >>> >> moniker
> >>> >> >> >> > >> > and
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
> >>> >> >> >> > >> > >> >>> making
> >>> >> >> >> > >> > >> >>> >>>> it
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it
> will
> >>> >> become
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
> >>> >> >> >> > >> > >> >>> "current/stable"
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
> 3.5.1-alpha (3
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
> >>> >> 3.5.3-beta
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered
> alpha/beta
> >>> but
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
> >>> >> >> >> > >> > >> "stable" vs
> >>> >> >> >> > >> > >> >>> >>>> 3.4.x,
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we
> still
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
> >>> >> >> >> > >> > >> shake
> >>> >> >> >> > >> > >> >>> >>>> out)
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4
> releases
> >>> for
> >>> >> >> >> > production
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
> >>> >> >> >> > >> > >> >>> stable,
> >>> >> >> >> > >> > >> >>> >>>>>>> etc...
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are
> any
> >>> of
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
> >>> >> >> >> > >> > >> that
> >>> >> >> >> > >> > >> >>> >>>> should
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a
> >>> couple
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
> >>> >> >> >> > >> > >> find
> >>> >> >> >> > >> > >> >>> >>>> this a
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball
> >>> >> rolling to
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
> >>> >> >> >> > an
> >>> >> >> >> > >> RC.
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
> >>> >> >> >> > >> > >> >>> >>>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >>>>>>
> >>> >> >> >> > >> > >> >>> >
> >>> >> >> >> > >> > >> >>>
> >>> >> >> >> > >> > >>
> >>> >> >> >> > >>
> >>> >> >> >> > >>
> >>> >> >> >> >
> >>> >> >> >>
> >>> >> >>
> >>> >>
> >>>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Also if you want to submit a patch that provides more insight (logs)
for that operation/test lmk and I'll be happy to review/commit it.
Should help with debugging the issue and debugging in the field.

Thanks!

Patrick

On Tue, Jul 22, 2014 at 12:17 PM, Patrick Hunt <ph...@apache.org> wrote:
> Here's the logs (attached) for the test that failed. Nothing stuck out
> at me - anything ring a bell?
>
> Patrick
>
> On Tue, Jul 22, 2014 at 12:10 PM, Alexander Shraer <sh...@gmail.com> wrote:
>> Unfortunately doesn't look like we have enough logging going on there.
>> For example would be nice to know what's the committed config and last seen
>> config
>> of the leader when it comes up (leader.lead()). and what configuration is
>> sent in the NEWLEADER message
>> sent out in LeaderHandler:
>>
>>                 QuorumPacket newLeaderQP = new
>> QuorumPacket(Leader.NEWLEADER,
>>                         newLeaderZxid,
>> leader.self.getLastSeenQuorumVerifier()
>>                                 .toString().getBytes(), null);
>>
>>
>> I didn't know about the option to have a separate administrative interface,
>> and just followed the flow of other commands... I agree that it would be
>> cleaner.
>>
>>
>>
>>
>> On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer <sh...@gmail.com>
>>> wrote:
>>> > Hmm. It doesn't really make sense to me - the reconfig should be
>>> completed
>>> > before
>>> > the servers come up and process new ops. We submitted the reconfig to
>>> > server 1, it timed out
>>> > on new quorum, but when 1 becomes leader again after 2 restarts 1 should
>>> > complete the reconfig.
>>> > is 1 becoming leader after 2 restarts ?
>>> >
>>>
>>> What should I look for in the logs? Any specific log messages that
>>> would help debug?
>>>
>>> > About admin controls - reconfig/getConfig are open to everyone, unless
>>> you
>>> > set permissions on the configuration znode being written during reconfig.
>>> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
>>> >
>>> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
>>> > request.authInfo);
>>> >
>>>
>>> So I can turn off all access then? (read and write). Should we ship
>>> that as the default? We should add that to the docs.
>>>
>>> In the past we've always tried to hide this type of information from
>>> clients (e.g. we don't expose the zk server address to the client for
>>> a session). This seems like a very big departure. Why didn't we move
>>> it to a separate, administrative, interface?
>>>
>>> Patrick
>>>
>>> >
>>> >
>>> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com> wrote:
>>> >
>>> >> Looks like 3 hasn't been removed (unfortunately the assertion doesn't
>>> >> include any msg detail, but that's the way it looks to me like the
>>> >> test is setup):
>>> >>
>>> >>         if (leavingServers != null) {
>>> >>             for (String leaving : leavingServers)
>>> >>
>>> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>>> >>         }
>>> >>
>>> >> which is called from:
>>> >>
>>> >>         qu.restart(2);
>>> >>         // Now that 2 is back up, they'll complete the reconfig
>>> removing 3
>>> >> and
>>> >>         // can process other ops.
>>> >>         testServerHasConfig(zkArr[1], null, leavingServers);
>>> >>
>>> >> It seems like the problem is that testServerHasConfig is not waiting
>>> >> for the configuration to be updated? In this case 2 was just restarted
>>> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
>>> >> which might be why you aren't seeing the issue? hence the flakeyness)
>>> >>
>>> >> Patrick
>>> >>
>>> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com>
>>> >> wrote:
>>> >> > Hi Patrick, I'm not sure why you're seeing this - it consistently
>>> passes
>>> >> on
>>> >> > my machine. In case you'd like to take a look, the test has tons of
>>> >> > comments explaining the scenario. Let me know how I can help.
>>> >> >
>>> >> >
>>> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
>>> wrote:
>>> >> >
>>> >> >> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
>>> >> >> multiple times (not every time, but ~50%, so flakey) in the last few
>>> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
>>> >> >> jenkins, I haven't see any other failures than this during the past
>>> >> >> few days).
>>> >> >>
>>> >> >> junit.framework.AssertionFailedError
>>> >> >> at
>>> >> >>
>>> >>
>>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>>> >> >> at
>>> >> >>
>>> >>
>>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>>> >> >> at
>>> >> >>
>>> >>
>>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>> >> >>
>>> >> >> Patrick
>>> >> >>
>>> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <shralex@gmail.com
>>> >
>>> >> >> wrote:
>>> >> >> > Hi Rakesh,
>>> >> >> >
>>> >> >> > Thanks for looking at this. In general even if we find the bug
>>> since
>>> >> we
>>> >> >> > should test it before committing a fix, it seems better to remove
>>> the
>>> >> >> test
>>> >> >> > for now and debug this on a build machine. I'm trying to get
>>> access to
>>> >> >> it.
>>> >> >> >
>>> >> >> > Looking at this log:
>>> >> >> >
>>> >> >>
>>> >>
>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>>> >> >> >
>>> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
>>> version
>>> >> >> 200000000
>>> >> >> > is already being sent around as committed!
>>> >> >> >
>>> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>>> >> >> >
>>> >> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
>>> ]
>>> >> >> > - 2 Received version: 200000000 my version: 0
>>> >> >> >
>>> >> >> >
>>> >> >> > and also in leader election messages.
>>> >> >> >
>>> >> >> > Also weird is that the version of 2 is 0 as if it is a joiner,
>>> >> whereas we
>>> >> >> > explicitly started it with 100000000.
>>> >> >> > Then it makes sense that the new config can't be committed since
>>> its
>>> >> >> > version is not high enough...
>>> >> >> >
>>> >> >> > I wonder if its possible that not all servers from the previous
>>> test
>>> >> are
>>> >> >> > dead and they are interfering...
>>> >> >> >
>>> >> >> >
>>> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com>
>>> wrote:
>>> >> >> >
>>> >> >> >> Hi Alex,
>>> >> >> >>
>>> >> >> >> Yeah it is consistently passing in my machine also.
>>> >> >> >>
>>> >> >> >>
>>> >> >> >> I have quickly gone through the
>>> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>>> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
>>> version)
>>> >> >> has
>>> >> >> >> not taken and still leader election is seeing 100000000 (n.config
>>> >> >> version).
>>> >> >> >> Unfortunately I didn't find the reason for not considering the
>>> >> updated
>>> >> >> >> config version.
>>> >> >> >>
>>> >> >> >>
>>> >> >> >> Reference:
>>> >> >> >>
>>> >> >>
>>> >>
>>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>>> >> >> >>
>>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
>>> >> >> >> Notification time out: 51200
>>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification:
>>> 2
>>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>> LOOKING
>>> >> (my
>>> >> >> >> state)100000000 (n.config version)
>>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification:
>>> 2
>>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>> LOOKING
>>> >> (my
>>> >> >> >> state)100000000 (n.config version)
>>> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>>> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
>>> >> >> >> Notification time out: 51200
>>> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification:
>>> 2
>>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>> LOOKING
>>> >> (my
>>> >> >> >> state)100000000 (n.config version)
>>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification:
>>> 2
>>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>>> LOOKING
>>> >> (my
>>> >> >> >> state)100000000 (n.config version)
>>> >> >> >>
>>> >> >> >>
>>> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification:
>>> 2
>>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>> LOOKING
>>> >> (my
>>> >> >> >> state)100000000 (n.config version)
>>> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification:
>>> 2
>>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>>> LOOKING
>>> >> (my
>>> >> >> >> state)100000000 (n.config version)
>>> >> >> >>
>>> >> >> >>
>>> >> >> >> -Rakesh
>>> >> >> >>
>>> >> >> >> -----Original Message-----
>>> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>>> >> >> >> Sent: 22 July 2014 11:57
>>> >> >> >> To: dev@zookeeper.apache.org
>>> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>>> >> >> >>
>>> >> >> >> I tried to look into it, but the test consistently passes locally
>>> on
>>> >> two
>>> >> >> >> machines.
>>> >> >> >> I don't currently have access to the build machine, but I can try
>>> to
>>> >> ask
>>> >> >> >> for access.
>>> >> >> >> Unless anyone has a better suggestion, we could remove the failing
>>> >> test
>>> >> >> in
>>> >> >> >> the meanwhile and open a JIRA to add it back...
>>> >> >> >>
>>> >> >> >>
>>> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
>>> >> >> wrote:
>>> >> >> >>
>>> >> >> >> > I'm seeing alot of test failures in
>>> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone take a
>>> >> look?
>>> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
>>> >> >> >> >
>>> >> >> >> >
>>> >> >> >> >
>>> >> >>
>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>>> >> >> >> >
>>> >> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>>> >> >> >> > tabpanel#comment-14069024
>>> >> >> >> >
>>> >> >> >> > Patrick
>>> >> >> >> >
>>> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>>> >> >> >> > <ra...@gmail.com> wrote:
>>> >> >> >> > > lgtm +1
>>> >> >> >> > >
>>> >> >> >> > >
>>> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>>> >> >> >> > > <fp...@yahoo.com.invalid>
>>> >> >> >> > wrote:
>>> >> >> >> > >
>>> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
>>> release, I
>>> >> >> >> > >> +think
>>> >> >> >> > 72
>>> >> >> >> > >> biz hours is enough for the vote.
>>> >> >> >> > >>
>>> >> >> >> > >> -Flavio
>>> >> >> >> > >>
>>> >> >> >> > >> > -----Original Message-----
>>> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>>> >> >> >> > >> > Sent: 21 July 2014 18:55
>>> >> >> >> > >> > To: DevZooKeeper
>>> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>>> >> >> >> > >> >
>>> >> >> >> > >> > I fixed a number of issues. I also started a few threads
>>> with
>>> >> >> >> > >> > builds@
>>> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and I
>>> worked
>>> >> >> >> > through a
>>> >> >> >> > >> > number of findbugs issues, it's not closed yet but it's
>>> pretty
>>> >> >> >> close.
>>> >> >> >> > >> >
>>> >> >> >> > >> > I don't see why we can't create an RC and start voting this
>>> >> week
>>> >> >> >> > though.
>>> >> >> >> > >> > Anyone disagree?
>>> >> >> >> > >> >
>>> >> >> >> > >> > How long should we let the vote run, the std 72 biz hours
>>> or
>>> >> >> >> > >> > should we
>>> >> >> >> > >> plan
>>> >> >> >> > >> > for more to allow folks more time to test?
>>> >> >> >> > >> >
>>> >> >> >> > >> > Patrick
>>> >> >> >> > >> >
>>> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>>> >> >> >> > >> > <rg...@itevenworks.net> wrote:
>>> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org>
>>> >> wrote:
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK
>>> jobs -
>>> >> I'm
>>> >> >> >> > trying
>>> >> >> >> > >> > >> to fix some of the jobs that were broken during the
>>> recent
>>> >> >> >> > >> > >> host upgrade.
>>> >> >> >> > >> > >>
>>> >> >> >> > >> > >
>>> >> >> >> > >> > > How are things looking? Is it likely that we can have a
>>> >> 3.5.0
>>> >> >> >> > >> > > alpha release week or are we still blocked on Jenkins?
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >
>>> >> >> >> > >> > > -rgs
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >
>>> >> >> >> > >> > >> Patrick
>>> >> >> >> > >> > >>
>>> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>>> >> >> >> > >> > >> <mi...@cs.stanford.edu>
>>> >> >> >> > >> > >> wrote:
>>> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>>> >> >> >> > >> > >> >
>>> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>>> >> >> >> > >> > >> > <sh...@gmail.com>
>>> >> >> >> > >> > >> wrote:
>>> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a
>>> +1
>>> >> and
>>> >> >> >> > >> > >> >> all
>>> >> >> >> > >> > >> subsequent
>>> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>>> >> >> >> > >> > >> >>
>>> >> >> >> > >> > >> >>
>>> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>>> >> >> >> > >> > >> >> <phunt@apache.org
>>> >> >> >> > >
>>> >> >> >> > >> > wrote:
>>> >> >> >> > >> > >> >>
>>> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all failing due
>>> to,
>>> >> >> >> > >> > >> >>> for
>>> >> >> >> > e.g.
>>> >> >> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
>>> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the
>>> recent
>>> >> >> >> > >> > >> >>> startServer fix
>>> >> >> >> > and
>>> >> >> >> > >> > >> >>> some other flakeys that have been addressed go a
>>> long
>>> >> way
>>> >> >> >> > >> > >> >>> on
>>> >> >> >> > that
>>> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should be
>>> >> cleaned
>>> >> >> >> > >> > >> >>> up before we cut a release. I started a separate
>>> >> thread to
>>> >> >> >> > >> > >> >>> discuss
>>> >> >> >> > >> the
>>> >> >> >> > >> > findbugs issue.
>>> >> >> >> > >> > >> >>>
>>> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>>> >> >> >> > >> > >> >>>
>>> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on
>>> 1919?
>>> >> >> >> > >> > >> >>>
>>> >> >> >> > >> > >> >>> Patrick
>>> >> >> >> > >> > >> >>>
>>> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>>> >> >> >> > >> > >> >>> > My take:
>>> >> >> >> > >> > >> >>> >
>>> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and
>>> it
>>> >> can
>>> >> >> >> > >> > >> >>> > go
>>> >> >> >> > in.
>>> >> >> >> > >> > >> >>> > See
>>> >> >> >> > >> > >> the
>>> >> >> >> > >> > >> >>> jira for comments.
>>> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first
>>> alpha.
>>> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
>>> >> >> >> > >> > >> >>> > ZK-1919
>>> >> >> >> > and
>>> >> >> >> > >> > >> ZK-1910,
>>> >> >> >> > >> > >> >>> we can leave it for the second alpha.
>>> >> >> >> > >> > >> >>> >
>>> >> >> >> > >> > >> >>> > If you agree with this, then we should be able to
>>> >> cut a
>>> >> >> >> > >> > >> >>> > candidate by
>>> >> >> >> > >> > >> the
>>> >> >> >> > >> > >> >>> end of this week.
>>> >> >> >> > >> > >> >>> >
>>> >> >> >> > >> > >> >>> > -Flavio
>>> >> >> >> > >> > >> >>> >
>>> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>>> >> >> >> > >> > >> >>> > <ph...@apache.org>
>>> >> >> >> > >> wrote:
>>> >> >> >> > >> > >> >>> >
>>> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c client
>>> >> test
>>> >> >> >> > >> > >> >>> >> log output
>>> >> >> >> > >> > >> here
>>> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
>>> >> >> >> > >> > >> >>> >>
>>> >> >> >> > >> > >> >>>
>>> >> >> >> > >> > >>
>>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>>> >> >> >> > >> > >> r-
>>> >> >> >> > >> > trunk
>>> >> >> >> > >> > >> /2372/
>>> >> >> >> > >> > >> >>> >>
>>> >> >> >> > >> > >> >>> >> Patrick
>>> >> >> >> > >> > >> >>> >>
>>> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>>> >> >> >> > >> > >> >>> >> <ph...@apache.org>
>>> >> >> >> > >> > >> wrote:
>>> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not
>>> >> clear
>>> >> >> >> > >> > >> >>> >>> to me which
>>> >> >> >> > >> > >> >>> >>> one(s?) is new?)
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is
>>> hitting
>>> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well.
>>> I've
>>> >> >> >> > >> > >> >>> >>> updated the "archive" list
>>> >> >> >> > >> > >> to
>>> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it
>>> >> won't
>>> >> >> >> > >> > >> >>> >>> go
>>> >> >> >> > to
>>> >> >> >> > >> > >> console
>>> >> >> >> > >> > >> >>> >>> at least we can debug when there is a failure.
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the
>>> >> jetty
>>> >> >> >> > server
>>> >> >> >> > >> > >> support
>>> >> >> >> > >> > >> >>> >>> and it looks like that should be ready soon.
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>> Raul also requested that someone prioritize
>>> >> reviewing
>>> >> >> >> > >> > >> "ZOOKEEPER-1919
>>> >> >> >> > >> > >> >>> >>> Update the C implementation of removeWatches to
>>> >> have
>>> >> >> >> > >> > >> >>> >>> it
>>> >> >> >> > >> > match
>>> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in
>>> 3.5.0.
>>> >> >> >> > >> Flavio/Michi?
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c
>>> >> client
>>> >> >> >> > >> > >> >>> >>> reconfig
>>> >> >> >> > >> > >> test -
>>> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like we're
>>> >> pretty
>>> >> >> >> > close.
>>> >> >> >> > >> > >> >>> >>> Do
>>> >> >> >> > >> > >> folks
>>> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this
>>> point?
>>> >> >> >> > >> > >> >>> >>> (with a few
>>> >> >> >> > >> > >> pending
>>> >> >> >> > >> > >> >>> >>> as above)
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>> Patrick
>>> >> >> >> > >> > >> >>> >>>
>>> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
>>> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>>> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>>> >> >> >> > >> > >> >>> >>>> wrote:
>>> >> >> >> > >> > >> >>> >>>>>
>>> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if
>>> we
>>> >> >> >> > >> > >> >>> >>>>> release
>>> >> >> >> > an
>>> >> >> >> > >> > >> >>> >>>>> alpha
>>> >> >> >> > >> > >> >>> version
>>> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in
>>> 2-3
>>> >> >> >> > >> > >> >>> >>>> weeks
>>> >> >> >> > time?
>>> >> >> >> > >> > >> >>> >>>>>
>>> >> >> >> > >> > >> >>> >>>>
>>> >> >> >> > >> > >> >>> >>>> +1
>>> >> >> >> > >> > >> >>> >>>>
>>> >> >> >> > >> > >> >>> >>>> -rgs
>>> >> >> >> > >> > >> >>> >>>>
>>> >> >> >> > >> > >> >>> >>>>> -Flavio
>>> >> >> >> > >> > >> >>> >>>>>
>>> >> >> >> > >> > >> >>> >>>>>
>>> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
>>> Gutiérrez
>>> >> >> >> > Segalés <
>>> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>>> >> >> >> > >> > >> >>> >>>>>
>>> >> >> >> > >> > >> >>> >>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>>> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>>> >> >> >> > >> > wrote:
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0
>>> >> (from 8
>>> >> >> >> > >> > >> >>> >>>>>>> in
>>> >> >> >> > the
>>> >> >> >> > >> > >> >>> >>>>>>> last
>>> >> >> >> > >> > >> >>> check).
>>> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
>>> >> >> >> > >> > >> >>> >>>>>>> Camille is
>>> >> >> >> > >> > >> threatening
>>> >> >> >> > >> > >> >>> to
>>> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in
>>> >> general
>>> >> >> >> > >> > >> >>> >>>>>>> on
>>> >> >> >> > the
>>> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to
>>> see.
>>> >> >> >> > >> > >> >>> >>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
>>> >> >> >> > >> > >> >>> >>>>>>> should we drop
>>> >> >> >> > >> > >> jdk6
>>> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by
>>> Oracle
>>> >> >> >> > >> > >> >>> >>>>>>> but I suspect
>>> >> >> >> > >> > >> some
>>> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta
>>> move
>>> >> >> >> > >> > >> >>> >>>>>>> forward though,
>>> >> >> >> > >> > >> >>> can't
>>> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we
>>> are
>>> >> >> >> > currently
>>> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and
>>> 8.
>>> >> >> >> > >> > >> >>> >>>>>>>
>>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>> >> >> >> > >> > >> >>> >>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
>>> >> >> >> > >> > >> >>> >>>>
>>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
>>> >> >> >> > >> > >> >>> >>>>>> Observers with the
>>> >> >> >> > >> > >> >>> upcoming
>>> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
>>> >> >> >> wild...).
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>> -rgs
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>> Patrick
>>> >> >> >> > >> > >> >>> >>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
>>> >> Junqueira
>>> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>>> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
>>> already,
>>> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
>>> >> >> >> > >> > >> >>> >>>> there. I
>>> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
>>> >> >> >> > >> > >> >>> >>>>>>> failed in the
>>> >> >> >> > >> > >> last QA
>>> >> >> >> > >> > >> >>> run
>>> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect
>>> >> LETest,
>>> >> >> >> > >> > >> >>> >>>>>>> and
>>> >> >> >> > in
>>> >> >> >> > >> > >> >>> >>>>>>> fact
>>> >> >> >> > >> > >> it
>>> >> >> >> > >> > >> >>> fails
>>> >> >> >> > >> > >> >>> >>>> in
>>> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure
>>> >> doesn't
>>> >> >> >> > >> > >> >>> >>>>>>> seem
>>> >> >> >> > to
>>> >> >> >> > >> > >> >>> >>>>>>> be
>>> >> >> >> > >> > >> >>> related
>>> >> >> >> > >> > >> >>> >>>> to the
>>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
>>> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
>>> >> >> >> > >> > >> about
>>> >> >> >> > >> > >> >>> it.
>>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
>>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick
>>> >> Hunt <
>>> >> >> >> > >> > >> phunt@apache.org>
>>> >> >> >> > >> > >> >>> >>>> wrote:
>>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can
>>> get
>>> >> >> >> > >> > >> reviewed/finalized
>>> >> >> >> > >> > >> >>> before
>>> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
>>> cutting
>>> >> an
>>> >> >> >> > >> > >> >>> >>>>>>>>> RC
>>> >> >> >> > >> > soonish...
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
>>> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
>>> >> >> wrote:
>>> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
>>> versions.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and
>>> >> ZK-1863
>>> >> >> in.
>>> >> >> >> > >> > >> >>> >>>>>>>>>> They are
>>> >> >> >> > >> > >> both
>>> >> >> >> > >> > >> >>> >>>> patch
>>> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is
>>> still
>>> >> >> >> > >> > >> >>> >>>>>>> open because we
>>> >> >> >> > >> > >> >>> need a
>>> >> >> >> > >> > >> >>> >>>> 3.4
>>> >> >> >> > >> > >> >>> >>>>>>> patch.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
>>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>>> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>>> >> >> >> > >> > >> >>> wrote:
>>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it
>>> for
>>> >> a
>>> >> >> >> > while, a
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> few
>>> >> >> >> > >> > >> >>> people
>>> >> >> >> > >> > >> >>> >>>> have
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
>>> contacted
>>> >> me
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
>>> >> >> >> > >> > >> that
>>> >> >> >> > >> > >> >>> they
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the
>>> >> first
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
>>> >> >> >> > >> > release.
>>> >> >> >> > >> > >> Every
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>>> >> >> >> > >> > >> never
>>> >> >> >> > >> > >> >>> get
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of
>>> >> great
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
>>> >> >> >> > >> > >> >>> lots of
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
>>> release so
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
>>> >> >> >> > >> > >> use
>>> >> >> >> > >> > >> >>> it,
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable
>>> except
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
>>> >> >> >> > >> > >> >>> flakey
>>> >> >> >> > >> > >> >>> >>>> test
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed
>>> >> today to
>>> >> >> >> > >> > trunk.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
>>> >> >> >> > >> > >> that
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the
>>> code on
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
>>> >> >> >> > and
>>> >> >> >> > >> > jdk8.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
>>> should
>>> >> plan
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> our
>>> >> >> >> > >> > >> releases:
>>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>>> >> >> >> > for
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> some
>>> >> >> >> > >> > >> time.
>>> >> >> >> > >> > >> >>> >>>> What I
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to
>>> >> what
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> we
>>> >> >> >> > did
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> for
>>> >> >> >> > >> > >> 3.4.
>>> >> >> >> > >> > >> >>> >>>> (this is
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during
>>> >> their
>>> >> >> >> > Hadoop 2
>>> >> >> >> > >> > >> release
>>> >> >> >> > >> > >> >>> >>>> cycle)
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
>>> >> >> >> > >> > >> can run
>>> >> >> >> > >> > >> >>> >>>> and
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
>>> blockers
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> and
>>> >> >> >> > feel
>>> >> >> >> > >> > >> >>> comfortable
>>> >> >> >> > >> > >> >>> >>>> with
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
>>> switch
>>> >> to
>>> >> >> >> beta.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
>>> >> >> >> > >> > >> get
>>> >> >> >> > >> > >> >>> >>>> some
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta
>>> >> moniker
>>> >> >> >> > >> > and
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
>>> >> >> >> > >> > >> >>> making
>>> >> >> >> > >> > >> >>> >>>> it
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will
>>> >> become
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>>> >> >> >> > >> > >> >>> "current/stable"
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
>>> >> 3.5.3-beta
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta
>>> but
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
>>> >> >> >> > >> > >> "stable" vs
>>> >> >> >> > >> > >> >>> >>>> 3.4.x,
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
>>> >> >> >> > >> > >> shake
>>> >> >> >> > >> > >> >>> >>>> out)
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases
>>> for
>>> >> >> >> > production
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
>>> >> >> >> > >> > >> >>> stable,
>>> >> >> >> > >> > >> >>> >>>>>>> etc...
>>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any
>>> of
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
>>> >> >> >> > >> > >> that
>>> >> >> >> > >> > >> >>> >>>> should
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a
>>> couple
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>>> >> >> >> > >> > >> find
>>> >> >> >> > >> > >> >>> >>>> this a
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball
>>> >> rolling to
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
>>> >> >> >> > an
>>> >> >> >> > >> RC.
>>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
>>> >> >> >> > >> > >> >>> >>>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >>>>>>
>>> >> >> >> > >> > >> >>> >
>>> >> >> >> > >> > >> >>>
>>> >> >> >> > >> > >>
>>> >> >> >> > >>
>>> >> >> >> > >>
>>> >> >> >> >
>>> >> >> >>
>>> >> >>
>>> >>
>>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Here's the logs (attached) for the test that failed. Nothing stuck out
at me - anything ring a bell?

Patrick

On Tue, Jul 22, 2014 at 12:10 PM, Alexander Shraer <sh...@gmail.com> wrote:
> Unfortunately doesn't look like we have enough logging going on there.
> For example would be nice to know what's the committed config and last seen
> config
> of the leader when it comes up (leader.lead()). and what configuration is
> sent in the NEWLEADER message
> sent out in LeaderHandler:
>
>                 QuorumPacket newLeaderQP = new
> QuorumPacket(Leader.NEWLEADER,
>                         newLeaderZxid,
> leader.self.getLastSeenQuorumVerifier()
>                                 .toString().getBytes(), null);
>
>
> I didn't know about the option to have a separate administrative interface,
> and just followed the flow of other commands... I agree that it would be
> cleaner.
>
>
>
>
> On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org> wrote:
>
>> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer <sh...@gmail.com>
>> wrote:
>> > Hmm. It doesn't really make sense to me - the reconfig should be
>> completed
>> > before
>> > the servers come up and process new ops. We submitted the reconfig to
>> > server 1, it timed out
>> > on new quorum, but when 1 becomes leader again after 2 restarts 1 should
>> > complete the reconfig.
>> > is 1 becoming leader after 2 restarts ?
>> >
>>
>> What should I look for in the logs? Any specific log messages that
>> would help debug?
>>
>> > About admin controls - reconfig/getConfig are open to everyone, unless
>> you
>> > set permissions on the configuration znode being written during reconfig.
>> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
>> >
>> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
>> > request.authInfo);
>> >
>>
>> So I can turn off all access then? (read and write). Should we ship
>> that as the default? We should add that to the docs.
>>
>> In the past we've always tried to hide this type of information from
>> clients (e.g. we don't expose the zk server address to the client for
>> a session). This seems like a very big departure. Why didn't we move
>> it to a separate, administrative, interface?
>>
>> Patrick
>>
>> >
>> >
>> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com> wrote:
>> >
>> >> Looks like 3 hasn't been removed (unfortunately the assertion doesn't
>> >> include any msg detail, but that's the way it looks to me like the
>> >> test is setup):
>> >>
>> >>         if (leavingServers != null) {
>> >>             for (String leaving : leavingServers)
>> >>
>> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>> >>         }
>> >>
>> >> which is called from:
>> >>
>> >>         qu.restart(2);
>> >>         // Now that 2 is back up, they'll complete the reconfig
>> removing 3
>> >> and
>> >>         // can process other ops.
>> >>         testServerHasConfig(zkArr[1], null, leavingServers);
>> >>
>> >> It seems like the problem is that testServerHasConfig is not waiting
>> >> for the configuration to be updated? In this case 2 was just restarted
>> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
>> >> which might be why you aren't seeing the issue? hence the flakeyness)
>> >>
>> >> Patrick
>> >>
>> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com>
>> >> wrote:
>> >> > Hi Patrick, I'm not sure why you're seeing this - it consistently
>> passes
>> >> on
>> >> > my machine. In case you'd like to take a look, the test has tons of
>> >> > comments explaining the scenario. Let me know how I can help.
>> >> >
>> >> >
>> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
>> wrote:
>> >> >
>> >> >> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
>> >> >> multiple times (not every time, but ~50%, so flakey) in the last few
>> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
>> >> >> jenkins, I haven't see any other failures than this during the past
>> >> >> few days).
>> >> >>
>> >> >> junit.framework.AssertionFailedError
>> >> >> at
>> >> >>
>> >>
>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>> >> >> at
>> >> >>
>> >>
>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>> >> >> at
>> >> >>
>> >>
>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>> >> >>
>> >> >> Patrick
>> >> >>
>> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <shralex@gmail.com
>> >
>> >> >> wrote:
>> >> >> > Hi Rakesh,
>> >> >> >
>> >> >> > Thanks for looking at this. In general even if we find the bug
>> since
>> >> we
>> >> >> > should test it before committing a fix, it seems better to remove
>> the
>> >> >> test
>> >> >> > for now and debug this on a build machine. I'm trying to get
>> access to
>> >> >> it.
>> >> >> >
>> >> >> > Looking at this log:
>> >> >> >
>> >> >>
>> >>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>> >> >> >
>> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
>> version
>> >> >> 200000000
>> >> >> > is already being sent around as committed!
>> >> >> >
>> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>> >> >> >
>> >> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
>> ]
>> >> >> > - 2 Received version: 200000000 my version: 0
>> >> >> >
>> >> >> >
>> >> >> > and also in leader election messages.
>> >> >> >
>> >> >> > Also weird is that the version of 2 is 0 as if it is a joiner,
>> >> whereas we
>> >> >> > explicitly started it with 100000000.
>> >> >> > Then it makes sense that the new config can't be committed since
>> its
>> >> >> > version is not high enough...
>> >> >> >
>> >> >> > I wonder if its possible that not all servers from the previous
>> test
>> >> are
>> >> >> > dead and they are interfering...
>> >> >> >
>> >> >> >
>> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com>
>> wrote:
>> >> >> >
>> >> >> >> Hi Alex,
>> >> >> >>
>> >> >> >> Yeah it is consistently passing in my machine also.
>> >> >> >>
>> >> >> >>
>> >> >> >> I have quickly gone through the
>> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
>> version)
>> >> >> has
>> >> >> >> not taken and still leader election is seeing 100000000 (n.config
>> >> >> version).
>> >> >> >> Unfortunately I didn't find the reason for not considering the
>> >> updated
>> >> >> >> config version.
>> >> >> >>
>> >> >> >>
>> >> >> >> Reference:
>> >> >> >>
>> >> >>
>> >>
>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>> >> >> >>
>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
>> >> >> >> Notification time out: 51200
>> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification:
>> 2
>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>> LOOKING
>> >> (my
>> >> >> >> state)100000000 (n.config version)
>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification:
>> 2
>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>> LOOKING
>> >> (my
>> >> >> >> state)100000000 (n.config version)
>> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
>> >> >> >> Notification time out: 51200
>> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification:
>> 2
>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>> LOOKING
>> >> (my
>> >> >> >> state)100000000 (n.config version)
>> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification:
>> 2
>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
>> LOOKING
>> >> (my
>> >> >> >> state)100000000 (n.config version)
>> >> >> >>
>> >> >> >>
>> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification:
>> 2
>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>> LOOKING
>> >> (my
>> >> >> >> state)100000000 (n.config version)
>> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification:
>> 2
>> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
>> LOOKING
>> >> (my
>> >> >> >> state)100000000 (n.config version)
>> >> >> >>
>> >> >> >>
>> >> >> >> -Rakesh
>> >> >> >>
>> >> >> >> -----Original Message-----
>> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>> >> >> >> Sent: 22 July 2014 11:57
>> >> >> >> To: dev@zookeeper.apache.org
>> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >> >> >>
>> >> >> >> I tried to look into it, but the test consistently passes locally
>> on
>> >> two
>> >> >> >> machines.
>> >> >> >> I don't currently have access to the build machine, but I can try
>> to
>> >> ask
>> >> >> >> for access.
>> >> >> >> Unless anyone has a better suggestion, we could remove the failing
>> >> test
>> >> >> in
>> >> >> >> the meanwhile and open a JIRA to add it back...
>> >> >> >>
>> >> >> >>
>> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
>> >> >> wrote:
>> >> >> >>
>> >> >> >> > I'm seeing alot of test failures in
>> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone take a
>> >> look?
>> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
>> >> >> >> >
>> >> >> >> >
>> >> >> >> >
>> >> >>
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>> >> >> >> >
>> >> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>> >> >> >> > tabpanel#comment-14069024
>> >> >> >> >
>> >> >> >> > Patrick
>> >> >> >> >
>> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>> >> >> >> > <ra...@gmail.com> wrote:
>> >> >> >> > > lgtm +1
>> >> >> >> > >
>> >> >> >> > >
>> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>> >> >> >> > > <fp...@yahoo.com.invalid>
>> >> >> >> > wrote:
>> >> >> >> > >
>> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
>> release, I
>> >> >> >> > >> +think
>> >> >> >> > 72
>> >> >> >> > >> biz hours is enough for the vote.
>> >> >> >> > >>
>> >> >> >> > >> -Flavio
>> >> >> >> > >>
>> >> >> >> > >> > -----Original Message-----
>> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>> >> >> >> > >> > Sent: 21 July 2014 18:55
>> >> >> >> > >> > To: DevZooKeeper
>> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >> >> >> > >> >
>> >> >> >> > >> > I fixed a number of issues. I also started a few threads
>> with
>> >> >> >> > >> > builds@
>> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and I
>> worked
>> >> >> >> > through a
>> >> >> >> > >> > number of findbugs issues, it's not closed yet but it's
>> pretty
>> >> >> >> close.
>> >> >> >> > >> >
>> >> >> >> > >> > I don't see why we can't create an RC and start voting this
>> >> week
>> >> >> >> > though.
>> >> >> >> > >> > Anyone disagree?
>> >> >> >> > >> >
>> >> >> >> > >> > How long should we let the vote run, the std 72 biz hours
>> or
>> >> >> >> > >> > should we
>> >> >> >> > >> plan
>> >> >> >> > >> > for more to allow folks more time to test?
>> >> >> >> > >> >
>> >> >> >> > >> > Patrick
>> >> >> >> > >> >
>> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>> >> >> >> > >> > <rg...@itevenworks.net> wrote:
>> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org>
>> >> wrote:
>> >> >> >> > >> > >
>> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK
>> jobs -
>> >> I'm
>> >> >> >> > trying
>> >> >> >> > >> > >> to fix some of the jobs that were broken during the
>> recent
>> >> >> >> > >> > >> host upgrade.
>> >> >> >> > >> > >>
>> >> >> >> > >> > >
>> >> >> >> > >> > > How are things looking? Is it likely that we can have a
>> >> 3.5.0
>> >> >> >> > >> > > alpha release week or are we still blocked on Jenkins?
>> >> >> >> > >> > >
>> >> >> >> > >> > >
>> >> >> >> > >> > > -rgs
>> >> >> >> > >> > >
>> >> >> >> > >> > >
>> >> >> >> > >> > >
>> >> >> >> > >> > >
>> >> >> >> > >> > >
>> >> >> >> > >> > >
>> >> >> >> > >> > >> Patrick
>> >> >> >> > >> > >>
>> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>> >> >> >> > >> > >> <mi...@cs.stanford.edu>
>> >> >> >> > >> > >> wrote:
>> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>> >> >> >> > >> > >> >
>> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>> >> >> >> > >> > >> > <sh...@gmail.com>
>> >> >> >> > >> > >> wrote:
>> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a
>> +1
>> >> and
>> >> >> >> > >> > >> >> all
>> >> >> >> > >> > >> subsequent
>> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>> >> >> >> > >> > >> >>
>> >> >> >> > >> > >> >>
>> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>> >> >> >> > >> > >> >> <phunt@apache.org
>> >> >> >> > >
>> >> >> >> > >> > wrote:
>> >> >> >> > >> > >> >>
>> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all failing due
>> to,
>> >> >> >> > >> > >> >>> for
>> >> >> >> > e.g.
>> >> >> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
>> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the
>> recent
>> >> >> >> > >> > >> >>> startServer fix
>> >> >> >> > and
>> >> >> >> > >> > >> >>> some other flakeys that have been addressed go a
>> long
>> >> way
>> >> >> >> > >> > >> >>> on
>> >> >> >> > that
>> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should be
>> >> cleaned
>> >> >> >> > >> > >> >>> up before we cut a release. I started a separate
>> >> thread to
>> >> >> >> > >> > >> >>> discuss
>> >> >> >> > >> the
>> >> >> >> > >> > findbugs issue.
>> >> >> >> > >> > >> >>>
>> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>> >> >> >> > >> > >> >>>
>> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on
>> 1919?
>> >> >> >> > >> > >> >>>
>> >> >> >> > >> > >> >>> Patrick
>> >> >> >> > >> > >> >>>
>> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>> >> >> >> > >> > >> >>> > My take:
>> >> >> >> > >> > >> >>> >
>> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and
>> it
>> >> can
>> >> >> >> > >> > >> >>> > go
>> >> >> >> > in.
>> >> >> >> > >> > >> >>> > See
>> >> >> >> > >> > >> the
>> >> >> >> > >> > >> >>> jira for comments.
>> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first
>> alpha.
>> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
>> >> >> >> > >> > >> >>> > ZK-1919
>> >> >> >> > and
>> >> >> >> > >> > >> ZK-1910,
>> >> >> >> > >> > >> >>> we can leave it for the second alpha.
>> >> >> >> > >> > >> >>> >
>> >> >> >> > >> > >> >>> > If you agree with this, then we should be able to
>> >> cut a
>> >> >> >> > >> > >> >>> > candidate by
>> >> >> >> > >> > >> the
>> >> >> >> > >> > >> >>> end of this week.
>> >> >> >> > >> > >> >>> >
>> >> >> >> > >> > >> >>> > -Flavio
>> >> >> >> > >> > >> >>> >
>> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>> >> >> >> > >> > >> >>> > <ph...@apache.org>
>> >> >> >> > >> wrote:
>> >> >> >> > >> > >> >>> >
>> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c client
>> >> test
>> >> >> >> > >> > >> >>> >> log output
>> >> >> >> > >> > >> here
>> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
>> >> >> >> > >> > >> >>> >>
>> >> >> >> > >> > >> >>>
>> >> >> >> > >> > >>
>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>> >> >> >> > >> > >> r-
>> >> >> >> > >> > trunk
>> >> >> >> > >> > >> /2372/
>> >> >> >> > >> > >> >>> >>
>> >> >> >> > >> > >> >>> >> Patrick
>> >> >> >> > >> > >> >>> >>
>> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>> >> >> >> > >> > >> >>> >> <ph...@apache.org>
>> >> >> >> > >> > >> wrote:
>> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not
>> >> clear
>> >> >> >> > >> > >> >>> >>> to me which
>> >> >> >> > >> > >> >>> >>> one(s?) is new?)
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is
>> hitting
>> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well.
>> I've
>> >> >> >> > >> > >> >>> >>> updated the "archive" list
>> >> >> >> > >> > >> to
>> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it
>> >> won't
>> >> >> >> > >> > >> >>> >>> go
>> >> >> >> > to
>> >> >> >> > >> > >> console
>> >> >> >> > >> > >> >>> >>> at least we can debug when there is a failure.
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the
>> >> jetty
>> >> >> >> > server
>> >> >> >> > >> > >> support
>> >> >> >> > >> > >> >>> >>> and it looks like that should be ready soon.
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>> Raul also requested that someone prioritize
>> >> reviewing
>> >> >> >> > >> > >> "ZOOKEEPER-1919
>> >> >> >> > >> > >> >>> >>> Update the C implementation of removeWatches to
>> >> have
>> >> >> >> > >> > >> >>> >>> it
>> >> >> >> > >> > match
>> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in
>> 3.5.0.
>> >> >> >> > >> Flavio/Michi?
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c
>> >> client
>> >> >> >> > >> > >> >>> >>> reconfig
>> >> >> >> > >> > >> test -
>> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like we're
>> >> pretty
>> >> >> >> > close.
>> >> >> >> > >> > >> >>> >>> Do
>> >> >> >> > >> > >> folks
>> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this
>> point?
>> >> >> >> > >> > >> >>> >>> (with a few
>> >> >> >> > >> > >> pending
>> >> >> >> > >> > >> >>> >>> as above)
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>> Patrick
>> >> >> >> > >> > >> >>> >>>
>> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
>> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>> >> >> >> > >> > >> >>> >>>> wrote:
>> >> >> >> > >> > >> >>> >>>>>
>> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if
>> we
>> >> >> >> > >> > >> >>> >>>>> release
>> >> >> >> > an
>> >> >> >> > >> > >> >>> >>>>> alpha
>> >> >> >> > >> > >> >>> version
>> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in
>> 2-3
>> >> >> >> > >> > >> >>> >>>> weeks
>> >> >> >> > time?
>> >> >> >> > >> > >> >>> >>>>>
>> >> >> >> > >> > >> >>> >>>>
>> >> >> >> > >> > >> >>> >>>> +1
>> >> >> >> > >> > >> >>> >>>>
>> >> >> >> > >> > >> >>> >>>> -rgs
>> >> >> >> > >> > >> >>> >>>>
>> >> >> >> > >> > >> >>> >>>>> -Flavio
>> >> >> >> > >> > >> >>> >>>>>
>> >> >> >> > >> > >> >>> >>>>>
>> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
>> Gutiérrez
>> >> >> >> > Segalés <
>> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>> >> >> >> > >> > >> >>> >>>>>
>> >> >> >> > >> > >> >>> >>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>> >> >> >> > >> > wrote:
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0
>> >> (from 8
>> >> >> >> > >> > >> >>> >>>>>>> in
>> >> >> >> > the
>> >> >> >> > >> > >> >>> >>>>>>> last
>> >> >> >> > >> > >> >>> check).
>> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
>> >> >> >> > >> > >> >>> >>>>>>> Camille is
>> >> >> >> > >> > >> threatening
>> >> >> >> > >> > >> >>> to
>> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in
>> >> general
>> >> >> >> > >> > >> >>> >>>>>>> on
>> >> >> >> > the
>> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to
>> see.
>> >> >> >> > >> > >> >>> >>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
>> >> >> >> > >> > >> >>> >>>>>>> should we drop
>> >> >> >> > >> > >> jdk6
>> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by
>> Oracle
>> >> >> >> > >> > >> >>> >>>>>>> but I suspect
>> >> >> >> > >> > >> some
>> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta
>> move
>> >> >> >> > >> > >> >>> >>>>>>> forward though,
>> >> >> >> > >> > >> >>> can't
>> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we
>> are
>> >> >> >> > currently
>> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and
>> 8.
>> >> >> >> > >> > >> >>> >>>>>>>
>> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >> >> >> > >> > >> >>> >>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
>> >> >> >> > >> > >> >>> >>>>
>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
>> >> >> >> > >> > >> >>> >>>>>> Observers with the
>> >> >> >> > >> > >> >>> upcoming
>> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
>> >> >> >> wild...).
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>> -rgs
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>> Patrick
>> >> >> >> > >> > >> >>> >>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
>> >> Junqueira
>> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
>> already,
>> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
>> >> >> >> > >> > >> >>> >>>> there. I
>> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
>> >> >> >> > >> > >> >>> >>>>>>> failed in the
>> >> >> >> > >> > >> last QA
>> >> >> >> > >> > >> >>> run
>> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect
>> >> LETest,
>> >> >> >> > >> > >> >>> >>>>>>> and
>> >> >> >> > in
>> >> >> >> > >> > >> >>> >>>>>>> fact
>> >> >> >> > >> > >> it
>> >> >> >> > >> > >> >>> fails
>> >> >> >> > >> > >> >>> >>>> in
>> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure
>> >> doesn't
>> >> >> >> > >> > >> >>> >>>>>>> seem
>> >> >> >> > to
>> >> >> >> > >> > >> >>> >>>>>>> be
>> >> >> >> > >> > >> >>> related
>> >> >> >> > >> > >> >>> >>>> to the
>> >> >> >> > >> > >> >>> >>>>>>> patch.
>> >> >> >> > >> > >> >>> >>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
>> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
>> >> >> >> > >> > >> about
>> >> >> >> > >> > >> >>> it.
>> >> >> >> > >> > >> >>> >>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
>> >> >> >> > >> > >> >>> >>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick
>> >> Hunt <
>> >> >> >> > >> > >> phunt@apache.org>
>> >> >> >> > >> > >> >>> >>>> wrote:
>> >> >> >> > >> > >> >>> >>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can
>> get
>> >> >> >> > >> > >> reviewed/finalized
>> >> >> >> > >> > >> >>> before
>> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
>> cutting
>> >> an
>> >> >> >> > >> > >> >>> >>>>>>>>> RC
>> >> >> >> > >> > soonish...
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
>> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
>> >> >> wrote:
>> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
>> versions.
>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and
>> >> ZK-1863
>> >> >> in.
>> >> >> >> > >> > >> >>> >>>>>>>>>> They are
>> >> >> >> > >> > >> both
>> >> >> >> > >> > >> >>> >>>> patch
>> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is
>> still
>> >> >> >> > >> > >> >>> >>>>>>> open because we
>> >> >> >> > >> > >> >>> need a
>> >> >> >> > >> > >> >>> >>>> 3.4
>> >> >> >> > >> > >> >>> >>>>>>> patch.
>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>> >> >> >> > >> > >> >>> wrote:
>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it
>> for
>> >> a
>> >> >> >> > while, a
>> >> >> >> > >> > >> >>> >>>>>>>>>>> few
>> >> >> >> > >> > >> >>> people
>> >> >> >> > >> > >> >>> >>>> have
>> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
>> contacted
>> >> me
>> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
>> >> >> >> > >> > >> that
>> >> >> >> > >> > >> >>> they
>> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the
>> >> first
>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
>> >> >> >> > >> > release.
>> >> >> >> > >> > >> Every
>> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
>> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>> >> >> >> > >> > >> never
>> >> >> >> > >> > >> >>> get
>> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of
>> >> great
>> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
>> >> >> >> > >> > >> >>> lots of
>> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
>> release so
>> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
>> >> >> >> > >> > >> use
>> >> >> >> > >> > >> >>> it,
>> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable
>> except
>> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
>> >> >> >> > >> > >> >>> flakey
>> >> >> >> > >> > >> >>> >>>> test
>> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed
>> >> today to
>> >> >> >> > >> > trunk.
>> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
>> >> >> >> > >> > >> that
>> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the
>> code on
>> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
>> >> >> >> > and
>> >> >> >> > >> > jdk8.
>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
>> should
>> >> plan
>> >> >> >> > >> > >> >>> >>>>>>>>>>> our
>> >> >> >> > >> > >> releases:
>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>> >> >> >> > for
>> >> >> >> > >> > >> >>> >>>>>>>>>>> some
>> >> >> >> > >> > >> time.
>> >> >> >> > >> > >> >>> >>>> What I
>> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to
>> >> what
>> >> >> >> > >> > >> >>> >>>>>>>>>>> we
>> >> >> >> > did
>> >> >> >> > >> > >> >>> >>>>>>>>>>> for
>> >> >> >> > >> > >> 3.4.
>> >> >> >> > >> > >> >>> >>>> (this is
>> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during
>> >> their
>> >> >> >> > Hadoop 2
>> >> >> >> > >> > >> release
>> >> >> >> > >> > >> >>> >>>> cycle)
>> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
>> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
>> >> >> >> > >> > >> can run
>> >> >> >> > >> > >> >>> >>>> and
>> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
>> blockers
>> >> >> >> > >> > >> >>> >>>>>>>>>>> and
>> >> >> >> > feel
>> >> >> >> > >> > >> >>> comfortable
>> >> >> >> > >> > >> >>> >>>> with
>> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
>> switch
>> >> to
>> >> >> >> beta.
>> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
>> >> >> >> > >> > >> get
>> >> >> >> > >> > >> >>> >>>> some
>> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta
>> >> moniker
>> >> >> >> > >> > and
>> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
>> >> >> >> > >> > >> >>> making
>> >> >> >> > >> > >> >>> >>>> it
>> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will
>> >> become
>> >> >> >> > >> > >> >>> >>>>>>>>>>> the
>> >> >> >> > >> > >> >>> "current/stable"
>> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
>> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
>> >> 3.5.3-beta
>> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta
>> but
>> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
>> >> >> >> > >> > >> "stable" vs
>> >> >> >> > >> > >> >>> >>>> 3.4.x,
>> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
>> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
>> >> >> >> > >> > >> shake
>> >> >> >> > >> > >> >>> >>>> out)
>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
>> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases
>> for
>> >> >> >> > production
>> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
>> >> >> >> > >> > >> >>> stable,
>> >> >> >> > >> > >> >>> >>>>>>> etc...
>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any
>> of
>> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
>> >> >> >> > >> > >> that
>> >> >> >> > >> > >> >>> >>>> should
>> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a
>> couple
>> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>> >> >> >> > >> > >> find
>> >> >> >> > >> > >> >>> >>>> this a
>> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball
>> >> rolling to
>> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
>> >> >> >> > an
>> >> >> >> > >> RC.
>> >> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
>> >> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >>>>>>
>> >> >> >> > >> > >> >>> >
>> >> >> >> > >> > >> >>>
>> >> >> >> > >> > >>
>> >> >> >> > >>
>> >> >> >> > >>
>> >> >> >> >
>> >> >> >>
>> >> >>
>> >>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Unfortunately doesn't look like we have enough logging going on there.
For example would be nice to know what's the committed config and last seen
config
of the leader when it comes up (leader.lead()). and what configuration is
sent in the NEWLEADER message
sent out in LeaderHandler:

                QuorumPacket newLeaderQP = new
QuorumPacket(Leader.NEWLEADER,
                        newLeaderZxid,
leader.self.getLastSeenQuorumVerifier()
                                .toString().getBytes(), null);


I didn't know about the option to have a separate administrative interface,
and just followed the flow of other commands... I agree that it would be
cleaner.




On Tue, Jul 22, 2014 at 11:36 AM, Patrick Hunt <ph...@apache.org> wrote:

> On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer <sh...@gmail.com>
> wrote:
> > Hmm. It doesn't really make sense to me - the reconfig should be
> completed
> > before
> > the servers come up and process new ops. We submitted the reconfig to
> > server 1, it timed out
> > on new quorum, but when 1 becomes leader again after 2 restarts 1 should
> > complete the reconfig.
> > is 1 becoming leader after 2 restarts ?
> >
>
> What should I look for in the logs? Any specific log messages that
> would help debug?
>
> > About admin controls - reconfig/getConfig are open to everyone, unless
> you
> > set permissions on the configuration znode being written during reconfig.
> >                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
> >
> >                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
> > request.authInfo);
> >
>
> So I can turn off all access then? (read and write). Should we ship
> that as the default? We should add that to the docs.
>
> In the past we've always tried to hide this type of information from
> clients (e.g. we don't expose the zk server address to the client for
> a session). This seems like a very big departure. Why didn't we move
> it to a separate, administrative, interface?
>
> Patrick
>
> >
> >
> > On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com> wrote:
> >
> >> Looks like 3 hasn't been removed (unfortunately the assertion doesn't
> >> include any msg detail, but that's the way it looks to me like the
> >> test is setup):
> >>
> >>         if (leavingServers != null) {
> >>             for (String leaving : leavingServers)
> >>
> >> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
> >>         }
> >>
> >> which is called from:
> >>
> >>         qu.restart(2);
> >>         // Now that 2 is back up, they'll complete the reconfig
> removing 3
> >> and
> >>         // can process other ops.
> >>         testServerHasConfig(zkArr[1], null, leavingServers);
> >>
> >> It seems like the problem is that testServerHasConfig is not waiting
> >> for the configuration to be updated? In this case 2 was just restarted
> >> and 3 hasn't had a chance to be removed? (on a slower machine say,
> >> which might be why you aren't seeing the issue? hence the flakeyness)
> >>
> >> Patrick
> >>
> >> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com>
> >> wrote:
> >> > Hi Patrick, I'm not sure why you're seeing this - it consistently
> passes
> >> on
> >> > my machine. In case you'd like to take a look, the test has tons of
> >> > comments explaining the scenario. Let me know how I can help.
> >> >
> >> >
> >> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org>
> wrote:
> >> >
> >> >> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
> >> >> multiple times (not every time, but ~50%, so flakey) in the last few
> >> >> days. It's failing both on jdk6 and jdk7. (this is my personal
> >> >> jenkins, I haven't see any other failures than this during the past
> >> >> few days).
> >> >>
> >> >> junit.framework.AssertionFailedError
> >> >> at
> >> >>
> >>
> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
> >> >> at
> >> >>
> >>
> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
> >> >> at
> >> >>
> >>
> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
> >> >>
> >> >> Patrick
> >> >>
> >> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <shralex@gmail.com
> >
> >> >> wrote:
> >> >> > Hi Rakesh,
> >> >> >
> >> >> > Thanks for looking at this. In general even if we find the bug
> since
> >> we
> >> >> > should test it before committing a fix, it seems better to remove
> the
> >> >> test
> >> >> > for now and debug this on a build machine. I'm trying to get
> access to
> >> >> it.
> >> >> >
> >> >> > Looking at this log:
> >> >> >
> >> >>
> >>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
> >> >> >
> >> >> > Something weird is going on. Sever 3 hasn't started yet, but
> version
> >> >> 200000000
> >> >> > is already being sent around as committed!
> >> >> >
> >> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
> >> >> >
> >> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293
> ]
> >> >> > - 2 Received version: 200000000 my version: 0
> >> >> >
> >> >> >
> >> >> > and also in leader election messages.
> >> >> >
> >> >> > Also weird is that the version of 2 is 0 as if it is a joiner,
> >> whereas we
> >> >> > explicitly started it with 100000000.
> >> >> > Then it makes sense that the new config can't be committed since
> its
> >> >> > version is not high enough...
> >> >> >
> >> >> > I wonder if its possible that not all servers from the previous
> test
> >> are
> >> >> > dead and they are interfering...
> >> >> >
> >> >> >
> >> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com>
> wrote:
> >> >> >
> >> >> >> Hi Alex,
> >> >> >>
> >> >> >> Yeah it is consistently passing in my machine also.
> >> >> >>
> >> >> >>
> >> >> >> I have quickly gone through the
> >> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
> >> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config
> version)
> >> >> has
> >> >> >> not taken and still leader election is seeing 100000000 (n.config
> >> >> version).
> >> >> >> Unfortunately I didn't find the reason for not considering the
> >> updated
> >> >> >> config version.
> >> >> >>
> >> >> >>
> >> >> >> Reference:
> >> >> >>
> >> >>
> >>
> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
> >> >> >>
> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
> >> >> >> Notification time out: 51200
> >> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification:
> 2
> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
> LOOKING
> >> (my
> >> >> >> state)100000000 (n.config version)
> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification:
> 2
> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
> LOOKING
> >> (my
> >> >> >> state)100000000 (n.config version)
> >> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
> >> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
> >> >> >> Notification time out: 51200
> >> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification:
> 2
> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
> LOOKING
> >> (my
> >> >> >> state)100000000 (n.config version)
> >> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification:
> 2
> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch),
> LOOKING
> >> (my
> >> >> >> state)100000000 (n.config version)
> >> >> >>
> >> >> >>
> >> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
> >> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification:
> 2
> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
> LOOKING
> >> (my
> >> >> >> state)100000000 (n.config version)
> >> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
> >> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification:
> 2
> >> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch),
> LOOKING
> >> (my
> >> >> >> state)100000000 (n.config version)
> >> >> >>
> >> >> >>
> >> >> >> -Rakesh
> >> >> >>
> >> >> >> -----Original Message-----
> >> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
> >> >> >> Sent: 22 July 2014 11:57
> >> >> >> To: dev@zookeeper.apache.org
> >> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
> >> >> >>
> >> >> >> I tried to look into it, but the test consistently passes locally
> on
> >> two
> >> >> >> machines.
> >> >> >> I don't currently have access to the build machine, but I can try
> to
> >> ask
> >> >> >> for access.
> >> >> >> Unless anyone has a better suggestion, we could remove the failing
> >> test
> >> >> in
> >> >> >> the meanwhile and open a JIRA to add it back...
> >> >> >>
> >> >> >>
> >> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
> >> >> wrote:
> >> >> >>
> >> >> >> > I'm seeing alot of test failures in
> >> >> >> > testCurrentObserverIsParticipantInNewConfig could someone take a
> >> look?
> >> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
> >> >> >> >
> >> >> >> >
> >> >> >> >
> >> >>
> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
> >> >> >> >
> >> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
> >> >> >> > tabpanel#comment-14069024
> >> >> >> >
> >> >> >> > Patrick
> >> >> >> >
> >> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
> >> >> >> > <ra...@gmail.com> wrote:
> >> >> >> > > lgtm +1
> >> >> >> > >
> >> >> >> > >
> >> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
> >> >> >> > > <fp...@yahoo.com.invalid>
> >> >> >> > wrote:
> >> >> >> > >
> >> >> >> > >> +1 for having an RC this week. Since this is an alpha
> release, I
> >> >> >> > >> +think
> >> >> >> > 72
> >> >> >> > >> biz hours is enough for the vote.
> >> >> >> > >>
> >> >> >> > >> -Flavio
> >> >> >> > >>
> >> >> >> > >> > -----Original Message-----
> >> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
> >> >> >> > >> > Sent: 21 July 2014 18:55
> >> >> >> > >> > To: DevZooKeeper
> >> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> >> >> >> > >> >
> >> >> >> > >> > I fixed a number of issues. I also started a few threads
> with
> >> >> >> > >> > builds@
> >> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and I
> worked
> >> >> >> > through a
> >> >> >> > >> > number of findbugs issues, it's not closed yet but it's
> pretty
> >> >> >> close.
> >> >> >> > >> >
> >> >> >> > >> > I don't see why we can't create an RC and start voting this
> >> week
> >> >> >> > though.
> >> >> >> > >> > Anyone disagree?
> >> >> >> > >> >
> >> >> >> > >> > How long should we let the vote run, the std 72 biz hours
> or
> >> >> >> > >> > should we
> >> >> >> > >> plan
> >> >> >> > >> > for more to allow folks more time to test?
> >> >> >> > >> >
> >> >> >> > >> > Patrick
> >> >> >> > >> >
> >> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
> >> >> >> > >> > <rg...@itevenworks.net> wrote:
> >> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org>
> >> wrote:
> >> >> >> > >> > >
> >> >> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK
> jobs -
> >> I'm
> >> >> >> > trying
> >> >> >> > >> > >> to fix some of the jobs that were broken during the
> recent
> >> >> >> > >> > >> host upgrade.
> >> >> >> > >> > >>
> >> >> >> > >> > >
> >> >> >> > >> > > How are things looking? Is it likely that we can have a
> >> 3.5.0
> >> >> >> > >> > > alpha release week or are we still blocked on Jenkins?
> >> >> >> > >> > >
> >> >> >> > >> > >
> >> >> >> > >> > > -rgs
> >> >> >> > >> > >
> >> >> >> > >> > >
> >> >> >> > >> > >
> >> >> >> > >> > >
> >> >> >> > >> > >
> >> >> >> > >> > >
> >> >> >> > >> > >> Patrick
> >> >> >> > >> > >>
> >> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> >> >> >> > >> > >> <mi...@cs.stanford.edu>
> >> >> >> > >> > >> wrote:
> >> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
> >> >> >> > >> > >> >
> >> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> >> >> >> > >> > >> > <sh...@gmail.com>
> >> >> >> > >> > >> wrote:
> >> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a
> +1
> >> and
> >> >> >> > >> > >> >> all
> >> >> >> > >> > >> subsequent
> >> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
> >> >> >> > >> > >> >>
> >> >> >> > >> > >> >>
> >> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
> >> >> >> > >> > >> >> <phunt@apache.org
> >> >> >> > >
> >> >> >> > >> > wrote:
> >> >> >> > >> > >> >>
> >> >> >> > >> > >> >>> I'm concerned that the CI tests are all failing due
> to,
> >> >> >> > >> > >> >>> for
> >> >> >> > e.g.
> >> >> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
> >> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the
> recent
> >> >> >> > >> > >> >>> startServer fix
> >> >> >> > and
> >> >> >> > >> > >> >>> some other flakeys that have been addressed go a
> long
> >> way
> >> >> >> > >> > >> >>> on
> >> >> >> > that
> >> >> >> > >> > >> >>> issue) but I think the findbugs problem should be
> >> cleaned
> >> >> >> > >> > >> >>> up before we cut a release. I started a separate
> >> thread to
> >> >> >> > >> > >> >>> discuss
> >> >> >> > >> the
> >> >> >> > >> > findbugs issue.
> >> >> >> > >> > >> >>>
> >> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> >> >> >> > >> > >> >>>
> >> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on
> 1919?
> >> >> >> > >> > >> >>>
> >> >> >> > >> > >> >>> Patrick
> >> >> >> > >> > >> >>>
> >> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
> >> >> >> > >> > >> >>> > My take:
> >> >> >> > >> > >> >>> >
> >> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and
> it
> >> can
> >> >> >> > >> > >> >>> > go
> >> >> >> > in.
> >> >> >> > >> > >> >>> > See
> >> >> >> > >> > >> the
> >> >> >> > >> > >> >>> jira for comments.
> >> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first
> alpha.
> >> >> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
> >> >> >> > >> > >> >>> > ZK-1919
> >> >> >> > and
> >> >> >> > >> > >> ZK-1910,
> >> >> >> > >> > >> >>> we can leave it for the second alpha.
> >> >> >> > >> > >> >>> >
> >> >> >> > >> > >> >>> > If you agree with this, then we should be able to
> >> cut a
> >> >> >> > >> > >> >>> > candidate by
> >> >> >> > >> > >> the
> >> >> >> > >> > >> >>> end of this week.
> >> >> >> > >> > >> >>> >
> >> >> >> > >> > >> >>> > -Flavio
> >> >> >> > >> > >> >>> >
> >> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
> >> >> >> > >> > >> >>> > <ph...@apache.org>
> >> >> >> > >> wrote:
> >> >> >> > >> > >> >>> >
> >> >> >> > >> > >> >>> >> Per my previous note you can now see the c client
> >> test
> >> >> >> > >> > >> >>> >> log output
> >> >> >> > >> > >> here
> >> >> >> > >> > >> >>> >> in the "build artifacts" section:
> >> >> >> > >> > >> >>> >>
> >> >> >> > >> > >> >>>
> >> >> >> > >> > >>
> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
> >> >> >> > >> > >> r-
> >> >> >> > >> > trunk
> >> >> >> > >> > >> /2372/
> >> >> >> > >> > >> >>> >>
> >> >> >> > >> > >> >>> >> Patrick
> >> >> >> > >> > >> >>> >>
> >> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> >> >> >> > >> > >> >>> >> <ph...@apache.org>
> >> >> >> > >> > >> wrote:
> >> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not
> >> clear
> >> >> >> > >> > >> >>> >>> to me which
> >> >> >> > >> > >> >>> >>> one(s?) is new?)
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is
> hitting
> >> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well.
> I've
> >> >> >> > >> > >> >>> >>> updated the "archive" list
> >> >> >> > >> > >> to
> >> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it
> >> won't
> >> >> >> > >> > >> >>> >>> go
> >> >> >> > to
> >> >> >> > >> > >> console
> >> >> >> > >> > >> >>> >>> at least we can debug when there is a failure.
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the
> >> jetty
> >> >> >> > server
> >> >> >> > >> > >> support
> >> >> >> > >> > >> >>> >>> and it looks like that should be ready soon.
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>> Raul also requested that someone prioritize
> >> reviewing
> >> >> >> > >> > >> "ZOOKEEPER-1919
> >> >> >> > >> > >> >>> >>> Update the C implementation of removeWatches to
> >> have
> >> >> >> > >> > >> >>> >>> it
> >> >> >> > >> > match
> >> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in
> 3.5.0.
> >> >> >> > >> Flavio/Michi?
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c
> >> client
> >> >> >> > >> > >> >>> >>> reconfig
> >> >> >> > >> > >> test -
> >> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>> Based on previous comments it looks like we're
> >> pretty
> >> >> >> > close.
> >> >> >> > >> > >> >>> >>> Do
> >> >> >> > >> > >> folks
> >> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this
> point?
> >> >> >> > >> > >> >>> >>> (with a few
> >> >> >> > >> > >> pending
> >> >> >> > >> > >> >>> >>> as above)
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>> Patrick
> >> >> >> > >> > >> >>> >>>
> >> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
> >> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
> >> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
> >> >> >> > >> > >> >>> >>>> wrote:
> >> >> >> > >> > >> >>> >>>>>
> >> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if
> we
> >> >> >> > >> > >> >>> >>>>> release
> >> >> >> > an
> >> >> >> > >> > >> >>> >>>>> alpha
> >> >> >> > >> > >> >>> version
> >> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in
> 2-3
> >> >> >> > >> > >> >>> >>>> weeks
> >> >> >> > time?
> >> >> >> > >> > >> >>> >>>>>
> >> >> >> > >> > >> >>> >>>>
> >> >> >> > >> > >> >>> >>>> +1
> >> >> >> > >> > >> >>> >>>>
> >> >> >> > >> > >> >>> >>>> -rgs
> >> >> >> > >> > >> >>> >>>>
> >> >> >> > >> > >> >>> >>>>> -Flavio
> >> >> >> > >> > >> >>> >>>>>
> >> >> >> > >> > >> >>> >>>>>
> >> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl
> Gutiérrez
> >> >> >> > Segalés <
> >> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
> >> >> >> > >> > >> >>> >>>>>
> >> >> >> > >> > >> >>> >>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
> >> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
> >> >> >> > >> > wrote:
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0
> >> (from 8
> >> >> >> > >> > >> >>> >>>>>>> in
> >> >> >> > the
> >> >> >> > >> > >> >>> >>>>>>> last
> >> >> >> > >> > >> >>> check).
> >> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
> >> >> >> > >> > >> >>> >>>>>>> Camille is
> >> >> >> > >> > >> threatening
> >> >> >> > >> > >> >>> to
> >> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in
> >> general
> >> >> >> > >> > >> >>> >>>>>>> on
> >> >> >> > the
> >> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to
> see.
> >> >> >> > >> > >> >>> >>>>>>>
> >> >> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
> >> >> >> > >> > >> >>> >>>>>>> should we drop
> >> >> >> > >> > >> jdk6
> >> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by
> Oracle
> >> >> >> > >> > >> >>> >>>>>>> but I suspect
> >> >> >> > >> > >> some
> >> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta
> move
> >> >> >> > >> > >> >>> >>>>>>> forward though,
> >> >> >> > >> > >> >>> can't
> >> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we
> are
> >> >> >> > currently
> >> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and
> 8.
> >> >> >> > >> > >> >>> >>>>>>>
> >> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >> >> >> > >> > >> >>> >>>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
> >> >> >> > >> > >> >>> >>>>
> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
> >> >> >> > >> > >> >>> >>>>>> Observers with the
> >> >> >> > >> > >> >>> upcoming
> >> >> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
> >> >> >> wild...).
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>> -rgs
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>> Patrick
> >> >> >> > >> > >> >>> >>>>>>>
> >> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
> >> Junqueira
> >> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in
> already,
> >> >> >> > >> > >> >>> >>>>>>>> but I need a +1
> >> >> >> > >> > >> >>> >>>> there. I
> >> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
> >> >> >> > >> > >> >>> >>>>>>> failed in the
> >> >> >> > >> > >> last QA
> >> >> >> > >> > >> >>> run
> >> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect
> >> LETest,
> >> >> >> > >> > >> >>> >>>>>>> and
> >> >> >> > in
> >> >> >> > >> > >> >>> >>>>>>> fact
> >> >> >> > >> > >> it
> >> >> >> > >> > >> >>> fails
> >> >> >> > >> > >> >>> >>>> in
> >> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure
> >> doesn't
> >> >> >> > >> > >> >>> >>>>>>> seem
> >> >> >> > to
> >> >> >> > >> > >> >>> >>>>>>> be
> >> >> >> > >> > >> >>> related
> >> >> >> > >> > >> >>> >>>> to the
> >> >> >> > >> > >> >>> >>>>>>> patch.
> >> >> >> > >> > >> >>> >>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
> >> >> >> > >> > >> >>> >>>>>>>> anything concrete
> >> >> >> > >> > >> about
> >> >> >> > >> > >> >>> it.
> >> >> >> > >> > >> >>> >>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>> -Flavio
> >> >> >> > >> > >> >>> >>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick
> >> Hunt <
> >> >> >> > >> > >> phunt@apache.org>
> >> >> >> > >> > >> >>> >>>> wrote:
> >> >> >> > >> > >> >>> >>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can
> get
> >> >> >> > >> > >> reviewed/finalized
> >> >> >> > >> > >> >>> before
> >> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try
> cutting
> >> an
> >> >> >> > >> > >> >>> >>>>>>>>> RC
> >> >> >> > >> > soonish...
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>> Patrick
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
> >> >> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
> >> >> wrote:
> >> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha
> versions.
> >> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and
> >> ZK-1863
> >> >> in.
> >> >> >> > >> > >> >>> >>>>>>>>>> They are
> >> >> >> > >> > >> both
> >> >> >> > >> > >> >>> >>>> patch
> >> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is
> still
> >> >> >> > >> > >> >>> >>>>>>> open because we
> >> >> >> > >> > >> >>> need a
> >> >> >> > >> > >> >>> >>>> 3.4
> >> >> >> > >> > >> >>> >>>>>>> patch.
> >> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
> >> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
> >> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
> >> >> >> > >> > >> >>> wrote:
> >> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it
> for
> >> a
> >> >> >> > while, a
> >> >> >> > >> > >> >>> >>>>>>>>>>> few
> >> >> >> > >> > >> >>> people
> >> >> >> > >> > >> >>> >>>> have
> >> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as
> contacted
> >> me
> >> >> >> > >> > >> >>> >>>>>>>>>>> personally
> >> >> >> > >> > >> that
> >> >> >> > >> > >> >>> they
> >> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the
> >> first
> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
> >> >> >> > >> > release.
> >> >> >> > >> > >> Every
> >> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
> >> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
> >> >> >> > >> > >> never
> >> >> >> > >> > >> >>> get
> >> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of
> >> great
> >> >> >> > >> > >> >>> >>>>>>>>>>> new features,
> >> >> >> > >> > >> >>> lots of
> >> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a
> release so
> >> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
> >> >> >> > >> > >> use
> >> >> >> > >> > >> >>> it,
> >> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable
> except
> >> >> >> > >> > >> >>> >>>>>>>>>>> for the known
> >> >> >> > >> > >> >>> flakey
> >> >> >> > >> > >> >>> >>>> test
> >> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed
> >> today to
> >> >> >> > >> > trunk.
> >> >> >> > >> > >> >>> >>>>>>>>>>> Note
> >> >> >> > >> > >> that
> >> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the
> code on
> >> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
> >> >> >> > and
> >> >> >> > >> > jdk8.
> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we
> should
> >> plan
> >> >> >> > >> > >> >>> >>>>>>>>>>> our
> >> >> >> > >> > >> releases:
> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
> >> >> >> > for
> >> >> >> > >> > >> >>> >>>>>>>>>>> some
> >> >> >> > >> > >> time.
> >> >> >> > >> > >> >>> >>>> What I
> >> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to
> >> what
> >> >> >> > >> > >> >>> >>>>>>>>>>> we
> >> >> >> > did
> >> >> >> > >> > >> >>> >>>>>>>>>>> for
> >> >> >> > >> > >> 3.4.
> >> >> >> > >> > >> >>> >>>> (this is
> >> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during
> >> their
> >> >> >> > Hadoop 2
> >> >> >> > >> > >> release
> >> >> >> > >> > >> >>> >>>> cycle)
> >> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
> >> >> >> > >> > >> >>> >>>>>>>>>>> something people
> >> >> >> > >> > >> can run
> >> >> >> > >> > >> >>> >>>> and
> >> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the
> blockers
> >> >> >> > >> > >> >>> >>>>>>>>>>> and
> >> >> >> > feel
> >> >> >> > >> > >> >>> comfortable
> >> >> >> > >> > >> >>> >>>> with
> >> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then
> switch
> >> to
> >> >> >> beta.
> >> >> >> > >> > >> >>> >>>>>>>>>>> Once we
> >> >> >> > >> > >> get
> >> >> >> > >> > >> >>> >>>> some
> >> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta
> >> moniker
> >> >> >> > >> > and
> >> >> >> > >> > >> >>> >>>>>>>>>>> look at
> >> >> >> > >> > >> >>> making
> >> >> >> > >> > >> >>> >>>> it
> >> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will
> >> become
> >> >> >> > >> > >> >>> >>>>>>>>>>> the
> >> >> >> > >> > >> >>> "current/stable"
> >> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
> >> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
> >> 3.5.3-beta
> >> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta
> but
> >> >> >> > >> > >> >>> >>>>>>>>>>> also not
> >> >> >> > >> > >> "stable" vs
> >> >> >> > >> > >> >>> >>>> 3.4.x,
> >> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
> >> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
> >> >> >> > >> > >> shake
> >> >> >> > >> > >> >>> >>>> out)
> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
> >> >> >> > >> > >> >>> >>>>>>>>>>> ....
> >> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases
> for
> >> >> >> > production
> >> >> >> > >> > >> >>> >>>>>>>>>>> use,
> >> >> >> > >> > >> >>> stable,
> >> >> >> > >> > >> >>> >>>>>>> etc...
> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any
> of
> >> >> >> > >> > >> >>> >>>>>>>>>>> these something
> >> >> >> > >> > >> that
> >> >> >> > >> > >> >>> >>>> should
> >> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a
> couple
> >> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
> >> >> >> > >> > >> find
> >> >> >> > >> > >> >>> >>>> this a
> >> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball
> >> rolling to
> >> >> >> > >> > >> >>> >>>>>>>>>>> cut
> >> >> >> > an
> >> >> >> > >> RC.
> >> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
> >> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>>>
> >> >> >> > >> > >> >>> >>>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >>>>>>
> >> >> >> > >> > >> >>> >
> >> >> >> > >> > >> >>>
> >> >> >> > >> > >>
> >> >> >> > >>
> >> >> >> > >>
> >> >> >> >
> >> >> >>
> >> >>
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
On Tue, Jul 22, 2014 at 11:29 AM, Alexander Shraer <sh...@gmail.com> wrote:
> Hmm. It doesn't really make sense to me - the reconfig should be completed
> before
> the servers come up and process new ops. We submitted the reconfig to
> server 1, it timed out
> on new quorum, but when 1 becomes leader again after 2 restarts 1 should
> complete the reconfig.
> is 1 becoming leader after 2 restarts ?
>

What should I look for in the logs? Any specific log messages that
would help debug?

> About admin controls - reconfig/getConfig are open to everyone, unless you
> set permissions on the configuration znode being written during reconfig.
>                nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);
>
>                 checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
> request.authInfo);
>

So I can turn off all access then? (read and write). Should we ship
that as the default? We should add that to the docs.

In the past we've always tried to hide this type of information from
clients (e.g. we don't expose the zk server address to the client for
a session). This seems like a very big departure. Why didn't we move
it to a separate, administrative, interface?

Patrick

>
>
> On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com> wrote:
>
>> Looks like 3 hasn't been removed (unfortunately the assertion doesn't
>> include any msg detail, but that's the way it looks to me like the
>> test is setup):
>>
>>         if (leavingServers != null) {
>>             for (String leaving : leavingServers)
>>
>> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>>         }
>>
>> which is called from:
>>
>>         qu.restart(2);
>>         // Now that 2 is back up, they'll complete the reconfig removing 3
>> and
>>         // can process other ops.
>>         testServerHasConfig(zkArr[1], null, leavingServers);
>>
>> It seems like the problem is that testServerHasConfig is not waiting
>> for the configuration to be updated? In this case 2 was just restarted
>> and 3 hasn't had a chance to be removed? (on a slower machine say,
>> which might be why you aren't seeing the issue? hence the flakeyness)
>>
>> Patrick
>>
>> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com>
>> wrote:
>> > Hi Patrick, I'm not sure why you're seeing this - it consistently passes
>> on
>> > my machine. In case you'd like to take a look, the test has tons of
>> > comments explaining the scenario. Let me know how I can help.
>> >
>> >
>> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>> >
>> >> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
>> >> multiple times (not every time, but ~50%, so flakey) in the last few
>> >> days. It's failing both on jdk6 and jdk7. (this is my personal
>> >> jenkins, I haven't see any other failures than this during the past
>> >> few days).
>> >>
>> >> junit.framework.AssertionFailedError
>> >> at
>> >>
>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>> >> at
>> >>
>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>> >> at
>> >>
>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>> >>
>> >> Patrick
>> >>
>> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <sh...@gmail.com>
>> >> wrote:
>> >> > Hi Rakesh,
>> >> >
>> >> > Thanks for looking at this. In general even if we find the bug since
>> we
>> >> > should test it before committing a fix, it seems better to remove the
>> >> test
>> >> > for now and debug this on a build machine. I'm trying to get access to
>> >> it.
>> >> >
>> >> > Looking at this log:
>> >> >
>> >>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>> >> >
>> >> > Something weird is going on. Sever 3 hasn't started yet, but version
>> >> 200000000
>> >> > is already being sent around as committed!
>> >> >
>> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>> >> >
>> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
>> >> > - 2 Received version: 200000000 my version: 0
>> >> >
>> >> >
>> >> > and also in leader election messages.
>> >> >
>> >> > Also weird is that the version of 2 is 0 as if it is a joiner,
>> whereas we
>> >> > explicitly started it with 100000000.
>> >> > Then it makes sense that the new config can't be committed since its
>> >> > version is not high enough...
>> >> >
>> >> > I wonder if its possible that not all servers from the previous test
>> are
>> >> > dead and they are interfering...
>> >> >
>> >> >
>> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:
>> >> >
>> >> >> Hi Alex,
>> >> >>
>> >> >> Yeah it is consistently passing in my machine also.
>> >> >>
>> >> >>
>> >> >> I have quickly gone through the
>> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version)
>> >> has
>> >> >> not taken and still leader election is seeing 100000000 (n.config
>> >> version).
>> >> >> Unfortunately I didn't find the reason for not considering the
>> updated
>> >> >> config version.
>> >> >>
>> >> >>
>> >> >> Reference:
>> >> >>
>> >>
>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>> >> >>
>> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
>> >> >> Notification time out: 51200
>> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING
>> (my
>> >> >> state)100000000 (n.config version)
>> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING
>> (my
>> >> >> state)100000000 (n.config version)
>> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
>> >> >> Notification time out: 51200
>> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING
>> (my
>> >> >> state)100000000 (n.config version)
>> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING
>> (my
>> >> >> state)100000000 (n.config version)
>> >> >>
>> >> >>
>> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING
>> (my
>> >> >> state)100000000 (n.config version)
>> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING
>> (my
>> >> >> state)100000000 (n.config version)
>> >> >>
>> >> >>
>> >> >> -Rakesh
>> >> >>
>> >> >> -----Original Message-----
>> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>> >> >> Sent: 22 July 2014 11:57
>> >> >> To: dev@zookeeper.apache.org
>> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >> >>
>> >> >> I tried to look into it, but the test consistently passes locally on
>> two
>> >> >> machines.
>> >> >> I don't currently have access to the build machine, but I can try to
>> ask
>> >> >> for access.
>> >> >> Unless anyone has a better suggestion, we could remove the failing
>> test
>> >> in
>> >> >> the meanwhile and open a JIRA to add it back...
>> >> >>
>> >> >>
>> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
>> >> wrote:
>> >> >>
>> >> >> > I'm seeing alot of test failures in
>> >> >> > testCurrentObserverIsParticipantInNewConfig could someone take a
>> look?
>> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
>> >> >> >
>> >> >> >
>> >> >> >
>> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>> >> >> >
>> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>> >> >> > tabpanel#comment-14069024
>> >> >> >
>> >> >> > Patrick
>> >> >> >
>> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>> >> >> > <ra...@gmail.com> wrote:
>> >> >> > > lgtm +1
>> >> >> > >
>> >> >> > >
>> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>> >> >> > > <fp...@yahoo.com.invalid>
>> >> >> > wrote:
>> >> >> > >
>> >> >> > >> +1 for having an RC this week. Since this is an alpha release, I
>> >> >> > >> +think
>> >> >> > 72
>> >> >> > >> biz hours is enough for the vote.
>> >> >> > >>
>> >> >> > >> -Flavio
>> >> >> > >>
>> >> >> > >> > -----Original Message-----
>> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>> >> >> > >> > Sent: 21 July 2014 18:55
>> >> >> > >> > To: DevZooKeeper
>> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >> >> > >> >
>> >> >> > >> > I fixed a number of issues. I also started a few threads with
>> >> >> > >> > builds@
>> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
>> >> >> > through a
>> >> >> > >> > number of findbugs issues, it's not closed yet but it's pretty
>> >> >> close.
>> >> >> > >> >
>> >> >> > >> > I don't see why we can't create an RC and start voting this
>> week
>> >> >> > though.
>> >> >> > >> > Anyone disagree?
>> >> >> > >> >
>> >> >> > >> > How long should we let the vote run, the std 72 biz hours or
>> >> >> > >> > should we
>> >> >> > >> plan
>> >> >> > >> > for more to allow folks more time to test?
>> >> >> > >> >
>> >> >> > >> > Patrick
>> >> >> > >> >
>> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>> >> >> > >> > <rg...@itevenworks.net> wrote:
>> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org>
>> wrote:
>> >> >> > >> > >
>> >> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs -
>> I'm
>> >> >> > trying
>> >> >> > >> > >> to fix some of the jobs that were broken during the recent
>> >> >> > >> > >> host upgrade.
>> >> >> > >> > >>
>> >> >> > >> > >
>> >> >> > >> > > How are things looking? Is it likely that we can have a
>> 3.5.0
>> >> >> > >> > > alpha release week or are we still blocked on Jenkins?
>> >> >> > >> > >
>> >> >> > >> > >
>> >> >> > >> > > -rgs
>> >> >> > >> > >
>> >> >> > >> > >
>> >> >> > >> > >
>> >> >> > >> > >
>> >> >> > >> > >
>> >> >> > >> > >
>> >> >> > >> > >> Patrick
>> >> >> > >> > >>
>> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>> >> >> > >> > >> <mi...@cs.stanford.edu>
>> >> >> > >> > >> wrote:
>> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>> >> >> > >> > >> >
>> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>> >> >> > >> > >> > <sh...@gmail.com>
>> >> >> > >> > >> wrote:
>> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1
>> and
>> >> >> > >> > >> >> all
>> >> >> > >> > >> subsequent
>> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>> >> >> > >> > >> >>
>> >> >> > >> > >> >>
>> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>> >> >> > >> > >> >> <phunt@apache.org
>> >> >> > >
>> >> >> > >> > wrote:
>> >> >> > >> > >> >>
>> >> >> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
>> >> >> > >> > >> >>> for
>> >> >> > e.g.
>> >> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
>> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
>> >> >> > >> > >> >>> startServer fix
>> >> >> > and
>> >> >> > >> > >> >>> some other flakeys that have been addressed go a long
>> way
>> >> >> > >> > >> >>> on
>> >> >> > that
>> >> >> > >> > >> >>> issue) but I think the findbugs problem should be
>> cleaned
>> >> >> > >> > >> >>> up before we cut a release. I started a separate
>> thread to
>> >> >> > >> > >> >>> discuss
>> >> >> > >> the
>> >> >> > >> > findbugs issue.
>> >> >> > >> > >> >>>
>> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>> >> >> > >> > >> >>>
>> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
>> >> >> > >> > >> >>>
>> >> >> > >> > >> >>> Patrick
>> >> >> > >> > >> >>>
>> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>> >> >> > >> > >> >>> > My take:
>> >> >> > >> > >> >>> >
>> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it
>> can
>> >> >> > >> > >> >>> > go
>> >> >> > in.
>> >> >> > >> > >> >>> > See
>> >> >> > >> > >> the
>> >> >> > >> > >> >>> jira for comments.
>> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
>> >> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
>> >> >> > >> > >> >>> > ZK-1919
>> >> >> > and
>> >> >> > >> > >> ZK-1910,
>> >> >> > >> > >> >>> we can leave it for the second alpha.
>> >> >> > >> > >> >>> >
>> >> >> > >> > >> >>> > If you agree with this, then we should be able to
>> cut a
>> >> >> > >> > >> >>> > candidate by
>> >> >> > >> > >> the
>> >> >> > >> > >> >>> end of this week.
>> >> >> > >> > >> >>> >
>> >> >> > >> > >> >>> > -Flavio
>> >> >> > >> > >> >>> >
>> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>> >> >> > >> > >> >>> > <ph...@apache.org>
>> >> >> > >> wrote:
>> >> >> > >> > >> >>> >
>> >> >> > >> > >> >>> >> Per my previous note you can now see the c client
>> test
>> >> >> > >> > >> >>> >> log output
>> >> >> > >> > >> here
>> >> >> > >> > >> >>> >> in the "build artifacts" section:
>> >> >> > >> > >> >>> >>
>> >> >> > >> > >> >>>
>> >> >> > >> > >>
>> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>> >> >> > >> > >> r-
>> >> >> > >> > trunk
>> >> >> > >> > >> /2372/
>> >> >> > >> > >> >>> >>
>> >> >> > >> > >> >>> >> Patrick
>> >> >> > >> > >> >>> >>
>> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>> >> >> > >> > >> >>> >> <ph...@apache.org>
>> >> >> > >> > >> wrote:
>> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not
>> clear
>> >> >> > >> > >> >>> >>> to me which
>> >> >> > >> > >> >>> >>> one(s?) is new?)
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
>> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
>> >> >> > >> > >> >>> >>> updated the "archive" list
>> >> >> > >> > >> to
>> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it
>> won't
>> >> >> > >> > >> >>> >>> go
>> >> >> > to
>> >> >> > >> > >> console
>> >> >> > >> > >> >>> >>> at least we can debug when there is a failure.
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the
>> jetty
>> >> >> > server
>> >> >> > >> > >> support
>> >> >> > >> > >> >>> >>> and it looks like that should be ready soon.
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>> Raul also requested that someone prioritize
>> reviewing
>> >> >> > >> > >> "ZOOKEEPER-1919
>> >> >> > >> > >> >>> >>> Update the C implementation of removeWatches to
>> have
>> >> >> > >> > >> >>> >>> it
>> >> >> > >> > match
>> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
>> >> >> > >> Flavio/Michi?
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c
>> client
>> >> >> > >> > >> >>> >>> reconfig
>> >> >> > >> > >> test -
>> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>> Based on previous comments it looks like we're
>> pretty
>> >> >> > close.
>> >> >> > >> > >> >>> >>> Do
>> >> >> > >> > >> folks
>> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
>> >> >> > >> > >> >>> >>> (with a few
>> >> >> > >> > >> pending
>> >> >> > >> > >> >>> >>> as above)
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>> Patrick
>> >> >> > >> > >> >>> >>>
>> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
>> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>> >> >> > >> > >> >>> >>>> wrote:
>> >> >> > >> > >> >>> >>>>>
>> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
>> >> >> > >> > >> >>> >>>>> release
>> >> >> > an
>> >> >> > >> > >> >>> >>>>> alpha
>> >> >> > >> > >> >>> version
>> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
>> >> >> > >> > >> >>> >>>> weeks
>> >> >> > time?
>> >> >> > >> > >> >>> >>>>>
>> >> >> > >> > >> >>> >>>>
>> >> >> > >> > >> >>> >>>> +1
>> >> >> > >> > >> >>> >>>>
>> >> >> > >> > >> >>> >>>> -rgs
>> >> >> > >> > >> >>> >>>>
>> >> >> > >> > >> >>> >>>>> -Flavio
>> >> >> > >> > >> >>> >>>>>
>> >> >> > >> > >> >>> >>>>>
>> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
>> >> >> > Segalés <
>> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>> >> >> > >> > >> >>> >>>>>
>> >> >> > >> > >> >>> >>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>> >> >> > >> > wrote:
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0
>> (from 8
>> >> >> > >> > >> >>> >>>>>>> in
>> >> >> > the
>> >> >> > >> > >> >>> >>>>>>> last
>> >> >> > >> > >> >>> check).
>> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
>> >> >> > >> > >> >>> >>>>>>> Camille is
>> >> >> > >> > >> threatening
>> >> >> > >> > >> >>> to
>> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in
>> general
>> >> >> > >> > >> >>> >>>>>>> on
>> >> >> > the
>> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
>> >> >> > >> > >> >>> >>>>>>>
>> >> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
>> >> >> > >> > >> >>> >>>>>>> should we drop
>> >> >> > >> > >> jdk6
>> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
>> >> >> > >> > >> >>> >>>>>>> but I suspect
>> >> >> > >> > >> some
>> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
>> >> >> > >> > >> >>> >>>>>>> forward though,
>> >> >> > >> > >> >>> can't
>> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
>> >> >> > currently
>> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>> >> >> > >> > >> >>> >>>>>>>
>> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >> >> > >> > >> >>> >>>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
>> >> >> > >> > >> >>> >>>>
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
>> >> >> > >> > >> >>> >>>>>> Observers with the
>> >> >> > >> > >> >>> upcoming
>> >> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
>> >> >> wild...).
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>> -rgs
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>> Patrick
>> >> >> > >> > >> >>> >>>>>>>
>> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
>> Junqueira
>> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
>> >> >> > >> > >> >>> >>>>>>>> but I need a +1
>> >> >> > >> > >> >>> >>>> there. I
>> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
>> >> >> > >> > >> >>> >>>>>>> failed in the
>> >> >> > >> > >> last QA
>> >> >> > >> > >> >>> run
>> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect
>> LETest,
>> >> >> > >> > >> >>> >>>>>>> and
>> >> >> > in
>> >> >> > >> > >> >>> >>>>>>> fact
>> >> >> > >> > >> it
>> >> >> > >> > >> >>> fails
>> >> >> > >> > >> >>> >>>> in
>> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure
>> doesn't
>> >> >> > >> > >> >>> >>>>>>> seem
>> >> >> > to
>> >> >> > >> > >> >>> >>>>>>> be
>> >> >> > >> > >> >>> related
>> >> >> > >> > >> >>> >>>> to the
>> >> >> > >> > >> >>> >>>>>>> patch.
>> >> >> > >> > >> >>> >>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
>> >> >> > >> > >> >>> >>>>>>>> anything concrete
>> >> >> > >> > >> about
>> >> >> > >> > >> >>> it.
>> >> >> > >> > >> >>> >>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>> -Flavio
>> >> >> > >> > >> >>> >>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick
>> Hunt <
>> >> >> > >> > >> phunt@apache.org>
>> >> >> > >> > >> >>> >>>> wrote:
>> >> >> > >> > >> >>> >>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
>> >> >> > >> > >> reviewed/finalized
>> >> >> > >> > >> >>> before
>> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting
>> an
>> >> >> > >> > >> >>> >>>>>>>>> RC
>> >> >> > >> > soonish...
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>> Patrick
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
>> >> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
>> >> wrote:
>> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and
>> ZK-1863
>> >> in.
>> >> >> > >> > >> >>> >>>>>>>>>> They are
>> >> >> > >> > >> both
>> >> >> > >> > >> >>> >>>> patch
>> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
>> >> >> > >> > >> >>> >>>>>>> open because we
>> >> >> > >> > >> >>> need a
>> >> >> > >> > >> >>> >>>> 3.4
>> >> >> > >> > >> >>> >>>>>>> patch.
>> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
>> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>> >> >> > >> > >> >>> wrote:
>> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for
>> a
>> >> >> > while, a
>> >> >> > >> > >> >>> >>>>>>>>>>> few
>> >> >> > >> > >> >>> people
>> >> >> > >> > >> >>> >>>> have
>> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted
>> me
>> >> >> > >> > >> >>> >>>>>>>>>>> personally
>> >> >> > >> > >> that
>> >> >> > >> > >> >>> they
>> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the
>> first
>> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
>> >> >> > >> > release.
>> >> >> > >> > >> Every
>> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
>> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>> >> >> > >> > >> never
>> >> >> > >> > >> >>> get
>> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of
>> great
>> >> >> > >> > >> >>> >>>>>>>>>>> new features,
>> >> >> > >> > >> >>> lots of
>> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
>> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
>> >> >> > >> > >> use
>> >> >> > >> > >> >>> it,
>> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
>> >> >> > >> > >> >>> >>>>>>>>>>> for the known
>> >> >> > >> > >> >>> flakey
>> >> >> > >> > >> >>> >>>> test
>> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed
>> today to
>> >> >> > >> > trunk.
>> >> >> > >> > >> >>> >>>>>>>>>>> Note
>> >> >> > >> > >> that
>> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
>> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
>> >> >> > and
>> >> >> > >> > jdk8.
>> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should
>> plan
>> >> >> > >> > >> >>> >>>>>>>>>>> our
>> >> >> > >> > >> releases:
>> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>> >> >> > for
>> >> >> > >> > >> >>> >>>>>>>>>>> some
>> >> >> > >> > >> time.
>> >> >> > >> > >> >>> >>>> What I
>> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to
>> what
>> >> >> > >> > >> >>> >>>>>>>>>>> we
>> >> >> > did
>> >> >> > >> > >> >>> >>>>>>>>>>> for
>> >> >> > >> > >> 3.4.
>> >> >> > >> > >> >>> >>>> (this is
>> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during
>> their
>> >> >> > Hadoop 2
>> >> >> > >> > >> release
>> >> >> > >> > >> >>> >>>> cycle)
>> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
>> >> >> > >> > >> >>> >>>>>>>>>>> something people
>> >> >> > >> > >> can run
>> >> >> > >> > >> >>> >>>> and
>> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
>> >> >> > >> > >> >>> >>>>>>>>>>> and
>> >> >> > feel
>> >> >> > >> > >> >>> comfortable
>> >> >> > >> > >> >>> >>>> with
>> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch
>> to
>> >> >> beta.
>> >> >> > >> > >> >>> >>>>>>>>>>> Once we
>> >> >> > >> > >> get
>> >> >> > >> > >> >>> >>>> some
>> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta
>> moniker
>> >> >> > >> > and
>> >> >> > >> > >> >>> >>>>>>>>>>> look at
>> >> >> > >> > >> >>> making
>> >> >> > >> > >> >>> >>>> it
>> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will
>> become
>> >> >> > >> > >> >>> >>>>>>>>>>> the
>> >> >> > >> > >> >>> "current/stable"
>> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
>> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
>> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
>> 3.5.3-beta
>> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
>> >> >> > >> > >> >>> >>>>>>>>>>> also not
>> >> >> > >> > >> "stable" vs
>> >> >> > >> > >> >>> >>>> 3.4.x,
>> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
>> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
>> >> >> > >> > >> shake
>> >> >> > >> > >> >>> >>>> out)
>> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>> >> >> > >> > >> >>> >>>>>>>>>>> ....
>> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
>> >> >> > production
>> >> >> > >> > >> >>> >>>>>>>>>>> use,
>> >> >> > >> > >> >>> stable,
>> >> >> > >> > >> >>> >>>>>>> etc...
>> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
>> >> >> > >> > >> >>> >>>>>>>>>>> these something
>> >> >> > >> > >> that
>> >> >> > >> > >> >>> >>>> should
>> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
>> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>> >> >> > >> > >> find
>> >> >> > >> > >> >>> >>>> this a
>> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball
>> rolling to
>> >> >> > >> > >> >>> >>>>>>>>>>> cut
>> >> >> > an
>> >> >> > >> RC.
>> >> >> > >> > >> >>> >>>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
>> >> >> > >> > >> >>> >>>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>>>
>> >> >> > >> > >> >>> >>>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >>>>>>
>> >> >> > >> > >> >>> >
>> >> >> > >> > >> >>>
>> >> >> > >> > >>
>> >> >> > >>
>> >> >> > >>
>> >> >> >
>> >> >>
>> >>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Hmm. It doesn't really make sense to me - the reconfig should be completed
before
the servers come up and process new ops. We submitted the reconfig to
server 1, it timed out
on new quorum, but when 1 becomes leader again after 2 restarts 1 should
complete the reconfig.
is 1 becoming leader after 2 restarts ?

About admin controls - reconfig/getConfig are open to everyone, unless you
set permissions on the configuration znode being written during reconfig.
               nodeRecord = getRecordForPath(ZooDefs.CONFIG_NODE);

                checkACL(zks, nodeRecord.acl, ZooDefs.Perms.WRITE,
request.authInfo);



On Tue, Jul 22, 2014 at 11:16 AM, Patrick Hunt <ph...@gmail.com> wrote:

> Looks like 3 hasn't been removed (unfortunately the assertion doesn't
> include any msg detail, but that's the way it looks to me like the
> test is setup):
>
>         if (leavingServers != null) {
>             for (String leaving : leavingServers)
>
> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>         }
>
> which is called from:
>
>         qu.restart(2);
>         // Now that 2 is back up, they'll complete the reconfig removing 3
> and
>         // can process other ops.
>         testServerHasConfig(zkArr[1], null, leavingServers);
>
> It seems like the problem is that testServerHasConfig is not waiting
> for the configuration to be updated? In this case 2 was just restarted
> and 3 hasn't had a chance to be removed? (on a slower machine say,
> which might be why you aren't seeing the issue? hence the flakeyness)
>
> Patrick
>
> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com>
> wrote:
> > Hi Patrick, I'm not sure why you're seeing this - it consistently passes
> on
> > my machine. In case you'd like to take a look, the test has tons of
> > comments explaining the scenario. Let me know how I can help.
> >
> >
> > On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org> wrote:
> >
> >> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
> >> multiple times (not every time, but ~50%, so flakey) in the last few
> >> days. It's failing both on jdk6 and jdk7. (this is my personal
> >> jenkins, I haven't see any other failures than this during the past
> >> few days).
> >>
> >> junit.framework.AssertionFailedError
> >> at
> >>
> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
> >> at
> >>
> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
> >> at
> >>
> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
> >>
> >> Patrick
> >>
> >> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <sh...@gmail.com>
> >> wrote:
> >> > Hi Rakesh,
> >> >
> >> > Thanks for looking at this. In general even if we find the bug since
> we
> >> > should test it before committing a fix, it seems better to remove the
> >> test
> >> > for now and debug this on a build machine. I'm trying to get access to
> >> it.
> >> >
> >> > Looking at this log:
> >> >
> >>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
> >> >
> >> > Something weird is going on. Sever 3 hasn't started yet, but version
> >> 200000000
> >> > is already being sent around as committed!
> >> >
> >> > 2014-07-21 10:44:50,901 [myid:2] - INFO
> >> >
> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
> >> > - 2 Received version: 200000000 my version: 0
> >> >
> >> >
> >> > and also in leader election messages.
> >> >
> >> > Also weird is that the version of 2 is 0 as if it is a joiner,
> whereas we
> >> > explicitly started it with 100000000.
> >> > Then it makes sense that the new config can't be committed since its
> >> > version is not high enough...
> >> >
> >> > I wonder if its possible that not all servers from the previous test
> are
> >> > dead and they are interfering...
> >> >
> >> >
> >> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:
> >> >
> >> >> Hi Alex,
> >> >>
> >> >> Yeah it is consistently passing in my machine also.
> >> >>
> >> >>
> >> >> I have quickly gone through the
> >> >> testCurrentObserverIsParticipantInNewConfig failure logs in
> >> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version)
> >> has
> >> >> not taken and still leader election is seeing 100000000 (n.config
> >> version).
> >> >> Unfortunately I didn't find the reason for not considering the
> updated
> >> >> config version.
> >> >>
> >> >>
> >> >> Reference:
> >> >>
> >>
> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
> >> >>
> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
> >> >> Notification time out: 51200
> >> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING
> (my
> >> >> state)100000000 (n.config version)
> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING
> (my
> >> >> state)100000000 (n.config version)
> >> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
> >> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
> >> >> Notification time out: 51200
> >> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING
> (my
> >> >> state)100000000 (n.config version)
> >> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING
> (my
> >> >> state)100000000 (n.config version)
> >> >>
> >> >>
> >> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
> >> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING
> (my
> >> >> state)100000000 (n.config version)
> >> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
> >> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
> >> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING
> (my
> >> >> state)100000000 (n.config version)
> >> >>
> >> >>
> >> >> -Rakesh
> >> >>
> >> >> -----Original Message-----
> >> >> From: Alexander Shraer [mailto:shralex@gmail.com]
> >> >> Sent: 22 July 2014 11:57
> >> >> To: dev@zookeeper.apache.org
> >> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
> >> >>
> >> >> I tried to look into it, but the test consistently passes locally on
> two
> >> >> machines.
> >> >> I don't currently have access to the build machine, but I can try to
> ask
> >> >> for access.
> >> >> Unless anyone has a better suggestion, we could remove the failing
> test
> >> in
> >> >> the meanwhile and open a JIRA to add it back...
> >> >>
> >> >>
> >> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
> >> wrote:
> >> >>
> >> >> > I'm seeing alot of test failures in
> >> >> > testCurrentObserverIsParticipantInNewConfig could someone take a
> look?
> >> >> > Seems related to ZOOKEEPER-1807 recent commit.
> >> >> >
> >> >> >
> >> >> >
> >> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
> >> >> >
> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
> >> >> > tabpanel#comment-14069024
> >> >> >
> >> >> > Patrick
> >> >> >
> >> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
> >> >> > <ra...@gmail.com> wrote:
> >> >> > > lgtm +1
> >> >> > >
> >> >> > >
> >> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
> >> >> > > <fp...@yahoo.com.invalid>
> >> >> > wrote:
> >> >> > >
> >> >> > >> +1 for having an RC this week. Since this is an alpha release, I
> >> >> > >> +think
> >> >> > 72
> >> >> > >> biz hours is enough for the vote.
> >> >> > >>
> >> >> > >> -Flavio
> >> >> > >>
> >> >> > >> > -----Original Message-----
> >> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
> >> >> > >> > Sent: 21 July 2014 18:55
> >> >> > >> > To: DevZooKeeper
> >> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> >> >> > >> >
> >> >> > >> > I fixed a number of issues. I also started a few threads with
> >> >> > >> > builds@
> >> >> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
> >> >> > through a
> >> >> > >> > number of findbugs issues, it's not closed yet but it's pretty
> >> >> close.
> >> >> > >> >
> >> >> > >> > I don't see why we can't create an RC and start voting this
> week
> >> >> > though.
> >> >> > >> > Anyone disagree?
> >> >> > >> >
> >> >> > >> > How long should we let the vote run, the std 72 biz hours or
> >> >> > >> > should we
> >> >> > >> plan
> >> >> > >> > for more to allow folks more time to test?
> >> >> > >> >
> >> >> > >> > Patrick
> >> >> > >> >
> >> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
> >> >> > >> > <rg...@itevenworks.net> wrote:
> >> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org>
> wrote:
> >> >> > >> > >
> >> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs -
> I'm
> >> >> > trying
> >> >> > >> > >> to fix some of the jobs that were broken during the recent
> >> >> > >> > >> host upgrade.
> >> >> > >> > >>
> >> >> > >> > >
> >> >> > >> > > How are things looking? Is it likely that we can have a
> 3.5.0
> >> >> > >> > > alpha release week or are we still blocked on Jenkins?
> >> >> > >> > >
> >> >> > >> > >
> >> >> > >> > > -rgs
> >> >> > >> > >
> >> >> > >> > >
> >> >> > >> > >
> >> >> > >> > >
> >> >> > >> > >
> >> >> > >> > >
> >> >> > >> > >> Patrick
> >> >> > >> > >>
> >> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> >> >> > >> > >> <mi...@cs.stanford.edu>
> >> >> > >> > >> wrote:
> >> >> > >> > >> > I'll check in ZOOKEEPER-1683.
> >> >> > >> > >> >
> >> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> >> >> > >> > >> > <sh...@gmail.com>
> >> >> > >> > >> wrote:
> >> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1
> and
> >> >> > >> > >> >> all
> >> >> > >> > >> subsequent
> >> >> > >> > >> >> changes were formatting as suggested by Rakesh.
> >> >> > >> > >> >>
> >> >> > >> > >> >>
> >> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
> >> >> > >> > >> >> <phunt@apache.org
> >> >> > >
> >> >> > >> > wrote:
> >> >> > >> > >> >>
> >> >> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
> >> >> > >> > >> >>> for
> >> >> > e.g.
> >> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
> >> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
> >> >> > >> > >> >>> startServer fix
> >> >> > and
> >> >> > >> > >> >>> some other flakeys that have been addressed go a long
> way
> >> >> > >> > >> >>> on
> >> >> > that
> >> >> > >> > >> >>> issue) but I think the findbugs problem should be
> cleaned
> >> >> > >> > >> >>> up before we cut a release. I started a separate
> thread to
> >> >> > >> > >> >>> discuss
> >> >> > >> the
> >> >> > >> > findbugs issue.
> >> >> > >> > >> >>>
> >> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> >> >> > >> > >> >>>
> >> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
> >> >> > >> > >> >>>
> >> >> > >> > >> >>> Patrick
> >> >> > >> > >> >>>
> >> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> >> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
> >> >> > >> > >> >>> > My take:
> >> >> > >> > >> >>> >
> >> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it
> can
> >> >> > >> > >> >>> > go
> >> >> > in.
> >> >> > >> > >> >>> > See
> >> >> > >> > >> the
> >> >> > >> > >> >>> jira for comments.
> >> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
> >> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
> >> >> > >> > >> >>> > ZK-1919
> >> >> > and
> >> >> > >> > >> ZK-1910,
> >> >> > >> > >> >>> we can leave it for the second alpha.
> >> >> > >> > >> >>> >
> >> >> > >> > >> >>> > If you agree with this, then we should be able to
> cut a
> >> >> > >> > >> >>> > candidate by
> >> >> > >> > >> the
> >> >> > >> > >> >>> end of this week.
> >> >> > >> > >> >>> >
> >> >> > >> > >> >>> > -Flavio
> >> >> > >> > >> >>> >
> >> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
> >> >> > >> > >> >>> > <ph...@apache.org>
> >> >> > >> wrote:
> >> >> > >> > >> >>> >
> >> >> > >> > >> >>> >> Per my previous note you can now see the c client
> test
> >> >> > >> > >> >>> >> log output
> >> >> > >> > >> here
> >> >> > >> > >> >>> >> in the "build artifacts" section:
> >> >> > >> > >> >>> >>
> >> >> > >> > >> >>>
> >> >> > >> > >>
> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
> >> >> > >> > >> r-
> >> >> > >> > trunk
> >> >> > >> > >> /2372/
> >> >> > >> > >> >>> >>
> >> >> > >> > >> >>> >> Patrick
> >> >> > >> > >> >>> >>
> >> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> >> >> > >> > >> >>> >> <ph...@apache.org>
> >> >> > >> > >> wrote:
> >> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not
> clear
> >> >> > >> > >> >>> >>> to me which
> >> >> > >> > >> >>> >>> one(s?) is new?)
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
> >> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
> >> >> > >> > >> >>> >>> updated the "archive" list
> >> >> > >> > >> to
> >> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it
> won't
> >> >> > >> > >> >>> >>> go
> >> >> > to
> >> >> > >> > >> console
> >> >> > >> > >> >>> >>> at least we can debug when there is a failure.
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the
> jetty
> >> >> > server
> >> >> > >> > >> support
> >> >> > >> > >> >>> >>> and it looks like that should be ready soon.
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>> Raul also requested that someone prioritize
> reviewing
> >> >> > >> > >> "ZOOKEEPER-1919
> >> >> > >> > >> >>> >>> Update the C implementation of removeWatches to
> have
> >> >> > >> > >> >>> >>> it
> >> >> > >> > match
> >> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
> >> >> > >> Flavio/Michi?
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c
> client
> >> >> > >> > >> >>> >>> reconfig
> >> >> > >> > >> test -
> >> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>> Based on previous comments it looks like we're
> pretty
> >> >> > close.
> >> >> > >> > >> >>> >>> Do
> >> >> > >> > >> folks
> >> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
> >> >> > >> > >> >>> >>> (with a few
> >> >> > >> > >> pending
> >> >> > >> > >> >>> >>> as above)
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>> Patrick
> >> >> > >> > >> >>> >>>
> >> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
> >> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
> >> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >> >> > >> > >> >>> <fp...@yahoo.com.invalid>
> >> >> > >> > >> >>> >>>> wrote:
> >> >> > >> > >> >>> >>>>>
> >> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
> >> >> > >> > >> >>> >>>>> release
> >> >> > an
> >> >> > >> > >> >>> >>>>> alpha
> >> >> > >> > >> >>> version
> >> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
> >> >> > >> > >> >>> >>>> weeks
> >> >> > time?
> >> >> > >> > >> >>> >>>>>
> >> >> > >> > >> >>> >>>>
> >> >> > >> > >> >>> >>>> +1
> >> >> > >> > >> >>> >>>>
> >> >> > >> > >> >>> >>>> -rgs
> >> >> > >> > >> >>> >>>>
> >> >> > >> > >> >>> >>>>> -Flavio
> >> >> > >> > >> >>> >>>>>
> >> >> > >> > >> >>> >>>>>
> >> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
> >> >> > Segalés <
> >> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
> >> >> > >> > >> >>> >>>>>
> >> >> > >> > >> >>> >>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
> >> >> > >> > >> >>> >>>>>> <ph...@apache.org>
> >> >> > >> > wrote:
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0
> (from 8
> >> >> > >> > >> >>> >>>>>>> in
> >> >> > the
> >> >> > >> > >> >>> >>>>>>> last
> >> >> > >> > >> >>> check).
> >> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
> >> >> > >> > >> >>> >>>>>>> Camille is
> >> >> > >> > >> threatening
> >> >> > >> > >> >>> to
> >> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in
> general
> >> >> > >> > >> >>> >>>>>>> on
> >> >> > the
> >> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
> >> >> > >> > >> >>> >>>>>>>
> >> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
> >> >> > >> > >> >>> >>>>>>> should we drop
> >> >> > >> > >> jdk6
> >> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
> >> >> > >> > >> >>> >>>>>>> but I suspect
> >> >> > >> > >> some
> >> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
> >> >> > >> > >> >>> >>>>>>> forward though,
> >> >> > >> > >> >>> can't
> >> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
> >> >> > currently
> >> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> >> >> > >> > >> >>> >>>>>>>
> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >> >> > >> > >> >>> >>>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>> Extra eyes/review for
> >> >> > >> > >> >>> >>>>
> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
> >> >> > >> > >> >>> >>>>>> Observers with the
> >> >> > >> > >> >>> upcoming
> >> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
> >> >> wild...).
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>> -rgs
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>> Patrick
> >> >> > >> > >> >>> >>>>>>>
> >> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio
> Junqueira
> >> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
> >> >> > >> > >> >>> >>>>>>>> but I need a +1
> >> >> > >> > >> >>> >>>> there. I
> >> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
> >> >> > >> > >> >>> >>>>>>> failed in the
> >> >> > >> > >> last QA
> >> >> > >> > >> >>> run
> >> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect
> LETest,
> >> >> > >> > >> >>> >>>>>>> and
> >> >> > in
> >> >> > >> > >> >>> >>>>>>> fact
> >> >> > >> > >> it
> >> >> > >> > >> >>> fails
> >> >> > >> > >> >>> >>>> in
> >> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure
> doesn't
> >> >> > >> > >> >>> >>>>>>> seem
> >> >> > to
> >> >> > >> > >> >>> >>>>>>> be
> >> >> > >> > >> >>> related
> >> >> > >> > >> >>> >>>> to the
> >> >> > >> > >> >>> >>>>>>> patch.
> >> >> > >> > >> >>> >>>>>>>>
> >> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
> >> >> > >> > >> >>> >>>>>>>> anything concrete
> >> >> > >> > >> about
> >> >> > >> > >> >>> it.
> >> >> > >> > >> >>> >>>>>>>>
> >> >> > >> > >> >>> >>>>>>>> -Flavio
> >> >> > >> > >> >>> >>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>
> >> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick
> Hunt <
> >> >> > >> > >> phunt@apache.org>
> >> >> > >> > >> >>> >>>> wrote:
> >> >> > >> > >> >>> >>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> >> >> > >> > >> reviewed/finalized
> >> >> > >> > >> >>> before
> >> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting
> an
> >> >> > >> > >> >>> >>>>>>>>> RC
> >> >> > >> > soonish...
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>> Patrick
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
> >> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
> >> wrote:
> >> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and
> ZK-1863
> >> in.
> >> >> > >> > >> >>> >>>>>>>>>> They are
> >> >> > >> > >> both
> >> >> > >> > >> >>> >>>> patch
> >> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
> >> >> > >> > >> >>> >>>>>>> open because we
> >> >> > >> > >> >>> need a
> >> >> > >> > >> >>> >>>> 3.4
> >> >> > >> > >> >>> >>>>>>> patch.
> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>> -Flavio
> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
> >> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
> >> >> > >> > >> >>> wrote:
> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for
> a
> >> >> > while, a
> >> >> > >> > >> >>> >>>>>>>>>>> few
> >> >> > >> > >> >>> people
> >> >> > >> > >> >>> >>>> have
> >> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted
> me
> >> >> > >> > >> >>> >>>>>>>>>>> personally
> >> >> > >> > >> that
> >> >> > >> > >> >>> they
> >> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the
> first
> >> >> > >> > >> >>> >>>>>>>>>>> 3.5
> >> >> > >> > release.
> >> >> > >> > >> Every
> >> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
> >> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
> >> >> > >> > >> never
> >> >> > >> > >> >>> get
> >> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of
> great
> >> >> > >> > >> >>> >>>>>>>>>>> new features,
> >> >> > >> > >> >>> lots of
> >> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
> >> >> > >> > >> >>> >>>>>>>>>>> that folks can
> >> >> > >> > >> use
> >> >> > >> > >> >>> it,
> >> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
> >> >> > >> > >> >>> >>>>>>>>>>> for the known
> >> >> > >> > >> >>> flakey
> >> >> > >> > >> >>> >>>> test
> >> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed
> today to
> >> >> > >> > trunk.
> >> >> > >> > >> >>> >>>>>>>>>>> Note
> >> >> > >> > >> that
> >> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
> >> >> > >> > >> >>> >>>>>>>>>>> jdk7
> >> >> > and
> >> >> > >> > jdk8.
> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should
> plan
> >> >> > >> > >> >>> >>>>>>>>>>> our
> >> >> > >> > >> releases:
> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
> >> >> > for
> >> >> > >> > >> >>> >>>>>>>>>>> some
> >> >> > >> > >> time.
> >> >> > >> > >> >>> >>>> What I
> >> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to
> what
> >> >> > >> > >> >>> >>>>>>>>>>> we
> >> >> > did
> >> >> > >> > >> >>> >>>>>>>>>>> for
> >> >> > >> > >> 3.4.
> >> >> > >> > >> >>> >>>> (this is
> >> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during
> their
> >> >> > Hadoop 2
> >> >> > >> > >> release
> >> >> > >> > >> >>> >>>> cycle)
> >> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
> >> >> > >> > >> >>> >>>>>>>>>>> something people
> >> >> > >> > >> can run
> >> >> > >> > >> >>> >>>> and
> >> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
> >> >> > >> > >> >>> >>>>>>>>>>> and
> >> >> > feel
> >> >> > >> > >> >>> comfortable
> >> >> > >> > >> >>> >>>> with
> >> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch
> to
> >> >> beta.
> >> >> > >> > >> >>> >>>>>>>>>>> Once we
> >> >> > >> > >> get
> >> >> > >> > >> >>> >>>> some
> >> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta
> moniker
> >> >> > >> > and
> >> >> > >> > >> >>> >>>>>>>>>>> look at
> >> >> > >> > >> >>> making
> >> >> > >> > >> >>> >>>> it
> >> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will
> become
> >> >> > >> > >> >>> >>>>>>>>>>> the
> >> >> > >> > >> >>> "current/stable"
> >> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> e.g.
> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
> >> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers)
> 3.5.3-beta
> >> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
> >> >> > >> > >> >>> >>>>>>>>>>> also not
> >> >> > >> > >> "stable" vs
> >> >> > >> > >> >>> >>>> 3.4.x,
> >> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
> >> >> > >> > >> >>> >>>>>>>>>>> expect things to
> >> >> > >> > >> shake
> >> >> > >> > >> >>> >>>> out)
> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
> >> >> > >> > >> >>> >>>>>>>>>>> ....
> >> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
> >> >> > production
> >> >> > >> > >> >>> >>>>>>>>>>> use,
> >> >> > >> > >> >>> stable,
> >> >> > >> > >> >>> >>>>>>> etc...
> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
> >> >> > >> > >> >>> >>>>>>>>>>> these something
> >> >> > >> > >> that
> >> >> > >> > >> >>> >>>> should
> >> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
> >> >> > >> > >> >>> >>>>>>>>>>> days. If folks
> >> >> > >> > >> find
> >> >> > >> > >> >>> >>>> this a
> >> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball
> rolling to
> >> >> > >> > >> >>> >>>>>>>>>>> cut
> >> >> > an
> >> >> > >> RC.
> >> >> > >> > >> >>> >>>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>>> Patrick
> >> >> > >> > >> >>> >>>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>>>
> >> >> > >> > >> >>> >>>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >>>>>>
> >> >> > >> > >> >>> >
> >> >> > >> > >> >>>
> >> >> > >> > >>
> >> >> > >>
> >> >> > >>
> >> >> >
> >> >>
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@gmail.com>.
Looks like 3 hasn't been removed (unfortunately the assertion doesn't
include any msg detail, but that's the way it looks to me like the
test is setup):

        if (leavingServers != null) {
            for (String leaving : leavingServers)

Assert.assertFalse(configStr.contains("server.".concat(leaving)));
        }

which is called from:

        qu.restart(2);
        // Now that 2 is back up, they'll complete the reconfig removing 3 and
        // can process other ops.
        testServerHasConfig(zkArr[1], null, leavingServers);

It seems like the problem is that testServerHasConfig is not waiting
for the configuration to be updated? In this case 2 was just restarted
and 3 hasn't had a chance to be removed? (on a slower machine say,
which might be why you aren't seeing the issue? hence the flakeyness)

Patrick

On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com> wrote:
> Hi Patrick, I'm not sure why you're seeing this - it consistently passes on
> my machine. In case you'd like to take a look, the test has tons of
> comments explaining the scenario. Let me know how I can help.
>
>
> On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>
>> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
>> multiple times (not every time, but ~50%, so flakey) in the last few
>> days. It's failing both on jdk6 and jdk7. (this is my personal
>> jenkins, I haven't see any other failures than this during the past
>> few days).
>>
>> junit.framework.AssertionFailedError
>> at
>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>> at
>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>> at
>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>
>> Patrick
>>
>> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <sh...@gmail.com>
>> wrote:
>> > Hi Rakesh,
>> >
>> > Thanks for looking at this. In general even if we find the bug since we
>> > should test it before committing a fix, it seems better to remove the
>> test
>> > for now and debug this on a build machine. I'm trying to get access to
>> it.
>> >
>> > Looking at this log:
>> >
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>> >
>> > Something weird is going on. Sever 3 hasn't started yet, but version
>> 200000000
>> > is already being sent around as committed!
>> >
>> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>> > [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
>> > - 2 Received version: 200000000 my version: 0
>> >
>> >
>> > and also in leader election messages.
>> >
>> > Also weird is that the version of 2 is 0 as if it is a joiner, whereas we
>> > explicitly started it with 100000000.
>> > Then it makes sense that the new config can't be committed since its
>> > version is not high enough...
>> >
>> > I wonder if its possible that not all servers from the previous test are
>> > dead and they are interfering...
>> >
>> >
>> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:
>> >
>> >> Hi Alex,
>> >>
>> >> Yeah it is consistently passing in my machine also.
>> >>
>> >>
>> >> I have quickly gone through the
>> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version)
>> has
>> >> not taken and still leader election is seeing 100000000 (n.config
>> version).
>> >> Unfortunately I didn't find the reason for not considering the updated
>> >> config version.
>> >>
>> >>
>> >> Reference:
>> >>
>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>> >>
>> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
>> >> Notification time out: 51200
>> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
>> >> Notification time out: 51200
>> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >>
>> >>
>> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >>
>> >>
>> >> -Rakesh
>> >>
>> >> -----Original Message-----
>> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>> >> Sent: 22 July 2014 11:57
>> >> To: dev@zookeeper.apache.org
>> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >>
>> >> I tried to look into it, but the test consistently passes locally on two
>> >> machines.
>> >> I don't currently have access to the build machine, but I can try to ask
>> >> for access.
>> >> Unless anyone has a better suggestion, we could remove the failing test
>> in
>> >> the meanwhile and open a JIRA to add it back...
>> >>
>> >>
>> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
>> wrote:
>> >>
>> >> > I'm seeing alot of test failures in
>> >> > testCurrentObserverIsParticipantInNewConfig could someone take a look?
>> >> > Seems related to ZOOKEEPER-1807 recent commit.
>> >> >
>> >> >
>> >> >
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>> >> > 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>> >> > tabpanel#comment-14069024
>> >> >
>> >> > Patrick
>> >> >
>> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>> >> > <ra...@gmail.com> wrote:
>> >> > > lgtm +1
>> >> > >
>> >> > >
>> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>> >> > > <fp...@yahoo.com.invalid>
>> >> > wrote:
>> >> > >
>> >> > >> +1 for having an RC this week. Since this is an alpha release, I
>> >> > >> +think
>> >> > 72
>> >> > >> biz hours is enough for the vote.
>> >> > >>
>> >> > >> -Flavio
>> >> > >>
>> >> > >> > -----Original Message-----
>> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>> >> > >> > Sent: 21 July 2014 18:55
>> >> > >> > To: DevZooKeeper
>> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >> > >> >
>> >> > >> > I fixed a number of issues. I also started a few threads with
>> >> > >> > builds@
>> >> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
>> >> > through a
>> >> > >> > number of findbugs issues, it's not closed yet but it's pretty
>> >> close.
>> >> > >> >
>> >> > >> > I don't see why we can't create an RC and start voting this week
>> >> > though.
>> >> > >> > Anyone disagree?
>> >> > >> >
>> >> > >> > How long should we let the vote run, the std 72 biz hours or
>> >> > >> > should we
>> >> > >> plan
>> >> > >> > for more to allow folks more time to test?
>> >> > >> >
>> >> > >> > Patrick
>> >> > >> >
>> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>> >> > >> > <rg...@itevenworks.net> wrote:
>> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
>> >> > >> > >
>> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
>> >> > trying
>> >> > >> > >> to fix some of the jobs that were broken during the recent
>> >> > >> > >> host upgrade.
>> >> > >> > >>
>> >> > >> > >
>> >> > >> > > How are things looking? Is it likely that we can have a 3.5.0
>> >> > >> > > alpha release week or are we still blocked on Jenkins?
>> >> > >> > >
>> >> > >> > >
>> >> > >> > > -rgs
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >> Patrick
>> >> > >> > >>
>> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>> >> > >> > >> <mi...@cs.stanford.edu>
>> >> > >> > >> wrote:
>> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>> >> > >> > >> >
>> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>> >> > >> > >> > <sh...@gmail.com>
>> >> > >> > >> wrote:
>> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and
>> >> > >> > >> >> all
>> >> > >> > >> subsequent
>> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>> >> > >> > >> >>
>> >> > >> > >> >>
>> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>> >> > >> > >> >> <phunt@apache.org
>> >> > >
>> >> > >> > wrote:
>> >> > >> > >> >>
>> >> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
>> >> > >> > >> >>> for
>> >> > e.g.
>> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
>> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
>> >> > >> > >> >>> startServer fix
>> >> > and
>> >> > >> > >> >>> some other flakeys that have been addressed go a long way
>> >> > >> > >> >>> on
>> >> > that
>> >> > >> > >> >>> issue) but I think the findbugs problem should be cleaned
>> >> > >> > >> >>> up before we cut a release. I started a separate thread to
>> >> > >> > >> >>> discuss
>> >> > >> the
>> >> > >> > findbugs issue.
>> >> > >> > >> >>>
>> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>> >> > >> > >> >>>
>> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
>> >> > >> > >> >>>
>> >> > >> > >> >>> Patrick
>> >> > >> > >> >>>
>> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>> >> > >> > >> >>> > My take:
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can
>> >> > >> > >> >>> > go
>> >> > in.
>> >> > >> > >> >>> > See
>> >> > >> > >> the
>> >> > >> > >> >>> jira for comments.
>> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
>> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
>> >> > >> > >> >>> > ZK-1919
>> >> > and
>> >> > >> > >> ZK-1910,
>> >> > >> > >> >>> we can leave it for the second alpha.
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > If you agree with this, then we should be able to cut a
>> >> > >> > >> >>> > candidate by
>> >> > >> > >> the
>> >> > >> > >> >>> end of this week.
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > -Flavio
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>> >> > >> > >> >>> > <ph...@apache.org>
>> >> > >> wrote:
>> >> > >> > >> >>> >
>> >> > >> > >> >>> >> Per my previous note you can now see the c client test
>> >> > >> > >> >>> >> log output
>> >> > >> > >> here
>> >> > >> > >> >>> >> in the "build artifacts" section:
>> >> > >> > >> >>> >>
>> >> > >> > >> >>>
>> >> > >> > >>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>> >> > >> > >> r-
>> >> > >> > trunk
>> >> > >> > >> /2372/
>> >> > >> > >> >>> >>
>> >> > >> > >> >>> >> Patrick
>> >> > >> > >> >>> >>
>> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>> >> > >> > >> >>> >> <ph...@apache.org>
>> >> > >> > >> wrote:
>> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear
>> >> > >> > >> >>> >>> to me which
>> >> > >> > >> >>> >>> one(s?) is new?)
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
>> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
>> >> > >> > >> >>> >>> updated the "archive" list
>> >> > >> > >> to
>> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it won't
>> >> > >> > >> >>> >>> go
>> >> > to
>> >> > >> > >> console
>> >> > >> > >> >>> >>> at least we can debug when there is a failure.
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
>> >> > server
>> >> > >> > >> support
>> >> > >> > >> >>> >>> and it looks like that should be ready soon.
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Raul also requested that someone prioritize reviewing
>> >> > >> > >> "ZOOKEEPER-1919
>> >> > >> > >> >>> >>> Update the C implementation of removeWatches to have
>> >> > >> > >> >>> >>> it
>> >> > >> > match
>> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
>> >> > >> Flavio/Michi?
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
>> >> > >> > >> >>> >>> reconfig
>> >> > >> > >> test -
>> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Based on previous comments it looks like we're pretty
>> >> > close.
>> >> > >> > >> >>> >>> Do
>> >> > >> > >> folks
>> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
>> >> > >> > >> >>> >>> (with a few
>> >> > >> > >> pending
>> >> > >> > >> >>> >>> as above)
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Patrick
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
>> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>> >> > >> > >> >>> >>>> wrote:
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
>> >> > >> > >> >>> >>>>> release
>> >> > an
>> >> > >> > >> >>> >>>>> alpha
>> >> > >> > >> >>> version
>> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
>> >> > >> > >> >>> >>>> weeks
>> >> > time?
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>
>> >> > >> > >> >>> >>>> +1
>> >> > >> > >> >>> >>>>
>> >> > >> > >> >>> >>>> -rgs
>> >> > >> > >> >>> >>>>
>> >> > >> > >> >>> >>>>> -Flavio
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
>> >> > Segalés <
>> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>> >> > >> > wrote:
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8
>> >> > >> > >> >>> >>>>>>> in
>> >> > the
>> >> > >> > >> >>> >>>>>>> last
>> >> > >> > >> >>> check).
>> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
>> >> > >> > >> >>> >>>>>>> Camille is
>> >> > >> > >> threatening
>> >> > >> > >> >>> to
>> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in general
>> >> > >> > >> >>> >>>>>>> on
>> >> > the
>> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
>> >> > >> > >> >>> >>>>>>> should we drop
>> >> > >> > >> jdk6
>> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
>> >> > >> > >> >>> >>>>>>> but I suspect
>> >> > >> > >> some
>> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
>> >> > >> > >> >>> >>>>>>> forward though,
>> >> > >> > >> >>> can't
>> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
>> >> > currently
>> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>> >> > >> > >> >>> >>>>>>>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>> Extra eyes/review for
>> >> > >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
>> >> > >> > >> >>> >>>>>> Observers with the
>> >> > >> > >> >>> upcoming
>> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
>> >> wild...).
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>> -rgs
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>> Patrick
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
>> >> > >> > >> >>> >>>>>>>> but I need a +1
>> >> > >> > >> >>> >>>> there. I
>> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
>> >> > >> > >> >>> >>>>>>> failed in the
>> >> > >> > >> last QA
>> >> > >> > >> >>> run
>> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest,
>> >> > >> > >> >>> >>>>>>> and
>> >> > in
>> >> > >> > >> >>> >>>>>>> fact
>> >> > >> > >> it
>> >> > >> > >> >>> fails
>> >> > >> > >> >>> >>>> in
>> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't
>> >> > >> > >> >>> >>>>>>> seem
>> >> > to
>> >> > >> > >> >>> >>>>>>> be
>> >> > >> > >> >>> related
>> >> > >> > >> >>> >>>> to the
>> >> > >> > >> >>> >>>>>>> patch.
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
>> >> > >> > >> >>> >>>>>>>> anything concrete
>> >> > >> > >> about
>> >> > >> > >> >>> it.
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>> -Flavio
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
>> >> > >> > >> phunt@apache.org>
>> >> > >> > >> >>> >>>> wrote:
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
>> >> > >> > >> reviewed/finalized
>> >> > >> > >> >>> before
>> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an
>> >> > >> > >> >>> >>>>>>>>> RC
>> >> > >> > soonish...
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>> Patrick
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
>> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
>> wrote:
>> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863
>> in.
>> >> > >> > >> >>> >>>>>>>>>> They are
>> >> > >> > >> both
>> >> > >> > >> >>> >>>> patch
>> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
>> >> > >> > >> >>> >>>>>>> open because we
>> >> > >> > >> >>> need a
>> >> > >> > >> >>> >>>> 3.4
>> >> > >> > >> >>> >>>>>>> patch.
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>> -Flavio
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>> >> > >> > >> >>> wrote:
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
>> >> > while, a
>> >> > >> > >> >>> >>>>>>>>>>> few
>> >> > >> > >> >>> people
>> >> > >> > >> >>> >>>> have
>> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
>> >> > >> > >> >>> >>>>>>>>>>> personally
>> >> > >> > >> that
>> >> > >> > >> >>> they
>> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the first
>> >> > >> > >> >>> >>>>>>>>>>> 3.5
>> >> > >> > release.
>> >> > >> > >> Every
>> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
>> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>> >> > >> > >> never
>> >> > >> > >> >>> get
>> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great
>> >> > >> > >> >>> >>>>>>>>>>> new features,
>> >> > >> > >> >>> lots of
>> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
>> >> > >> > >> >>> >>>>>>>>>>> that folks can
>> >> > >> > >> use
>> >> > >> > >> >>> it,
>> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
>> >> > >> > >> >>> >>>>>>>>>>> for the known
>> >> > >> > >> >>> flakey
>> >> > >> > >> >>> >>>> test
>> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
>> >> > >> > trunk.
>> >> > >> > >> >>> >>>>>>>>>>> Note
>> >> > >> > >> that
>> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
>> >> > >> > >> >>> >>>>>>>>>>> jdk7
>> >> > and
>> >> > >> > jdk8.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan
>> >> > >> > >> >>> >>>>>>>>>>> our
>> >> > >> > >> releases:
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>> >> > for
>> >> > >> > >> >>> >>>>>>>>>>> some
>> >> > >> > >> time.
>> >> > >> > >> >>> >>>> What I
>> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what
>> >> > >> > >> >>> >>>>>>>>>>> we
>> >> > did
>> >> > >> > >> >>> >>>>>>>>>>> for
>> >> > >> > >> 3.4.
>> >> > >> > >> >>> >>>> (this is
>> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
>> >> > Hadoop 2
>> >> > >> > >> release
>> >> > >> > >> >>> >>>> cycle)
>> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
>> >> > >> > >> >>> >>>>>>>>>>> something people
>> >> > >> > >> can run
>> >> > >> > >> >>> >>>> and
>> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
>> >> > >> > >> >>> >>>>>>>>>>> and
>> >> > feel
>> >> > >> > >> >>> comfortable
>> >> > >> > >> >>> >>>> with
>> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to
>> >> beta.
>> >> > >> > >> >>> >>>>>>>>>>> Once we
>> >> > >> > >> get
>> >> > >> > >> >>> >>>> some
>> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
>> >> > >> > and
>> >> > >> > >> >>> >>>>>>>>>>> look at
>> >> > >> > >> >>> making
>> >> > >> > >> >>> >>>> it
>> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become
>> >> > >> > >> >>> >>>>>>>>>>> the
>> >> > >> > >> >>> "current/stable"
>> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> e.g.
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
>> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers) 3.5.3-beta
>> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
>> >> > >> > >> >>> >>>>>>>>>>> also not
>> >> > >> > >> "stable" vs
>> >> > >> > >> >>> >>>> 3.4.x,
>> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
>> >> > >> > >> >>> >>>>>>>>>>> expect things to
>> >> > >> > >> shake
>> >> > >> > >> >>> >>>> out)
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>> >> > >> > >> >>> >>>>>>>>>>> ....
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
>> >> > production
>> >> > >> > >> >>> >>>>>>>>>>> use,
>> >> > >> > >> >>> stable,
>> >> > >> > >> >>> >>>>>>> etc...
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
>> >> > >> > >> >>> >>>>>>>>>>> these something
>> >> > >> > >> that
>> >> > >> > >> >>> >>>> should
>> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
>> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>> >> > >> > >> find
>> >> > >> > >> >>> >>>> this a
>> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to
>> >> > >> > >> >>> >>>>>>>>>>> cut
>> >> > an
>> >> > >> RC.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Patrick
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >
>> >> > >> > >> >>>
>> >> > >> > >>
>> >> > >>
>> >> > >>
>> >> >
>> >>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Hm. I'm a bit concerned by something related to reconfig.

I see that "getConfig" and "reconfig" APIs were added to the ZooKeeper
client object. It seems that these should not be exposed to regular
users, rather just to administrators. What security controls have we
put in place around these apis? (sorry if I missed it, it's not clear
from the javadoc, and I don't believe there are any reconfig updates
to the admin guide yet?).

Patrick

On Tue, Jul 22, 2014 at 11:18 AM, Patrick Hunt <ph...@apache.org> wrote:
> Looks like 3 hasn't been removed (unfortunately the assertion doesn't
> include any msg detail, but that's the way it looks to me like the
> test is setup):
>
>         if (leavingServers != null) {
>             for (String leaving : leavingServers)
>
> Assert.assertFalse(configStr.contains("server.".concat(leaving)));
>         }
>
> which is called from:
>
>         qu.restart(2);
>         // Now that 2 is back up, they'll complete the reconfig removing 3 and
>         // can process other ops.
>         testServerHasConfig(zkArr[1], null, leavingServers);
>
> It seems like the problem is that testServerHasConfig is not waiting
> for the configuration to be updated? In this case 2 was just restarted
> and 3 hasn't had a chance to be removed? (on a slower machine say,
> which might be why you aren't seeing the issue? hence the flakeyness)
>
> Patrick
>
> On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com> wrote:
>> Hi Patrick, I'm not sure why you're seeing this - it consistently passes on
>> my machine. In case you'd like to take a look, the test has tons of
>> comments explaining the scenario. Let me know how I can help.
>>
>>
>> On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
>>> multiple times (not every time, but ~50%, so flakey) in the last few
>>> days. It's failing both on jdk6 and jdk7. (this is my personal
>>> jenkins, I haven't see any other failures than this during the past
>>> few days).
>>>
>>> junit.framework.AssertionFailedError
>>> at
>>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>>> at
>>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>>> at
>>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>>
>>> Patrick
>>>
>>> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <sh...@gmail.com>
>>> wrote:
>>> > Hi Rakesh,
>>> >
>>> > Thanks for looking at this. In general even if we find the bug since we
>>> > should test it before committing a fix, it seems better to remove the
>>> test
>>> > for now and debug this on a build machine. I'm trying to get access to
>>> it.
>>> >
>>> > Looking at this log:
>>> >
>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>>> >
>>> > Something weird is going on. Sever 3 hasn't started yet, but version
>>> 200000000
>>> > is already being sent around as committed!
>>> >
>>> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>>> > [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
>>> > - 2 Received version: 200000000 my version: 0
>>> >
>>> >
>>> > and also in leader election messages.
>>> >
>>> > Also weird is that the version of 2 is 0 as if it is a joiner, whereas we
>>> > explicitly started it with 100000000.
>>> > Then it makes sense that the new config can't be committed since its
>>> > version is not high enough...
>>> >
>>> > I wonder if its possible that not all servers from the previous test are
>>> > dead and they are interfering...
>>> >
>>> >
>>> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:
>>> >
>>> >> Hi Alex,
>>> >>
>>> >> Yeah it is consistently passing in my machine also.
>>> >>
>>> >>
>>> >> I have quickly gone through the
>>> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>>> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version)
>>> has
>>> >> not taken and still leader election is seeing 100000000 (n.config
>>> version).
>>> >> Unfortunately I didn't find the reason for not considering the updated
>>> >> config version.
>>> >>
>>> >>
>>> >> Reference:
>>> >>
>>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>>> >>
>>> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
>>> >> Notification time out: 51200
>>> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>>> >> state)100000000 (n.config version)
>>> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>>> >> state)100000000 (n.config version)
>>> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>>> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
>>> >> Notification time out: 51200
>>> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>>> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>>> >> state)100000000 (n.config version)
>>> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>>> >> state)100000000 (n.config version)
>>> >>
>>> >>
>>> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>>> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>>> >> state)100000000 (n.config version)
>>> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>>> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>>> >> state)100000000 (n.config version)
>>> >>
>>> >>
>>> >> -Rakesh
>>> >>
>>> >> -----Original Message-----
>>> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>>> >> Sent: 22 July 2014 11:57
>>> >> To: dev@zookeeper.apache.org
>>> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>>> >>
>>> >> I tried to look into it, but the test consistently passes locally on two
>>> >> machines.
>>> >> I don't currently have access to the build machine, but I can try to ask
>>> >> for access.
>>> >> Unless anyone has a better suggestion, we could remove the failing test
>>> in
>>> >> the meanwhile and open a JIRA to add it back...
>>> >>
>>> >>
>>> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
>>> wrote:
>>> >>
>>> >> > I'm seeing alot of test failures in
>>> >> > testCurrentObserverIsParticipantInNewConfig could someone take a look?
>>> >> > Seems related to ZOOKEEPER-1807 recent commit.
>>> >> >
>>> >> >
>>> >> >
>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>>> >> > 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>>> >> > tabpanel#comment-14069024
>>> >> >
>>> >> > Patrick
>>> >> >
>>> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>>> >> > <ra...@gmail.com> wrote:
>>> >> > > lgtm +1
>>> >> > >
>>> >> > >
>>> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>>> >> > > <fp...@yahoo.com.invalid>
>>> >> > wrote:
>>> >> > >
>>> >> > >> +1 for having an RC this week. Since this is an alpha release, I
>>> >> > >> +think
>>> >> > 72
>>> >> > >> biz hours is enough for the vote.
>>> >> > >>
>>> >> > >> -Flavio
>>> >> > >>
>>> >> > >> > -----Original Message-----
>>> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>>> >> > >> > Sent: 21 July 2014 18:55
>>> >> > >> > To: DevZooKeeper
>>> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>>> >> > >> >
>>> >> > >> > I fixed a number of issues. I also started a few threads with
>>> >> > >> > builds@
>>> >> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
>>> >> > through a
>>> >> > >> > number of findbugs issues, it's not closed yet but it's pretty
>>> >> close.
>>> >> > >> >
>>> >> > >> > I don't see why we can't create an RC and start voting this week
>>> >> > though.
>>> >> > >> > Anyone disagree?
>>> >> > >> >
>>> >> > >> > How long should we let the vote run, the std 72 biz hours or
>>> >> > >> > should we
>>> >> > >> plan
>>> >> > >> > for more to allow folks more time to test?
>>> >> > >> >
>>> >> > >> > Patrick
>>> >> > >> >
>>> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>>> >> > >> > <rg...@itevenworks.net> wrote:
>>> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
>>> >> > >> > >
>>> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
>>> >> > trying
>>> >> > >> > >> to fix some of the jobs that were broken during the recent
>>> >> > >> > >> host upgrade.
>>> >> > >> > >>
>>> >> > >> > >
>>> >> > >> > > How are things looking? Is it likely that we can have a 3.5.0
>>> >> > >> > > alpha release week or are we still blocked on Jenkins?
>>> >> > >> > >
>>> >> > >> > >
>>> >> > >> > > -rgs
>>> >> > >> > >
>>> >> > >> > >
>>> >> > >> > >
>>> >> > >> > >
>>> >> > >> > >
>>> >> > >> > >
>>> >> > >> > >> Patrick
>>> >> > >> > >>
>>> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>>> >> > >> > >> <mi...@cs.stanford.edu>
>>> >> > >> > >> wrote:
>>> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>>> >> > >> > >> >
>>> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>>> >> > >> > >> > <sh...@gmail.com>
>>> >> > >> > >> wrote:
>>> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and
>>> >> > >> > >> >> all
>>> >> > >> > >> subsequent
>>> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>>> >> > >> > >> >>
>>> >> > >> > >> >>
>>> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>>> >> > >> > >> >> <phunt@apache.org
>>> >> > >
>>> >> > >> > wrote:
>>> >> > >> > >> >>
>>> >> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
>>> >> > >> > >> >>> for
>>> >> > e.g.
>>> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
>>> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
>>> >> > >> > >> >>> startServer fix
>>> >> > and
>>> >> > >> > >> >>> some other flakeys that have been addressed go a long way
>>> >> > >> > >> >>> on
>>> >> > that
>>> >> > >> > >> >>> issue) but I think the findbugs problem should be cleaned
>>> >> > >> > >> >>> up before we cut a release. I started a separate thread to
>>> >> > >> > >> >>> discuss
>>> >> > >> the
>>> >> > >> > findbugs issue.
>>> >> > >> > >> >>>
>>> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>>> >> > >> > >> >>>
>>> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
>>> >> > >> > >> >>>
>>> >> > >> > >> >>> Patrick
>>> >> > >> > >> >>>
>>> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>>> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>>> >> > >> > >> >>> > My take:
>>> >> > >> > >> >>> >
>>> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can
>>> >> > >> > >> >>> > go
>>> >> > in.
>>> >> > >> > >> >>> > See
>>> >> > >> > >> the
>>> >> > >> > >> >>> jira for comments.
>>> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
>>> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
>>> >> > >> > >> >>> > ZK-1919
>>> >> > and
>>> >> > >> > >> ZK-1910,
>>> >> > >> > >> >>> we can leave it for the second alpha.
>>> >> > >> > >> >>> >
>>> >> > >> > >> >>> > If you agree with this, then we should be able to cut a
>>> >> > >> > >> >>> > candidate by
>>> >> > >> > >> the
>>> >> > >> > >> >>> end of this week.
>>> >> > >> > >> >>> >
>>> >> > >> > >> >>> > -Flavio
>>> >> > >> > >> >>> >
>>> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>>> >> > >> > >> >>> > <ph...@apache.org>
>>> >> > >> wrote:
>>> >> > >> > >> >>> >
>>> >> > >> > >> >>> >> Per my previous note you can now see the c client test
>>> >> > >> > >> >>> >> log output
>>> >> > >> > >> here
>>> >> > >> > >> >>> >> in the "build artifacts" section:
>>> >> > >> > >> >>> >>
>>> >> > >> > >> >>>
>>> >> > >> > >>
>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>>> >> > >> > >> r-
>>> >> > >> > trunk
>>> >> > >> > >> /2372/
>>> >> > >> > >> >>> >>
>>> >> > >> > >> >>> >> Patrick
>>> >> > >> > >> >>> >>
>>> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>>> >> > >> > >> >>> >> <ph...@apache.org>
>>> >> > >> > >> wrote:
>>> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear
>>> >> > >> > >> >>> >>> to me which
>>> >> > >> > >> >>> >>> one(s?) is new?)
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
>>> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
>>> >> > >> > >> >>> >>> updated the "archive" list
>>> >> > >> > >> to
>>> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it won't
>>> >> > >> > >> >>> >>> go
>>> >> > to
>>> >> > >> > >> console
>>> >> > >> > >> >>> >>> at least we can debug when there is a failure.
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
>>> >> > server
>>> >> > >> > >> support
>>> >> > >> > >> >>> >>> and it looks like that should be ready soon.
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>> Raul also requested that someone prioritize reviewing
>>> >> > >> > >> "ZOOKEEPER-1919
>>> >> > >> > >> >>> >>> Update the C implementation of removeWatches to have
>>> >> > >> > >> >>> >>> it
>>> >> > >> > match
>>> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
>>> >> > >> Flavio/Michi?
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
>>> >> > >> > >> >>> >>> reconfig
>>> >> > >> > >> test -
>>> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>> Based on previous comments it looks like we're pretty
>>> >> > close.
>>> >> > >> > >> >>> >>> Do
>>> >> > >> > >> folks
>>> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
>>> >> > >> > >> >>> >>> (with a few
>>> >> > >> > >> pending
>>> >> > >> > >> >>> >>> as above)
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>> Patrick
>>> >> > >> > >> >>> >>>
>>> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
>>> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>>> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>>> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>>> >> > >> > >> >>> >>>> wrote:
>>> >> > >> > >> >>> >>>>>
>>> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
>>> >> > >> > >> >>> >>>>> release
>>> >> > an
>>> >> > >> > >> >>> >>>>> alpha
>>> >> > >> > >> >>> version
>>> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
>>> >> > >> > >> >>> >>>> weeks
>>> >> > time?
>>> >> > >> > >> >>> >>>>>
>>> >> > >> > >> >>> >>>>
>>> >> > >> > >> >>> >>>> +1
>>> >> > >> > >> >>> >>>>
>>> >> > >> > >> >>> >>>> -rgs
>>> >> > >> > >> >>> >>>>
>>> >> > >> > >> >>> >>>>> -Flavio
>>> >> > >> > >> >>> >>>>>
>>> >> > >> > >> >>> >>>>>
>>> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
>>> >> > Segalés <
>>> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>>> >> > >> > >> >>> >>>>>
>>> >> > >> > >> >>> >>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>>> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>>> >> > >> > wrote:
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8
>>> >> > >> > >> >>> >>>>>>> in
>>> >> > the
>>> >> > >> > >> >>> >>>>>>> last
>>> >> > >> > >> >>> check).
>>> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
>>> >> > >> > >> >>> >>>>>>> Camille is
>>> >> > >> > >> threatening
>>> >> > >> > >> >>> to
>>> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in general
>>> >> > >> > >> >>> >>>>>>> on
>>> >> > the
>>> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
>>> >> > >> > >> >>> >>>>>>>
>>> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
>>> >> > >> > >> >>> >>>>>>> should we drop
>>> >> > >> > >> jdk6
>>> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
>>> >> > >> > >> >>> >>>>>>> but I suspect
>>> >> > >> > >> some
>>> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
>>> >> > >> > >> >>> >>>>>>> forward though,
>>> >> > >> > >> >>> can't
>>> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
>>> >> > currently
>>> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>>> >> > >> > >> >>> >>>>>>>
>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>> >> > >> > >> >>> >>>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>> Extra eyes/review for
>>> >> > >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
>>> >> > >> > >> >>> >>>>>> Observers with the
>>> >> > >> > >> >>> upcoming
>>> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
>>> >> wild...).
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>> -rgs
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>> Patrick
>>> >> > >> > >> >>> >>>>>>>
>>> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>>> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>>> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
>>> >> > >> > >> >>> >>>>>>>> but I need a +1
>>> >> > >> > >> >>> >>>> there. I
>>> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
>>> >> > >> > >> >>> >>>>>>> failed in the
>>> >> > >> > >> last QA
>>> >> > >> > >> >>> run
>>> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest,
>>> >> > >> > >> >>> >>>>>>> and
>>> >> > in
>>> >> > >> > >> >>> >>>>>>> fact
>>> >> > >> > >> it
>>> >> > >> > >> >>> fails
>>> >> > >> > >> >>> >>>> in
>>> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't
>>> >> > >> > >> >>> >>>>>>> seem
>>> >> > to
>>> >> > >> > >> >>> >>>>>>> be
>>> >> > >> > >> >>> related
>>> >> > >> > >> >>> >>>> to the
>>> >> > >> > >> >>> >>>>>>> patch.
>>> >> > >> > >> >>> >>>>>>>>
>>> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
>>> >> > >> > >> >>> >>>>>>>> anything concrete
>>> >> > >> > >> about
>>> >> > >> > >> >>> it.
>>> >> > >> > >> >>> >>>>>>>>
>>> >> > >> > >> >>> >>>>>>>> -Flavio
>>> >> > >> > >> >>> >>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>
>>> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
>>> >> > >> > >> phunt@apache.org>
>>> >> > >> > >> >>> >>>> wrote:
>>> >> > >> > >> >>> >>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
>>> >> > >> > >> reviewed/finalized
>>> >> > >> > >> >>> before
>>> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an
>>> >> > >> > >> >>> >>>>>>>>> RC
>>> >> > >> > soonish...
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>> Patrick
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
>>> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
>>> wrote:
>>> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>>> >> > >> > >> >>> >>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863
>>> in.
>>> >> > >> > >> >>> >>>>>>>>>> They are
>>> >> > >> > >> both
>>> >> > >> > >> >>> >>>> patch
>>> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
>>> >> > >> > >> >>> >>>>>>> open because we
>>> >> > >> > >> >>> need a
>>> >> > >> > >> >>> >>>> 3.4
>>> >> > >> > >> >>> >>>>>>> patch.
>>> >> > >> > >> >>> >>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>> -Flavio
>>> >> > >> > >> >>> >>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>>> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>>> >> > >> > >> >>> wrote:
>>> >> > >> > >> >>> >>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
>>> >> > while, a
>>> >> > >> > >> >>> >>>>>>>>>>> few
>>> >> > >> > >> >>> people
>>> >> > >> > >> >>> >>>> have
>>> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
>>> >> > >> > >> >>> >>>>>>>>>>> personally
>>> >> > >> > >> that
>>> >> > >> > >> >>> they
>>> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the first
>>> >> > >> > >> >>> >>>>>>>>>>> 3.5
>>> >> > >> > release.
>>> >> > >> > >> Every
>>> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
>>> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>>> >> > >> > >> never
>>> >> > >> > >> >>> get
>>> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great
>>> >> > >> > >> >>> >>>>>>>>>>> new features,
>>> >> > >> > >> >>> lots of
>>> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
>>> >> > >> > >> >>> >>>>>>>>>>> that folks can
>>> >> > >> > >> use
>>> >> > >> > >> >>> it,
>>> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>>> >> > >> > >> >>> >>>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
>>> >> > >> > >> >>> >>>>>>>>>>> for the known
>>> >> > >> > >> >>> flakey
>>> >> > >> > >> >>> >>>> test
>>> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
>>> >> > >> > trunk.
>>> >> > >> > >> >>> >>>>>>>>>>> Note
>>> >> > >> > >> that
>>> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
>>> >> > >> > >> >>> >>>>>>>>>>> jdk7
>>> >> > and
>>> >> > >> > jdk8.
>>> >> > >> > >> >>> >>>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan
>>> >> > >> > >> >>> >>>>>>>>>>> our
>>> >> > >> > >> releases:
>>> >> > >> > >> >>> >>>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>>> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>>> >> > for
>>> >> > >> > >> >>> >>>>>>>>>>> some
>>> >> > >> > >> time.
>>> >> > >> > >> >>> >>>> What I
>>> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what
>>> >> > >> > >> >>> >>>>>>>>>>> we
>>> >> > did
>>> >> > >> > >> >>> >>>>>>>>>>> for
>>> >> > >> > >> 3.4.
>>> >> > >> > >> >>> >>>> (this is
>>> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
>>> >> > Hadoop 2
>>> >> > >> > >> release
>>> >> > >> > >> >>> >>>> cycle)
>>> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
>>> >> > >> > >> >>> >>>>>>>>>>> something people
>>> >> > >> > >> can run
>>> >> > >> > >> >>> >>>> and
>>> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
>>> >> > >> > >> >>> >>>>>>>>>>> and
>>> >> > feel
>>> >> > >> > >> >>> comfortable
>>> >> > >> > >> >>> >>>> with
>>> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to
>>> >> beta.
>>> >> > >> > >> >>> >>>>>>>>>>> Once we
>>> >> > >> > >> get
>>> >> > >> > >> >>> >>>> some
>>> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
>>> >> > >> > and
>>> >> > >> > >> >>> >>>>>>>>>>> look at
>>> >> > >> > >> >>> making
>>> >> > >> > >> >>> >>>> it
>>> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become
>>> >> > >> > >> >>> >>>>>>>>>>> the
>>> >> > >> > >> >>> "current/stable"
>>> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>>> >> > >> > >> >>> >>>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> e.g.
>>> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
>>> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers) 3.5.3-beta
>>> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>>> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
>>> >> > >> > >> >>> >>>>>>>>>>> also not
>>> >> > >> > >> "stable" vs
>>> >> > >> > >> >>> >>>> 3.4.x,
>>> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
>>> >> > >> > >> >>> >>>>>>>>>>> expect things to
>>> >> > >> > >> shake
>>> >> > >> > >> >>> >>>> out)
>>> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>>> >> > >> > >> >>> >>>>>>>>>>> ....
>>> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
>>> >> > production
>>> >> > >> > >> >>> >>>>>>>>>>> use,
>>> >> > >> > >> >>> stable,
>>> >> > >> > >> >>> >>>>>>> etc...
>>> >> > >> > >> >>> >>>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
>>> >> > >> > >> >>> >>>>>>>>>>> these something
>>> >> > >> > >> that
>>> >> > >> > >> >>> >>>> should
>>> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>>> >> > >> > >> >>> >>>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
>>> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>>> >> > >> > >> find
>>> >> > >> > >> >>> >>>> this a
>>> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to
>>> >> > >> > >> >>> >>>>>>>>>>> cut
>>> >> > an
>>> >> > >> RC.
>>> >> > >> > >> >>> >>>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>>> Patrick
>>> >> > >> > >> >>> >>>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>>>
>>> >> > >> > >> >>> >>>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >>>>>>
>>> >> > >> > >> >>> >
>>> >> > >> > >> >>>
>>> >> > >> > >>
>>> >> > >>
>>> >> > >>
>>> >> >
>>> >>
>>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Looks like 3 hasn't been removed (unfortunately the assertion doesn't
include any msg detail, but that's the way it looks to me like the
test is setup):

        if (leavingServers != null) {
            for (String leaving : leavingServers)

Assert.assertFalse(configStr.contains("server.".concat(leaving)));
        }

which is called from:

        qu.restart(2);
        // Now that 2 is back up, they'll complete the reconfig removing 3 and
        // can process other ops.
        testServerHasConfig(zkArr[1], null, leavingServers);

It seems like the problem is that testServerHasConfig is not waiting
for the configuration to be updated? In this case 2 was just restarted
and 3 hasn't had a chance to be removed? (on a slower machine say,
which might be why you aren't seeing the issue? hence the flakeyness)

Patrick

On Tue, Jul 22, 2014 at 10:57 AM, Alexander Shraer <sh...@gmail.com> wrote:
> Hi Patrick, I'm not sure why you're seeing this - it consistently passes on
> my machine. In case you'd like to take a look, the test has tons of
> comments explaining the scenario. Let me know how I can help.
>
>
> On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>
>> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
>> multiple times (not every time, but ~50%, so flakey) in the last few
>> days. It's failing both on jdk6 and jdk7. (this is my personal
>> jenkins, I haven't see any other failures than this during the past
>> few days).
>>
>> junit.framework.AssertionFailedError
>> at
>> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
>> at
>> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
>> at
>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>
>> Patrick
>>
>> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <sh...@gmail.com>
>> wrote:
>> > Hi Rakesh,
>> >
>> > Thanks for looking at this. In general even if we find the bug since we
>> > should test it before committing a fix, it seems better to remove the
>> test
>> > for now and debug this on a build machine. I'm trying to get access to
>> it.
>> >
>> > Looking at this log:
>> >
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>> >
>> > Something weird is going on. Sever 3 hasn't started yet, but version
>> 200000000
>> > is already being sent around as committed!
>> >
>> > 2014-07-21 10:44:50,901 [myid:2] - INFO
>> > [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
>> > - 2 Received version: 200000000 my version: 0
>> >
>> >
>> > and also in leader election messages.
>> >
>> > Also weird is that the version of 2 is 0 as if it is a joiner, whereas we
>> > explicitly started it with 100000000.
>> > Then it makes sense that the new config can't be committed since its
>> > version is not high enough...
>> >
>> > I wonder if its possible that not all servers from the previous test are
>> > dead and they are interfering...
>> >
>> >
>> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:
>> >
>> >> Hi Alex,
>> >>
>> >> Yeah it is consistently passing in my machine also.
>> >>
>> >>
>> >> I have quickly gone through the
>> >> testCurrentObserverIsParticipantInNewConfig failure logs in
>> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version)
>> has
>> >> not taken and still leader election is seeing 100000000 (n.config
>> version).
>> >> Unfortunately I didn't find the reason for not considering the updated
>> >> config version.
>> >>
>> >>
>> >> Reference:
>> >>
>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>> >>
>> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
>> >> Notification time out: 51200
>> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
>> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
>> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
>> >> Notification time out: 51200
>> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
>> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
>> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >>
>> >>
>> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
>> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
>> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> >> state)100000000 (n.config version)
>> >>
>> >>
>> >> -Rakesh
>> >>
>> >> -----Original Message-----
>> >> From: Alexander Shraer [mailto:shralex@gmail.com]
>> >> Sent: 22 July 2014 11:57
>> >> To: dev@zookeeper.apache.org
>> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >>
>> >> I tried to look into it, but the test consistently passes locally on two
>> >> machines.
>> >> I don't currently have access to the build machine, but I can try to ask
>> >> for access.
>> >> Unless anyone has a better suggestion, we could remove the failing test
>> in
>> >> the meanwhile and open a JIRA to add it back...
>> >>
>> >>
>> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
>> wrote:
>> >>
>> >> > I'm seeing alot of test failures in
>> >> > testCurrentObserverIsParticipantInNewConfig could someone take a look?
>> >> > Seems related to ZOOKEEPER-1807 recent commit.
>> >> >
>> >> >
>> >> >
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>> >> > 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>> >> > tabpanel#comment-14069024
>> >> >
>> >> > Patrick
>> >> >
>> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>> >> > <ra...@gmail.com> wrote:
>> >> > > lgtm +1
>> >> > >
>> >> > >
>> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>> >> > > <fp...@yahoo.com.invalid>
>> >> > wrote:
>> >> > >
>> >> > >> +1 for having an RC this week. Since this is an alpha release, I
>> >> > >> +think
>> >> > 72
>> >> > >> biz hours is enough for the vote.
>> >> > >>
>> >> > >> -Flavio
>> >> > >>
>> >> > >> > -----Original Message-----
>> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>> >> > >> > Sent: 21 July 2014 18:55
>> >> > >> > To: DevZooKeeper
>> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >> > >> >
>> >> > >> > I fixed a number of issues. I also started a few threads with
>> >> > >> > builds@
>> >> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
>> >> > through a
>> >> > >> > number of findbugs issues, it's not closed yet but it's pretty
>> >> close.
>> >> > >> >
>> >> > >> > I don't see why we can't create an RC and start voting this week
>> >> > though.
>> >> > >> > Anyone disagree?
>> >> > >> >
>> >> > >> > How long should we let the vote run, the std 72 biz hours or
>> >> > >> > should we
>> >> > >> plan
>> >> > >> > for more to allow folks more time to test?
>> >> > >> >
>> >> > >> > Patrick
>> >> > >> >
>> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>> >> > >> > <rg...@itevenworks.net> wrote:
>> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
>> >> > >> > >
>> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
>> >> > trying
>> >> > >> > >> to fix some of the jobs that were broken during the recent
>> >> > >> > >> host upgrade.
>> >> > >> > >>
>> >> > >> > >
>> >> > >> > > How are things looking? Is it likely that we can have a 3.5.0
>> >> > >> > > alpha release week or are we still blocked on Jenkins?
>> >> > >> > >
>> >> > >> > >
>> >> > >> > > -rgs
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >
>> >> > >> > >> Patrick
>> >> > >> > >>
>> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>> >> > >> > >> <mi...@cs.stanford.edu>
>> >> > >> > >> wrote:
>> >> > >> > >> > I'll check in ZOOKEEPER-1683.
>> >> > >> > >> >
>> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>> >> > >> > >> > <sh...@gmail.com>
>> >> > >> > >> wrote:
>> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and
>> >> > >> > >> >> all
>> >> > >> > >> subsequent
>> >> > >> > >> >> changes were formatting as suggested by Rakesh.
>> >> > >> > >> >>
>> >> > >> > >> >>
>> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>> >> > >> > >> >> <phunt@apache.org
>> >> > >
>> >> > >> > wrote:
>> >> > >> > >> >>
>> >> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
>> >> > >> > >> >>> for
>> >> > e.g.
>> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
>> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
>> >> > >> > >> >>> startServer fix
>> >> > and
>> >> > >> > >> >>> some other flakeys that have been addressed go a long way
>> >> > >> > >> >>> on
>> >> > that
>> >> > >> > >> >>> issue) but I think the findbugs problem should be cleaned
>> >> > >> > >> >>> up before we cut a release. I started a separate thread to
>> >> > >> > >> >>> discuss
>> >> > >> the
>> >> > >> > findbugs issue.
>> >> > >> > >> >>>
>> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>> >> > >> > >> >>>
>> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
>> >> > >> > >> >>>
>> >> > >> > >> >>> Patrick
>> >> > >> > >> >>>
>> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>> >> > >> > >> >>> > My take:
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can
>> >> > >> > >> >>> > go
>> >> > in.
>> >> > >> > >> >>> > See
>> >> > >> > >> the
>> >> > >> > >> >>> jira for comments.
>> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
>> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
>> >> > >> > >> >>> > ZK-1919
>> >> > and
>> >> > >> > >> ZK-1910,
>> >> > >> > >> >>> we can leave it for the second alpha.
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > If you agree with this, then we should be able to cut a
>> >> > >> > >> >>> > candidate by
>> >> > >> > >> the
>> >> > >> > >> >>> end of this week.
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > -Flavio
>> >> > >> > >> >>> >
>> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>> >> > >> > >> >>> > <ph...@apache.org>
>> >> > >> wrote:
>> >> > >> > >> >>> >
>> >> > >> > >> >>> >> Per my previous note you can now see the c client test
>> >> > >> > >> >>> >> log output
>> >> > >> > >> here
>> >> > >> > >> >>> >> in the "build artifacts" section:
>> >> > >> > >> >>> >>
>> >> > >> > >> >>>
>> >> > >> > >>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>> >> > >> > >> r-
>> >> > >> > trunk
>> >> > >> > >> /2372/
>> >> > >> > >> >>> >>
>> >> > >> > >> >>> >> Patrick
>> >> > >> > >> >>> >>
>> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>> >> > >> > >> >>> >> <ph...@apache.org>
>> >> > >> > >> wrote:
>> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear
>> >> > >> > >> >>> >>> to me which
>> >> > >> > >> >>> >>> one(s?) is new?)
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
>> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
>> >> > >> > >> >>> >>> updated the "archive" list
>> >> > >> > >> to
>> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it won't
>> >> > >> > >> >>> >>> go
>> >> > to
>> >> > >> > >> console
>> >> > >> > >> >>> >>> at least we can debug when there is a failure.
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
>> >> > server
>> >> > >> > >> support
>> >> > >> > >> >>> >>> and it looks like that should be ready soon.
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Raul also requested that someone prioritize reviewing
>> >> > >> > >> "ZOOKEEPER-1919
>> >> > >> > >> >>> >>> Update the C implementation of removeWatches to have
>> >> > >> > >> >>> >>> it
>> >> > >> > match
>> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
>> >> > >> Flavio/Michi?
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
>> >> > >> > >> >>> >>> reconfig
>> >> > >> > >> test -
>> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Based on previous comments it looks like we're pretty
>> >> > close.
>> >> > >> > >> >>> >>> Do
>> >> > >> > >> folks
>> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
>> >> > >> > >> >>> >>> (with a few
>> >> > >> > >> pending
>> >> > >> > >> >>> >>> as above)
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> Patrick
>> >> > >> > >> >>> >>>
>> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
>> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> >> > >> > >> >>> <fp...@yahoo.com.invalid>
>> >> > >> > >> >>> >>>> wrote:
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
>> >> > >> > >> >>> >>>>> release
>> >> > an
>> >> > >> > >> >>> >>>>> alpha
>> >> > >> > >> >>> version
>> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
>> >> > >> > >> >>> >>>> weeks
>> >> > time?
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>
>> >> > >> > >> >>> >>>> +1
>> >> > >> > >> >>> >>>>
>> >> > >> > >> >>> >>>> -rgs
>> >> > >> > >> >>> >>>>
>> >> > >> > >> >>> >>>>> -Flavio
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
>> >> > Segalés <
>> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>> >> > >> > >> >>> >>>>>> <ph...@apache.org>
>> >> > >> > wrote:
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8
>> >> > >> > >> >>> >>>>>>> in
>> >> > the
>> >> > >> > >> >>> >>>>>>> last
>> >> > >> > >> >>> check).
>> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
>> >> > >> > >> >>> >>>>>>> Camille is
>> >> > >> > >> threatening
>> >> > >> > >> >>> to
>> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in general
>> >> > >> > >> >>> >>>>>>> on
>> >> > the
>> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
>> >> > >> > >> >>> >>>>>>> should we drop
>> >> > >> > >> jdk6
>> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
>> >> > >> > >> >>> >>>>>>> but I suspect
>> >> > >> > >> some
>> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
>> >> > >> > >> >>> >>>>>>> forward though,
>> >> > >> > >> >>> can't
>> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
>> >> > currently
>> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>> >> > >> > >> >>> >>>>>>>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>> Extra eyes/review for
>> >> > >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
>> >> > >> > >> >>> >>>>>> Observers with the
>> >> > >> > >> >>> upcoming
>> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
>> >> wild...).
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>> -rgs
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>> Patrick
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
>> >> > >> > >> >>> >>>>>>>> but I need a +1
>> >> > >> > >> >>> >>>> there. I
>> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
>> >> > >> > >> >>> >>>>>>> failed in the
>> >> > >> > >> last QA
>> >> > >> > >> >>> run
>> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest,
>> >> > >> > >> >>> >>>>>>> and
>> >> > in
>> >> > >> > >> >>> >>>>>>> fact
>> >> > >> > >> it
>> >> > >> > >> >>> fails
>> >> > >> > >> >>> >>>> in
>> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't
>> >> > >> > >> >>> >>>>>>> seem
>> >> > to
>> >> > >> > >> >>> >>>>>>> be
>> >> > >> > >> >>> related
>> >> > >> > >> >>> >>>> to the
>> >> > >> > >> >>> >>>>>>> patch.
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
>> >> > >> > >> >>> >>>>>>>> anything concrete
>> >> > >> > >> about
>> >> > >> > >> >>> it.
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>> -Flavio
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
>> >> > >> > >> phunt@apache.org>
>> >> > >> > >> >>> >>>> wrote:
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
>> >> > >> > >> reviewed/finalized
>> >> > >> > >> >>> before
>> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an
>> >> > >> > >> >>> >>>>>>>>> RC
>> >> > >> > soonish...
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>> Patrick
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
>> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
>> wrote:
>> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863
>> in.
>> >> > >> > >> >>> >>>>>>>>>> They are
>> >> > >> > >> both
>> >> > >> > >> >>> >>>> patch
>> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
>> >> > >> > >> >>> >>>>>>> open because we
>> >> > >> > >> >>> need a
>> >> > >> > >> >>> >>>> 3.4
>> >> > >> > >> >>> >>>>>>> patch.
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>> -Flavio
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>> >> > >> > >> >>> wrote:
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
>> >> > while, a
>> >> > >> > >> >>> >>>>>>>>>>> few
>> >> > >> > >> >>> people
>> >> > >> > >> >>> >>>> have
>> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
>> >> > >> > >> >>> >>>>>>>>>>> personally
>> >> > >> > >> that
>> >> > >> > >> >>> they
>> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the first
>> >> > >> > >> >>> >>>>>>>>>>> 3.5
>> >> > >> > release.
>> >> > >> > >> Every
>> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
>> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
>> >> > >> > >> never
>> >> > >> > >> >>> get
>> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great
>> >> > >> > >> >>> >>>>>>>>>>> new features,
>> >> > >> > >> >>> lots of
>> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
>> >> > >> > >> >>> >>>>>>>>>>> that folks can
>> >> > >> > >> use
>> >> > >> > >> >>> it,
>> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
>> >> > >> > >> >>> >>>>>>>>>>> for the known
>> >> > >> > >> >>> flakey
>> >> > >> > >> >>> >>>> test
>> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
>> >> > >> > trunk.
>> >> > >> > >> >>> >>>>>>>>>>> Note
>> >> > >> > >> that
>> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
>> >> > >> > >> >>> >>>>>>>>>>> jdk7
>> >> > and
>> >> > >> > jdk8.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan
>> >> > >> > >> >>> >>>>>>>>>>> our
>> >> > >> > >> releases:
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>> >> > for
>> >> > >> > >> >>> >>>>>>>>>>> some
>> >> > >> > >> time.
>> >> > >> > >> >>> >>>> What I
>> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what
>> >> > >> > >> >>> >>>>>>>>>>> we
>> >> > did
>> >> > >> > >> >>> >>>>>>>>>>> for
>> >> > >> > >> 3.4.
>> >> > >> > >> >>> >>>> (this is
>> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
>> >> > Hadoop 2
>> >> > >> > >> release
>> >> > >> > >> >>> >>>> cycle)
>> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
>> >> > >> > >> >>> >>>>>>>>>>> something people
>> >> > >> > >> can run
>> >> > >> > >> >>> >>>> and
>> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
>> >> > >> > >> >>> >>>>>>>>>>> and
>> >> > feel
>> >> > >> > >> >>> comfortable
>> >> > >> > >> >>> >>>> with
>> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to
>> >> beta.
>> >> > >> > >> >>> >>>>>>>>>>> Once we
>> >> > >> > >> get
>> >> > >> > >> >>> >>>> some
>> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
>> >> > >> > and
>> >> > >> > >> >>> >>>>>>>>>>> look at
>> >> > >> > >> >>> making
>> >> > >> > >> >>> >>>> it
>> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become
>> >> > >> > >> >>> >>>>>>>>>>> the
>> >> > >> > >> >>> "current/stable"
>> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> e.g.
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
>> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers) 3.5.3-beta
>> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
>> >> > >> > >> >>> >>>>>>>>>>> also not
>> >> > >> > >> "stable" vs
>> >> > >> > >> >>> >>>> 3.4.x,
>> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
>> >> > >> > >> >>> >>>>>>>>>>> expect things to
>> >> > >> > >> shake
>> >> > >> > >> >>> >>>> out)
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
>> >> > >> > >> >>> >>>>>>>>>>> ....
>> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
>> >> > production
>> >> > >> > >> >>> >>>>>>>>>>> use,
>> >> > >> > >> >>> stable,
>> >> > >> > >> >>> >>>>>>> etc...
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
>> >> > >> > >> >>> >>>>>>>>>>> these something
>> >> > >> > >> that
>> >> > >> > >> >>> >>>> should
>> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
>> >> > >> > >> >>> >>>>>>>>>>> days. If folks
>> >> > >> > >> find
>> >> > >> > >> >>> >>>> this a
>> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to
>> >> > >> > >> >>> >>>>>>>>>>> cut
>> >> > an
>> >> > >> RC.
>> >> > >> > >> >>> >>>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>>> Patrick
>> >> > >> > >> >>> >>>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>>>
>> >> > >> > >> >>> >>>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >>>>>>
>> >> > >> > >> >>> >
>> >> > >> > >> >>>
>> >> > >> > >>
>> >> > >>
>> >> > >>
>> >> >
>> >>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Hi Patrick, I'm not sure why you're seeing this - it consistently passes on
my machine. In case you'd like to take a look, the test has tons of
comments explaining the scenario. Let me know how I can help.


On Tue, Jul 22, 2014 at 9:53 AM, Patrick Hunt <ph...@apache.org> wrote:

> Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
> multiple times (not every time, but ~50%, so flakey) in the last few
> days. It's failing both on jdk6 and jdk7. (this is my personal
> jenkins, I haven't see any other failures than this during the past
> few days).
>
> junit.framework.AssertionFailedError
> at
> org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
> at
> org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
> at
> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>
> Patrick
>
> On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <sh...@gmail.com>
> wrote:
> > Hi Rakesh,
> >
> > Thanks for looking at this. In general even if we find the bug since we
> > should test it before committing a fix, it seems better to remove the
> test
> > for now and debug this on a build machine. I'm trying to get access to
> it.
> >
> > Looking at this log:
> >
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
> >
> > Something weird is going on. Sever 3 hasn't started yet, but version
> 200000000
> > is already being sent around as committed!
> >
> > 2014-07-21 10:44:50,901 [myid:2] - INFO
> > [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
> > - 2 Received version: 200000000 my version: 0
> >
> >
> > and also in leader election messages.
> >
> > Also weird is that the version of 2 is 0 as if it is a joiner, whereas we
> > explicitly started it with 100000000.
> > Then it makes sense that the new config can't be committed since its
> > version is not high enough...
> >
> > I wonder if its possible that not all servers from the previous test are
> > dead and they are interfering...
> >
> >
> > On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:
> >
> >> Hi Alex,
> >>
> >> Yeah it is consistently passing in my machine also.
> >>
> >>
> >> I have quickly gone through the
> >> testCurrentObserverIsParticipantInNewConfig failure logs in
> >> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version)
> has
> >> not taken and still leader election is seeing 100000000 (n.config
> version).
> >> Unfortunately I didn't find the reason for not considering the updated
> >> config version.
> >>
> >>
> >> Reference:
> >>
> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
> >>
> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
> >> Notification time out: 51200
> >> 2014-07-22 06:38:00,330 [myid:1] - INFO
> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> >> state)100000000 (n.config version)
> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> >> state)100000000 (n.config version)
> >> 2014-07-22 06:38:00,330 [myid:2] - INFO
> >>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
> >> Notification time out: 51200
> >> 2014-07-22 06:38:00,331 [myid:0] - INFO
> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> >> state)100000000 (n.config version)
> >> 2014-07-22 06:38:00,331 [myid:2] - INFO
> >>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> >> state)100000000 (n.config version)
> >>
> >>
> >> 2014-07-22 06:38:00,332 [myid:0] - INFO
> >>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> >> state)100000000 (n.config version)
> >> 2014-07-22 06:38:00,332 [myid:1] - INFO
> >>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
> >> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> >> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> >> state)100000000 (n.config version)
> >>
> >>
> >> -Rakesh
> >>
> >> -----Original Message-----
> >> From: Alexander Shraer [mailto:shralex@gmail.com]
> >> Sent: 22 July 2014 11:57
> >> To: dev@zookeeper.apache.org
> >> Subject: Re: ZooKeeper 3.5.0-alpha planning
> >>
> >> I tried to look into it, but the test consistently passes locally on two
> >> machines.
> >> I don't currently have access to the build machine, but I can try to ask
> >> for access.
> >> Unless anyone has a better suggestion, we could remove the failing test
> in
> >> the meanwhile and open a JIRA to add it back...
> >>
> >>
> >> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org>
> wrote:
> >>
> >> > I'm seeing alot of test failures in
> >> > testCurrentObserverIsParticipantInNewConfig could someone take a look?
> >> > Seems related to ZOOKEEPER-1807 recent commit.
> >> >
> >> >
> >> >
> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
> >> > 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
> >> > tabpanel#comment-14069024
> >> >
> >> > Patrick
> >> >
> >> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
> >> > <ra...@gmail.com> wrote:
> >> > > lgtm +1
> >> > >
> >> > >
> >> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
> >> > > <fp...@yahoo.com.invalid>
> >> > wrote:
> >> > >
> >> > >> +1 for having an RC this week. Since this is an alpha release, I
> >> > >> +think
> >> > 72
> >> > >> biz hours is enough for the vote.
> >> > >>
> >> > >> -Flavio
> >> > >>
> >> > >> > -----Original Message-----
> >> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
> >> > >> > Sent: 21 July 2014 18:55
> >> > >> > To: DevZooKeeper
> >> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> >> > >> >
> >> > >> > I fixed a number of issues. I also started a few threads with
> >> > >> > builds@
> >> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
> >> > through a
> >> > >> > number of findbugs issues, it's not closed yet but it's pretty
> >> close.
> >> > >> >
> >> > >> > I don't see why we can't create an RC and start voting this week
> >> > though.
> >> > >> > Anyone disagree?
> >> > >> >
> >> > >> > How long should we let the vote run, the std 72 biz hours or
> >> > >> > should we
> >> > >> plan
> >> > >> > for more to allow folks more time to test?
> >> > >> >
> >> > >> > Patrick
> >> > >> >
> >> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
> >> > >> > <rg...@itevenworks.net> wrote:
> >> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
> >> > >> > >
> >> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
> >> > trying
> >> > >> > >> to fix some of the jobs that were broken during the recent
> >> > >> > >> host upgrade.
> >> > >> > >>
> >> > >> > >
> >> > >> > > How are things looking? Is it likely that we can have a 3.5.0
> >> > >> > > alpha release week or are we still blocked on Jenkins?
> >> > >> > >
> >> > >> > >
> >> > >> > > -rgs
> >> > >> > >
> >> > >> > >
> >> > >> > >
> >> > >> > >
> >> > >> > >
> >> > >> > >
> >> > >> > >> Patrick
> >> > >> > >>
> >> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> >> > >> > >> <mi...@cs.stanford.edu>
> >> > >> > >> wrote:
> >> > >> > >> > I'll check in ZOOKEEPER-1683.
> >> > >> > >> >
> >> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> >> > >> > >> > <sh...@gmail.com>
> >> > >> > >> wrote:
> >> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and
> >> > >> > >> >> all
> >> > >> > >> subsequent
> >> > >> > >> >> changes were formatting as suggested by Rakesh.
> >> > >> > >> >>
> >> > >> > >> >>
> >> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
> >> > >> > >> >> <phunt@apache.org
> >> > >
> >> > >> > wrote:
> >> > >> > >> >>
> >> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
> >> > >> > >> >>> for
> >> > e.g.
> >> > >> > >> >>> findbugs issues. At the very least our build/test/ci
> >> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
> >> > >> > >> >>> startServer fix
> >> > and
> >> > >> > >> >>> some other flakeys that have been addressed go a long way
> >> > >> > >> >>> on
> >> > that
> >> > >> > >> >>> issue) but I think the findbugs problem should be cleaned
> >> > >> > >> >>> up before we cut a release. I started a separate thread to
> >> > >> > >> >>> discuss
> >> > >> the
> >> > >> > findbugs issue.
> >> > >> > >> >>>
> >> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> >> > >> > >> >>>
> >> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
> >> > >> > >> >>>
> >> > >> > >> >>> Patrick
> >> > >> > >> >>>
> >> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> >> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
> >> > >> > >> >>> > My take:
> >> > >> > >> >>> >
> >> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can
> >> > >> > >> >>> > go
> >> > in.
> >> > >> > >> >>> > See
> >> > >> > >> the
> >> > >> > >> >>> jira for comments.
> >> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
> >> > >> > >> >>> > - I'd rather not have the first alpha depending on
> >> > >> > >> >>> > ZK-1919
> >> > and
> >> > >> > >> ZK-1910,
> >> > >> > >> >>> we can leave it for the second alpha.
> >> > >> > >> >>> >
> >> > >> > >> >>> > If you agree with this, then we should be able to cut a
> >> > >> > >> >>> > candidate by
> >> > >> > >> the
> >> > >> > >> >>> end of this week.
> >> > >> > >> >>> >
> >> > >> > >> >>> > -Flavio
> >> > >> > >> >>> >
> >> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
> >> > >> > >> >>> > <ph...@apache.org>
> >> > >> wrote:
> >> > >> > >> >>> >
> >> > >> > >> >>> >> Per my previous note you can now see the c client test
> >> > >> > >> >>> >> log output
> >> > >> > >> here
> >> > >> > >> >>> >> in the "build artifacts" section:
> >> > >> > >> >>> >>
> >> > >> > >> >>>
> >> > >> > >>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
> >> > >> > >> r-
> >> > >> > trunk
> >> > >> > >> /2372/
> >> > >> > >> >>> >>
> >> > >> > >> >>> >> Patrick
> >> > >> > >> >>> >>
> >> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> >> > >> > >> >>> >> <ph...@apache.org>
> >> > >> > >> wrote:
> >> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear
> >> > >> > >> >>> >>> to me which
> >> > >> > >> >>> >>> one(s?) is new?)
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
> >> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
> >> > >> > >> >>> >>> updated the "archive" list
> >> > >> > >> to
> >> > >> > >> >>> >>> include the c tests stdout redirect. So while it won't
> >> > >> > >> >>> >>> go
> >> > to
> >> > >> > >> console
> >> > >> > >> >>> >>> at least we can debug when there is a failure.
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
> >> > server
> >> > >> > >> support
> >> > >> > >> >>> >>> and it looks like that should be ready soon.
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>> Raul also requested that someone prioritize reviewing
> >> > >> > >> "ZOOKEEPER-1919
> >> > >> > >> >>> >>> Update the C implementation of removeWatches to have
> >> > >> > >> >>> >>> it
> >> > >> > match
> >> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
> >> > >> Flavio/Michi?
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
> >> > >> > >> >>> >>> reconfig
> >> > >> > >> test -
> >> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>> Based on previous comments it looks like we're pretty
> >> > close.
> >> > >> > >> >>> >>> Do
> >> > >> > >> folks
> >> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
> >> > >> > >> >>> >>> (with a few
> >> > >> > >> pending
> >> > >> > >> >>> >>> as above)
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>> Patrick
> >> > >> > >> >>> >>>
> >> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
> >> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
> >> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >> > >> > >> >>> <fp...@yahoo.com.invalid>
> >> > >> > >> >>> >>>> wrote:
> >> > >> > >> >>> >>>>>
> >> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
> >> > >> > >> >>> >>>>> release
> >> > an
> >> > >> > >> >>> >>>>> alpha
> >> > >> > >> >>> version
> >> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
> >> > >> > >> >>> >>>> weeks
> >> > time?
> >> > >> > >> >>> >>>>>
> >> > >> > >> >>> >>>>
> >> > >> > >> >>> >>>> +1
> >> > >> > >> >>> >>>>
> >> > >> > >> >>> >>>> -rgs
> >> > >> > >> >>> >>>>
> >> > >> > >> >>> >>>>> -Flavio
> >> > >> > >> >>> >>>>>
> >> > >> > >> >>> >>>>>
> >> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
> >> > Segalés <
> >> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
> >> > >> > >> >>> >>>>>
> >> > >> > >> >>> >>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
> >> > >> > >> >>> >>>>>> <ph...@apache.org>
> >> > >> > wrote:
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8
> >> > >> > >> >>> >>>>>>> in
> >> > the
> >> > >> > >> >>> >>>>>>> last
> >> > >> > >> >>> check).
> >> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
> >> > >> > >> >>> >>>>>>> Camille is
> >> > >> > >> threatening
> >> > >> > >> >>> to
> >> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in general
> >> > >> > >> >>> >>>>>>> on
> >> > the
> >> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
> >> > >> > >> >>> >>>>>>>
> >> > >> > >> >>> >>>>>>> So here's something else we might consider -
> >> > >> > >> >>> >>>>>>> should we drop
> >> > >> > >> jdk6
> >> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
> >> > >> > >> >>> >>>>>>> but I suspect
> >> > >> > >> some
> >> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
> >> > >> > >> >>> >>>>>>> forward though,
> >> > >> > >> >>> can't
> >> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
> >> > currently
> >> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> >> > >> > >> >>> >>>>>>>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >> > >> > >> >>> >>>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>> Extra eyes/review for
> >> > >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
> >> > >> > >> >>> >>>>>> Observers with the
> >> > >> > >> >>> upcoming
> >> > >> > >> >>> >>>>>> alpha release will see there network usage go
> >> wild...).
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>> -rgs
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>> Patrick
> >> > >> > >> >>> >>>>>>>
> >> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> >> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
> >> > >> > >> >>> >>>>>>>> but I need a +1
> >> > >> > >> >>> >>>> there. I
> >> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
> >> > >> > >> >>> >>>>>>> failed in the
> >> > >> > >> last QA
> >> > >> > >> >>> run
> >> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest,
> >> > >> > >> >>> >>>>>>> and
> >> > in
> >> > >> > >> >>> >>>>>>> fact
> >> > >> > >> it
> >> > >> > >> >>> fails
> >> > >> > >> >>> >>>> in
> >> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't
> >> > >> > >> >>> >>>>>>> seem
> >> > to
> >> > >> > >> >>> >>>>>>> be
> >> > >> > >> >>> related
> >> > >> > >> >>> >>>> to the
> >> > >> > >> >>> >>>>>>> patch.
> >> > >> > >> >>> >>>>>>>>
> >> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
> >> > >> > >> >>> >>>>>>>> anything concrete
> >> > >> > >> about
> >> > >> > >> >>> it.
> >> > >> > >> >>> >>>>>>>>
> >> > >> > >> >>> >>>>>>>> -Flavio
> >> > >> > >> >>> >>>>>>>>
> >> > >> > >> >>> >>>>>>>>
> >> > >> > >> >>> >>>>>>>>
> >> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
> >> > >> > >> phunt@apache.org>
> >> > >> > >> >>> >>>> wrote:
> >> > >> > >> >>> >>>>>>>>
> >> > >> > >> >>> >>>>>>>>
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> >> > >> > >> reviewed/finalized
> >> > >> > >> >>> before
> >> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an
> >> > >> > >> >>> >>>>>>>>> RC
> >> > >> > soonish...
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>>> Patrick
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
> >> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid>
> wrote:
> >> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> >> > >> > >> >>> >>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863
> in.
> >> > >> > >> >>> >>>>>>>>>> They are
> >> > >> > >> both
> >> > >> > >> >>> >>>> patch
> >> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
> >> > >> > >> >>> >>>>>>> open because we
> >> > >> > >> >>> need a
> >> > >> > >> >>> >>>> 3.4
> >> > >> > >> >>> >>>>>>> patch.
> >> > >> > >> >>> >>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>> -Flavio
> >> > >> > >> >>> >>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
> >> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
> >> > >> > >> >>> wrote:
> >> > >> > >> >>> >>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
> >> > while, a
> >> > >> > >> >>> >>>>>>>>>>> few
> >> > >> > >> >>> people
> >> > >> > >> >>> >>>> have
> >> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
> >> > >> > >> >>> >>>>>>>>>>> personally
> >> > >> > >> that
> >> > >> > >> >>> they
> >> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the first
> >> > >> > >> >>> >>>>>>>>>>> 3.5
> >> > >> > release.
> >> > >> > >> Every
> >> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
> >> > >> > >> >>> >>>>>>>>>>> perfection we'll
> >> > >> > >> never
> >> > >> > >> >>> get
> >> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great
> >> > >> > >> >>> >>>>>>>>>>> new features,
> >> > >> > >> >>> lots of
> >> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
> >> > >> > >> >>> >>>>>>>>>>> that folks can
> >> > >> > >> use
> >> > >> > >> >>> it,
> >> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
> >> > >> > >> >>> >>>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
> >> > >> > >> >>> >>>>>>>>>>> for the known
> >> > >> > >> >>> flakey
> >> > >> > >> >>> >>>> test
> >> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
> >> > >> > trunk.
> >> > >> > >> >>> >>>>>>>>>>> Note
> >> > >> > >> that
> >> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
> >> > >> > >> >>> >>>>>>>>>>> jdk7
> >> > and
> >> > >> > jdk8.
> >> > >> > >> >>> >>>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan
> >> > >> > >> >>> >>>>>>>>>>> our
> >> > >> > >> releases:
> >> > >> > >> >>> >>>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
> >> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
> >> > for
> >> > >> > >> >>> >>>>>>>>>>> some
> >> > >> > >> time.
> >> > >> > >> >>> >>>> What I
> >> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what
> >> > >> > >> >>> >>>>>>>>>>> we
> >> > did
> >> > >> > >> >>> >>>>>>>>>>> for
> >> > >> > >> 3.4.
> >> > >> > >> >>> >>>> (this is
> >> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
> >> > Hadoop 2
> >> > >> > >> release
> >> > >> > >> >>> >>>> cycle)
> >> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
> >> > >> > >> >>> >>>>>>>>>>> something people
> >> > >> > >> can run
> >> > >> > >> >>> >>>> and
> >> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
> >> > >> > >> >>> >>>>>>>>>>> and
> >> > feel
> >> > >> > >> >>> comfortable
> >> > >> > >> >>> >>>> with
> >> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to
> >> beta.
> >> > >> > >> >>> >>>>>>>>>>> Once we
> >> > >> > >> get
> >> > >> > >> >>> >>>> some
> >> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
> >> > >> > and
> >> > >> > >> >>> >>>>>>>>>>> look at
> >> > >> > >> >>> making
> >> > >> > >> >>> >>>> it
> >> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become
> >> > >> > >> >>> >>>>>>>>>>> the
> >> > >> > >> >>> "current/stable"
> >> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >> > >> > >> >>> >>>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> e.g.
> >> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
> >> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers) 3.5.3-beta
> >> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
> >> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
> >> > >> > >> >>> >>>>>>>>>>> also not
> >> > >> > >> "stable" vs
> >> > >> > >> >>> >>>> 3.4.x,
> >> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
> >> > >> > >> >>> >>>>>>>>>>> expect things to
> >> > >> > >> shake
> >> > >> > >> >>> >>>> out)
> >> > >> > >> >>> >>>>>>>>>>> 3.5.7
> >> > >> > >> >>> >>>>>>>>>>> ....
> >> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
> >> > production
> >> > >> > >> >>> >>>>>>>>>>> use,
> >> > >> > >> >>> stable,
> >> > >> > >> >>> >>>>>>> etc...
> >> > >> > >> >>> >>>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
> >> > >> > >> >>> >>>>>>>>>>> these something
> >> > >> > >> that
> >> > >> > >> >>> >>>> should
> >> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >> > >> > >> >>> >>>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
> >> > >> > >> >>> >>>>>>>>>>> days. If folks
> >> > >> > >> find
> >> > >> > >> >>> >>>> this a
> >> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to
> >> > >> > >> >>> >>>>>>>>>>> cut
> >> > an
> >> > >> RC.
> >> > >> > >> >>> >>>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>>> Patrick
> >> > >> > >> >>> >>>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>>>
> >> > >> > >> >>> >>>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >>>>>>
> >> > >> > >> >>> >
> >> > >> > >> >>>
> >> > >> > >>
> >> > >>
> >> > >>
> >> >
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Hi Alex, I've also seen the test "testLeaderTimesoutOnNewQuorum" fail
multiple times (not every time, but ~50%, so flakey) in the last few
days. It's failing both on jdk6 and jdk7. (this is my personal
jenkins, I haven't see any other failures than this during the past
few days).

junit.framework.AssertionFailedError
at org.apache.zookeeper.test.ReconfigTest.testServerHasConfig(ReconfigTest.java:127)
at org.apache.zookeeper.test.ReconfigTest.testLeaderTimesoutOnNewQuorum(ReconfigTest.java:450)
at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)

Patrick

On Tue, Jul 22, 2014 at 8:37 AM, Alexander Shraer <sh...@gmail.com> wrote:
> Hi Rakesh,
>
> Thanks for looking at this. In general even if we find the bug since we
> should test it before committing a fix, it seems better to remove the test
> for now and debug this on a build machine. I'm trying to get access to it.
>
> Looking at this log:
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/
>
> Something weird is going on. Sever 3 hasn't started yet, but version 200000000
> is already being sent around as committed!
>
> 2014-07-21 10:44:50,901 [myid:2] - INFO
> [WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
> - 2 Received version: 200000000 my version: 0
>
>
> and also in leader election messages.
>
> Also weird is that the version of 2 is 0 as if it is a joiner, whereas we
> explicitly started it with 100000000.
> Then it makes sense that the new config can't be committed since its
> version is not high enough...
>
> I wonder if its possible that not all servers from the previous test are
> dead and they are interfering...
>
>
> On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:
>
>> Hi Alex,
>>
>> Yeah it is consistently passing in my machine also.
>>
>>
>> I have quickly gone through the
>> testCurrentObserverIsParticipantInNewConfig failure logs in
>> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version) has
>> not taken and still leader election is seeing 100000000 (n.config version).
>> Unfortunately I didn't find the reason for not considering the updated
>> config version.
>>
>>
>> Reference:
>> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>>
>> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
>> Notification time out: 51200
>> 2014-07-22 06:38:00,330 [myid:1] - INFO
>>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> state)100000000 (n.config version)
>> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> state)100000000 (n.config version)
>> 2014-07-22 06:38:00,330 [myid:2] - INFO
>>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
>> Notification time out: 51200
>> 2014-07-22 06:38:00,331 [myid:0] - INFO
>>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> state)100000000 (n.config version)
>> 2014-07-22 06:38:00,331 [myid:2] - INFO
>>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
>> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> state)100000000 (n.config version)
>>
>>
>> 2014-07-22 06:38:00,332 [myid:0] - INFO
>>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
>> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> state)100000000 (n.config version)
>> 2014-07-22 06:38:00,332 [myid:1] - INFO
>>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
>> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
>> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
>> state)100000000 (n.config version)
>>
>>
>> -Rakesh
>>
>> -----Original Message-----
>> From: Alexander Shraer [mailto:shralex@gmail.com]
>> Sent: 22 July 2014 11:57
>> To: dev@zookeeper.apache.org
>> Subject: Re: ZooKeeper 3.5.0-alpha planning
>>
>> I tried to look into it, but the test consistently passes locally on two
>> machines.
>> I don't currently have access to the build machine, but I can try to ask
>> for access.
>> Unless anyone has a better suggestion, we could remove the failing test in
>> the meanwhile and open a JIRA to add it back...
>>
>>
>> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org> wrote:
>>
>> > I'm seeing alot of test failures in
>> > testCurrentObserverIsParticipantInNewConfig could someone take a look?
>> > Seems related to ZOOKEEPER-1807 recent commit.
>> >
>> >
>> > https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
>> > 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
>> > tabpanel#comment-14069024
>> >
>> > Patrick
>> >
>> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
>> > <ra...@gmail.com> wrote:
>> > > lgtm +1
>> > >
>> > >
>> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
>> > > <fp...@yahoo.com.invalid>
>> > wrote:
>> > >
>> > >> +1 for having an RC this week. Since this is an alpha release, I
>> > >> +think
>> > 72
>> > >> biz hours is enough for the vote.
>> > >>
>> > >> -Flavio
>> > >>
>> > >> > -----Original Message-----
>> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
>> > >> > Sent: 21 July 2014 18:55
>> > >> > To: DevZooKeeper
>> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>> > >> >
>> > >> > I fixed a number of issues. I also started a few threads with
>> > >> > builds@
>> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
>> > through a
>> > >> > number of findbugs issues, it's not closed yet but it's pretty
>> close.
>> > >> >
>> > >> > I don't see why we can't create an RC and start voting this week
>> > though.
>> > >> > Anyone disagree?
>> > >> >
>> > >> > How long should we let the vote run, the std 72 biz hours or
>> > >> > should we
>> > >> plan
>> > >> > for more to allow folks more time to test?
>> > >> >
>> > >> > Patrick
>> > >> >
>> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>> > >> > <rg...@itevenworks.net> wrote:
>> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
>> > >> > >
>> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
>> > trying
>> > >> > >> to fix some of the jobs that were broken during the recent
>> > >> > >> host upgrade.
>> > >> > >>
>> > >> > >
>> > >> > > How are things looking? Is it likely that we can have a 3.5.0
>> > >> > > alpha release week or are we still blocked on Jenkins?
>> > >> > >
>> > >> > >
>> > >> > > -rgs
>> > >> > >
>> > >> > >
>> > >> > >
>> > >> > >
>> > >> > >
>> > >> > >
>> > >> > >> Patrick
>> > >> > >>
>> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>> > >> > >> <mi...@cs.stanford.edu>
>> > >> > >> wrote:
>> > >> > >> > I'll check in ZOOKEEPER-1683.
>> > >> > >> >
>> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>> > >> > >> > <sh...@gmail.com>
>> > >> > >> wrote:
>> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and
>> > >> > >> >> all
>> > >> > >> subsequent
>> > >> > >> >> changes were formatting as suggested by Rakesh.
>> > >> > >> >>
>> > >> > >> >>
>> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
>> > >> > >> >> <phunt@apache.org
>> > >
>> > >> > wrote:
>> > >> > >> >>
>> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
>> > >> > >> >>> for
>> > e.g.
>> > >> > >> >>> findbugs issues. At the very least our build/test/ci
>> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
>> > >> > >> >>> startServer fix
>> > and
>> > >> > >> >>> some other flakeys that have been addressed go a long way
>> > >> > >> >>> on
>> > that
>> > >> > >> >>> issue) but I think the findbugs problem should be cleaned
>> > >> > >> >>> up before we cut a release. I started a separate thread to
>> > >> > >> >>> discuss
>> > >> the
>> > >> > findbugs issue.
>> > >> > >> >>>
>> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>> > >> > >> >>>
>> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
>> > >> > >> >>>
>> > >> > >> >>> Patrick
>> > >> > >> >>>
>> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
>> > >> > >> >>> > My take:
>> > >> > >> >>> >
>> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can
>> > >> > >> >>> > go
>> > in.
>> > >> > >> >>> > See
>> > >> > >> the
>> > >> > >> >>> jira for comments.
>> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
>> > >> > >> >>> > - I'd rather not have the first alpha depending on
>> > >> > >> >>> > ZK-1919
>> > and
>> > >> > >> ZK-1910,
>> > >> > >> >>> we can leave it for the second alpha.
>> > >> > >> >>> >
>> > >> > >> >>> > If you agree with this, then we should be able to cut a
>> > >> > >> >>> > candidate by
>> > >> > >> the
>> > >> > >> >>> end of this week.
>> > >> > >> >>> >
>> > >> > >> >>> > -Flavio
>> > >> > >> >>> >
>> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
>> > >> > >> >>> > <ph...@apache.org>
>> > >> wrote:
>> > >> > >> >>> >
>> > >> > >> >>> >> Per my previous note you can now see the c client test
>> > >> > >> >>> >> log output
>> > >> > >> here
>> > >> > >> >>> >> in the "build artifacts" section:
>> > >> > >> >>> >>
>> > >> > >> >>>
>> > >> > >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
>> > >> > >> r-
>> > >> > trunk
>> > >> > >> /2372/
>> > >> > >> >>> >>
>> > >> > >> >>> >> Patrick
>> > >> > >> >>> >>
>> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>> > >> > >> >>> >> <ph...@apache.org>
>> > >> > >> wrote:
>> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear
>> > >> > >> >>> >>> to me which
>> > >> > >> >>> >>> one(s?) is new?)
>> > >> > >> >>> >>>
>> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
>> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
>> > >> > >> >>> >>> updated the "archive" list
>> > >> > >> to
>> > >> > >> >>> >>> include the c tests stdout redirect. So while it won't
>> > >> > >> >>> >>> go
>> > to
>> > >> > >> console
>> > >> > >> >>> >>> at least we can debug when there is a failure.
>> > >> > >> >>> >>>
>> > >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
>> > server
>> > >> > >> support
>> > >> > >> >>> >>> and it looks like that should be ready soon.
>> > >> > >> >>> >>>
>> > >> > >> >>> >>> Raul also requested that someone prioritize reviewing
>> > >> > >> "ZOOKEEPER-1919
>> > >> > >> >>> >>> Update the C implementation of removeWatches to have
>> > >> > >> >>> >>> it
>> > >> > match
>> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
>> > >> Flavio/Michi?
>> > >> > >> >>> >>>
>> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
>> > >> > >> >>> >>> reconfig
>> > >> > >> test -
>> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
>> > >> > >> >>> >>>
>> > >> > >> >>> >>>
>> > >> > >> >>> >>> Based on previous comments it looks like we're pretty
>> > close.
>> > >> > >> >>> >>> Do
>> > >> > >> folks
>> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
>> > >> > >> >>> >>> (with a few
>> > >> > >> pending
>> > >> > >> >>> >>> as above)
>> > >> > >> >>> >>>
>> > >> > >> >>> >>> Patrick
>> > >> > >> >>> >>>
>> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
>> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
>> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> > >> > >> >>> <fp...@yahoo.com.invalid>
>> > >> > >> >>> >>>> wrote:
>> > >> > >> >>> >>>>>
>> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
>> > >> > >> >>> >>>>> release
>> > an
>> > >> > >> >>> >>>>> alpha
>> > >> > >> >>> version
>> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
>> > >> > >> >>> >>>> weeks
>> > time?
>> > >> > >> >>> >>>>>
>> > >> > >> >>> >>>>
>> > >> > >> >>> >>>> +1
>> > >> > >> >>> >>>>
>> > >> > >> >>> >>>> -rgs
>> > >> > >> >>> >>>>
>> > >> > >> >>> >>>>> -Flavio
>> > >> > >> >>> >>>>>
>> > >> > >> >>> >>>>>
>> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
>> > Segalés <
>> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
>> > >> > >> >>> >>>>>
>> > >> > >> >>> >>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
>> > >> > >> >>> >>>>>> <ph...@apache.org>
>> > >> > wrote:
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8
>> > >> > >> >>> >>>>>>> in
>> > the
>> > >> > >> >>> >>>>>>> last
>> > >> > >> >>> check).
>> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
>> > >> > >> >>> >>>>>>> Camille is
>> > >> > >> threatening
>> > >> > >> >>> to
>> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in general
>> > >> > >> >>> >>>>>>> on
>> > the
>> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
>> > >> > >> >>> >>>>>>>
>> > >> > >> >>> >>>>>>> So here's something else we might consider -
>> > >> > >> >>> >>>>>>> should we drop
>> > >> > >> jdk6
>> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
>> > >> > >> >>> >>>>>>> but I suspect
>> > >> > >> some
>> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
>> > >> > >> >>> >>>>>>> forward though,
>> > >> > >> >>> can't
>> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
>> > currently
>> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>> > >> > >> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> > >> > >> >>> >>>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>> Extra eyes/review for
>> > >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
>> > >> > >> >>> >>>>>> Observers with the
>> > >> > >> >>> upcoming
>> > >> > >> >>> >>>>>> alpha release will see there network usage go
>> wild...).
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>> -rgs
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>> Patrick
>> > >> > >> >>> >>>>>>>
>> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
>> > >> > >> >>> >>>>>>>> but I need a +1
>> > >> > >> >>> >>>> there. I
>> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
>> > >> > >> >>> >>>>>>> failed in the
>> > >> > >> last QA
>> > >> > >> >>> run
>> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest,
>> > >> > >> >>> >>>>>>> and
>> > in
>> > >> > >> >>> >>>>>>> fact
>> > >> > >> it
>> > >> > >> >>> fails
>> > >> > >> >>> >>>> in
>> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't
>> > >> > >> >>> >>>>>>> seem
>> > to
>> > >> > >> >>> >>>>>>> be
>> > >> > >> >>> related
>> > >> > >> >>> >>>> to the
>> > >> > >> >>> >>>>>>> patch.
>> > >> > >> >>> >>>>>>>>
>> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
>> > >> > >> >>> >>>>>>>> anything concrete
>> > >> > >> about
>> > >> > >> >>> it.
>> > >> > >> >>> >>>>>>>>
>> > >> > >> >>> >>>>>>>> -Flavio
>> > >> > >> >>> >>>>>>>>
>> > >> > >> >>> >>>>>>>>
>> > >> > >> >>> >>>>>>>>
>> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
>> > >> > >> phunt@apache.org>
>> > >> > >> >>> >>>> wrote:
>> > >> > >> >>> >>>>>>>>
>> > >> > >> >>> >>>>>>>>
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
>> > >> > >> reviewed/finalized
>> > >> > >> >>> before
>> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an
>> > >> > >> >>> >>>>>>>>> RC
>> > >> > soonish...
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>>> Patrick
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
>> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid> wrote:
>> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>> > >> > >> >>> >>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in.
>> > >> > >> >>> >>>>>>>>>> They are
>> > >> > >> both
>> > >> > >> >>> >>>> patch
>> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
>> > >> > >> >>> >>>>>>> open because we
>> > >> > >> >>> need a
>> > >> > >> >>> >>>> 3.4
>> > >> > >> >>> >>>>>>> patch.
>> > >> > >> >>> >>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>> -Flavio
>> > >> > >> >>> >>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
>> > >> > >> >>> wrote:
>> > >> > >> >>> >>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
>> > while, a
>> > >> > >> >>> >>>>>>>>>>> few
>> > >> > >> >>> people
>> > >> > >> >>> >>>> have
>> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
>> > >> > >> >>> >>>>>>>>>>> personally
>> > >> > >> that
>> > >> > >> >>> they
>> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the first
>> > >> > >> >>> >>>>>>>>>>> 3.5
>> > >> > release.
>> > >> > >> Every
>> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
>> > >> > >> >>> >>>>>>>>>>> perfection we'll
>> > >> > >> never
>> > >> > >> >>> get
>> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great
>> > >> > >> >>> >>>>>>>>>>> new features,
>> > >> > >> >>> lots of
>> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
>> > >> > >> >>> >>>>>>>>>>> that folks can
>> > >> > >> use
>> > >> > >> >>> it,
>> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
>> > >> > >> >>> >>>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
>> > >> > >> >>> >>>>>>>>>>> for the known
>> > >> > >> >>> flakey
>> > >> > >> >>> >>>> test
>> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
>> > >> > trunk.
>> > >> > >> >>> >>>>>>>>>>> Note
>> > >> > >> that
>> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
>> > >> > >> >>> >>>>>>>>>>> jdk7
>> > and
>> > >> > jdk8.
>> > >> > >> >>> >>>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan
>> > >> > >> >>> >>>>>>>>>>> our
>> > >> > >> releases:
>> > >> > >> >>> >>>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
>> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
>> > for
>> > >> > >> >>> >>>>>>>>>>> some
>> > >> > >> time.
>> > >> > >> >>> >>>> What I
>> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what
>> > >> > >> >>> >>>>>>>>>>> we
>> > did
>> > >> > >> >>> >>>>>>>>>>> for
>> > >> > >> 3.4.
>> > >> > >> >>> >>>> (this is
>> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
>> > Hadoop 2
>> > >> > >> release
>> > >> > >> >>> >>>> cycle)
>> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
>> > >> > >> >>> >>>>>>>>>>> something people
>> > >> > >> can run
>> > >> > >> >>> >>>> and
>> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
>> > >> > >> >>> >>>>>>>>>>> and
>> > feel
>> > >> > >> >>> comfortable
>> > >> > >> >>> >>>> with
>> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to
>> beta.
>> > >> > >> >>> >>>>>>>>>>> Once we
>> > >> > >> get
>> > >> > >> >>> >>>> some
>> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
>> > >> > and
>> > >> > >> >>> >>>>>>>>>>> look at
>> > >> > >> >>> making
>> > >> > >> >>> >>>> it
>> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become
>> > >> > >> >>> >>>>>>>>>>> the
>> > >> > >> >>> "current/stable"
>> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> > >> > >> >>> >>>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> e.g.
>> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
>> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers) 3.5.3-beta
>> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
>> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
>> > >> > >> >>> >>>>>>>>>>> also not
>> > >> > >> "stable" vs
>> > >> > >> >>> >>>> 3.4.x,
>> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
>> > >> > >> >>> >>>>>>>>>>> expect things to
>> > >> > >> shake
>> > >> > >> >>> >>>> out)
>> > >> > >> >>> >>>>>>>>>>> 3.5.7
>> > >> > >> >>> >>>>>>>>>>> ....
>> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
>> > production
>> > >> > >> >>> >>>>>>>>>>> use,
>> > >> > >> >>> stable,
>> > >> > >> >>> >>>>>>> etc...
>> > >> > >> >>> >>>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
>> > >> > >> >>> >>>>>>>>>>> these something
>> > >> > >> that
>> > >> > >> >>> >>>> should
>> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> > >> > >> >>> >>>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
>> > >> > >> >>> >>>>>>>>>>> days. If folks
>> > >> > >> find
>> > >> > >> >>> >>>> this a
>> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to
>> > >> > >> >>> >>>>>>>>>>> cut
>> > an
>> > >> RC.
>> > >> > >> >>> >>>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>>> Patrick
>> > >> > >> >>> >>>>>>>>>>
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>>>
>> > >> > >> >>> >>>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >>>>>>
>> > >> > >> >>> >
>> > >> > >> >>>
>> > >> > >>
>> > >>
>> > >>
>> >
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Hi Rakesh,

Thanks for looking at this. In general even if we find the bug since we
should test it before committing a fix, it seems better to remove the test
for now and debug this on a build machine. I'm trying to get access to it.

Looking at this log:
https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2380/testReport/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig/

Something weird is going on. Sever 3 hasn't started yet, but version 200000000
is already being sent around as committed!

2014-07-21 10:44:50,901 [myid:2] - INFO
[WorkerReceiver[myid=2]:FastLeaderElection$Messenger$WorkerReceiver@293]
- 2 Received version: 200000000 my version: 0


and also in leader election messages.

Also weird is that the version of 2 is 0 as if it is a joiner, whereas we
explicitly started it with 100000000.
Then it makes sense that the new config can't be committed since its
version is not high enough...

I wonder if its possible that not all servers from the previous test are
dead and they are interfering...


On Tue, Jul 22, 2014 at 3:53 AM, Rakesh R <ra...@huawei.com> wrote:

> Hi Alex,
>
> Yeah it is consistently passing in my machine also.
>
>
> I have quickly gone through the
> testCurrentObserverIsParticipantInNewConfig failure logs in
> PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version) has
> not taken and still leader election is seeing 100000000 (n.config version).
> Unfortunately I didn't find the reason for not considering the updated
> config version.
>
>
> Reference:
> https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig
>
> 2014-07-22 06:38:00,330 [myid:1] - INFO
>  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] -
> Notification time out: 51200
> 2014-07-22 06:38:00,330 [myid:1] - INFO
>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> state)100000000 (n.config version)
> 2014-07-22 06:38:00,331 [myid:2] - INFO
>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> state)100000000 (n.config version)
> 2014-07-22 06:38:00,330 [myid:2] - INFO
>  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] -
> Notification time out: 51200
> 2014-07-22 06:38:00,331 [myid:0] - INFO
>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> state)100000000 (n.config version)
> 2014-07-22 06:38:00,331 [myid:2] - INFO
>  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2
> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> state)100000000 (n.config version)
>
>
> 2014-07-22 06:38:00,332 [myid:0] - INFO
>  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2
> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> state)100000000 (n.config version)
> 2014-07-22 06:38:00,332 [myid:1] - INFO
>  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2
> (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1
> (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my
> state)100000000 (n.config version)
>
>
> -Rakesh
>
> -----Original Message-----
> From: Alexander Shraer [mailto:shralex@gmail.com]
> Sent: 22 July 2014 11:57
> To: dev@zookeeper.apache.org
> Subject: Re: ZooKeeper 3.5.0-alpha planning
>
> I tried to look into it, but the test consistently passes locally on two
> machines.
> I don't currently have access to the build machine, but I can try to ask
> for access.
> Unless anyone has a better suggestion, we could remove the failing test in
> the meanwhile and open a JIRA to add it back...
>
>
> On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org> wrote:
>
> > I'm seeing alot of test failures in
> > testCurrentObserverIsParticipantInNewConfig could someone take a look?
> > Seems related to ZOOKEEPER-1807 recent commit.
> >
> >
> > https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
> > 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
> > tabpanel#comment-14069024
> >
> > Patrick
> >
> > On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
> > <ra...@gmail.com> wrote:
> > > lgtm +1
> > >
> > >
> > > On Mon, Jul 21, 2014 at 11:37 PM, FPJ
> > > <fp...@yahoo.com.invalid>
> > wrote:
> > >
> > >> +1 for having an RC this week. Since this is an alpha release, I
> > >> +think
> > 72
> > >> biz hours is enough for the vote.
> > >>
> > >> -Flavio
> > >>
> > >> > -----Original Message-----
> > >> > From: Patrick Hunt [mailto:phunt@apache.org]
> > >> > Sent: 21 July 2014 18:55
> > >> > To: DevZooKeeper
> > >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> > >> >
> > >> > I fixed a number of issues. I also started a few threads with
> > >> > builds@
> > >> > - the ulimit issue is still outstanding. Hongchao and I worked
> > through a
> > >> > number of findbugs issues, it's not closed yet but it's pretty
> close.
> > >> >
> > >> > I don't see why we can't create an RC and start voting this week
> > though.
> > >> > Anyone disagree?
> > >> >
> > >> > How long should we let the vote run, the std 72 biz hours or
> > >> > should we
> > >> plan
> > >> > for more to allow folks more time to test?
> > >> >
> > >> > Patrick
> > >> >
> > >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
> > >> > <rg...@itevenworks.net> wrote:
> > >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
> > >> > >
> > >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
> > trying
> > >> > >> to fix some of the jobs that were broken during the recent
> > >> > >> host upgrade.
> > >> > >>
> > >> > >
> > >> > > How are things looking? Is it likely that we can have a 3.5.0
> > >> > > alpha release week or are we still blocked on Jenkins?
> > >> > >
> > >> > >
> > >> > > -rgs
> > >> > >
> > >> > >
> > >> > >
> > >> > >
> > >> > >
> > >> > >
> > >> > >> Patrick
> > >> > >>
> > >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> > >> > >> <mi...@cs.stanford.edu>
> > >> > >> wrote:
> > >> > >> > I'll check in ZOOKEEPER-1683.
> > >> > >> >
> > >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> > >> > >> > <sh...@gmail.com>
> > >> > >> wrote:
> > >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and
> > >> > >> >> all
> > >> > >> subsequent
> > >> > >> >> changes were formatting as suggested by Rakesh.
> > >> > >> >>
> > >> > >> >>
> > >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt
> > >> > >> >> <phunt@apache.org
> > >
> > >> > wrote:
> > >> > >> >>
> > >> > >> >>> I'm concerned that the CI tests are all failing due to,
> > >> > >> >>> for
> > e.g.
> > >> > >> >>> findbugs issues. At the very least our build/test/ci
> > >> > >> >>> should be pretty clean - some flakeys is ok (the recent
> > >> > >> >>> startServer fix
> > and
> > >> > >> >>> some other flakeys that have been addressed go a long way
> > >> > >> >>> on
> > that
> > >> > >> >>> issue) but I think the findbugs problem should be cleaned
> > >> > >> >>> up before we cut a release. I started a separate thread to
> > >> > >> >>> discuss
> > >> the
> > >> > findbugs issue.
> > >> > >> >>>
> > >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> > >> > >> >>>
> > >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
> > >> > >> >>>
> > >> > >> >>> Patrick
> > >> > >> >>>
> > >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> > >> > >> >>> <fp...@yahoo.com.invalid> wrote:
> > >> > >> >>> > My take:
> > >> > >> >>> >
> > >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can
> > >> > >> >>> > go
> > in.
> > >> > >> >>> > See
> > >> > >> the
> > >> > >> >>> jira for comments.
> > >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
> > >> > >> >>> > - I'd rather not have the first alpha depending on
> > >> > >> >>> > ZK-1919
> > and
> > >> > >> ZK-1910,
> > >> > >> >>> we can leave it for the second alpha.
> > >> > >> >>> >
> > >> > >> >>> > If you agree with this, then we should be able to cut a
> > >> > >> >>> > candidate by
> > >> > >> the
> > >> > >> >>> end of this week.
> > >> > >> >>> >
> > >> > >> >>> > -Flavio
> > >> > >> >>> >
> > >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt
> > >> > >> >>> > <ph...@apache.org>
> > >> wrote:
> > >> > >> >>> >
> > >> > >> >>> >> Per my previous note you can now see the c client test
> > >> > >> >>> >> log output
> > >> > >> here
> > >> > >> >>> >> in the "build artifacts" section:
> > >> > >> >>> >>
> > >> > >> >>>
> > >> > >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
> > >> > >> r-
> > >> > trunk
> > >> > >> /2372/
> > >> > >> >>> >>
> > >> > >> >>> >> Patrick
> > >> > >> >>> >>
> > >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> > >> > >> >>> >> <ph...@apache.org>
> > >> > >> wrote:
> > >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear
> > >> > >> >>> >>> to me which
> > >> > >> >>> >>> one(s?) is new?)
> > >> > >> >>> >>>
> > >> > >> >>> >>> Looks like the autoconf issue I reported is hitting
> > >> > >> >>> >>> the upgraded apache jenkins instances as well. I've
> > >> > >> >>> >>> updated the "archive" list
> > >> > >> to
> > >> > >> >>> >>> include the c tests stdout redirect. So while it won't
> > >> > >> >>> >>> go
> > to
> > >> > >> console
> > >> > >> >>> >>> at least we can debug when there is a failure.
> > >> > >> >>> >>>
> > >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
> > server
> > >> > >> support
> > >> > >> >>> >>> and it looks like that should be ready soon.
> > >> > >> >>> >>>
> > >> > >> >>> >>> Raul also requested that someone prioritize reviewing
> > >> > >> "ZOOKEEPER-1919
> > >> > >> >>> >>> Update the C implementation of removeWatches to have
> > >> > >> >>> >>> it
> > >> > match
> > >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
> > >> Flavio/Michi?
> > >> > >> >>> >>>
> > >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
> > >> > >> >>> >>> reconfig
> > >> > >> test -
> > >> > >> >>> >>> kudos on helping cleanup the build/test infra!
> > >> > >> >>> >>>
> > >> > >> >>> >>>
> > >> > >> >>> >>> Based on previous comments it looks like we're pretty
> > close.
> > >> > >> >>> >>> Do
> > >> > >> folks
> > >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point?
> > >> > >> >>> >>> (with a few
> > >> > >> pending
> > >> > >> >>> >>> as above)
> > >> > >> >>> >>>
> > >> > >> >>> >>> Patrick
> > >> > >> >>> >>>
> > >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez
> > >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
> > >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> > >> > >> >>> <fp...@yahoo.com.invalid>
> > >> > >> >>> >>>> wrote:
> > >> > >> >>> >>>>>
> > >> > >> >>> >>>>> Just so that we don´t delay too much, what if we
> > >> > >> >>> >>>>> release
> > an
> > >> > >> >>> >>>>> alpha
> > >> > >> >>> version
> > >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3
> > >> > >> >>> >>>> weeks
> > time?
> > >> > >> >>> >>>>>
> > >> > >> >>> >>>>
> > >> > >> >>> >>>> +1
> > >> > >> >>> >>>>
> > >> > >> >>> >>>> -rgs
> > >> > >> >>> >>>>
> > >> > >> >>> >>>>> -Flavio
> > >> > >> >>> >>>>>
> > >> > >> >>> >>>>>
> > >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
> > Segalés <
> > >> > >> >>> >>>> rgs@itevenworks.net> wrote:
> > >> > >> >>> >>>>>
> > >> > >> >>> >>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt
> > >> > >> >>> >>>>>> <ph...@apache.org>
> > >> > wrote:
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8
> > >> > >> >>> >>>>>>> in
> > the
> > >> > >> >>> >>>>>>> last
> > >> > >> >>> check).
> > >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and
> > >> > >> >>> >>>>>>> Camille is
> > >> > >> threatening
> > >> > >> >>> to
> > >> > >> >>> >>>>>>> commit 1863. I see some great progress in general
> > >> > >> >>> >>>>>>> on
> > the
> > >> > >> >>> >>>>>>> patch availables queue, which is great to see.
> > >> > >> >>> >>>>>>>
> > >> > >> >>> >>>>>>> So here's something else we might consider -
> > >> > >> >>> >>>>>>> should we drop
> > >> > >> jdk6
> > >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle
> > >> > >> >>> >>>>>>> but I suspect
> > >> > >> some
> > >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move
> > >> > >> >>> >>>>>>> forward though,
> > >> > >> >>> can't
> > >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
> > currently
> > >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> > >> > >> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> > >> > >> >>> >>>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>> Extra eyes/review for
> > >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> > >> > >> >>> >>>>>> would be appreciated (otherwise anyone using
> > >> > >> >>> >>>>>> Observers with the
> > >> > >> >>> upcoming
> > >> > >> >>> >>>>>> alpha release will see there network usage go
> wild...).
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>> -rgs
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>> Patrick
> > >> > >> >>> >>>>>>>
> > >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> > >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> > >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already,
> > >> > >> >>> >>>>>>>> but I need a +1
> > >> > >> >>> >>>> there. I
> > >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest
> > >> > >> >>> >>>>>>> failed in the
> > >> > >> last QA
> > >> > >> >>> run
> > >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest,
> > >> > >> >>> >>>>>>> and
> > in
> > >> > >> >>> >>>>>>> fact
> > >> > >> it
> > >> > >> >>> fails
> > >> > >> >>> >>>> in
> > >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't
> > >> > >> >>> >>>>>>> seem
> > to
> > >> > >> >>> >>>>>>> be
> > >> > >> >>> related
> > >> > >> >>> >>>> to the
> > >> > >> >>> >>>>>>> patch.
> > >> > >> >>> >>>>>>>>
> > >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say
> > >> > >> >>> >>>>>>>> anything concrete
> > >> > >> about
> > >> > >> >>> it.
> > >> > >> >>> >>>>>>>>
> > >> > >> >>> >>>>>>>> -Flavio
> > >> > >> >>> >>>>>>>>
> > >> > >> >>> >>>>>>>>
> > >> > >> >>> >>>>>>>>
> > >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
> > >> > >> phunt@apache.org>
> > >> > >> >>> >>>> wrote:
> > >> > >> >>> >>>>>>>>
> > >> > >> >>> >>>>>>>>
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> > >> > >> reviewed/finalized
> > >> > >> >>> before
> > >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an
> > >> > >> >>> >>>>>>>>> RC
> > >> > soonish...
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>>> Patrick
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio
> > >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid> wrote:
> > >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> > >> > >> >>> >>>>>>>>>>
> > >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in.
> > >> > >> >>> >>>>>>>>>> They are
> > >> > >> both
> > >> > >> >>> >>>> patch
> > >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still
> > >> > >> >>> >>>>>>> open because we
> > >> > >> >>> need a
> > >> > >> >>> >>>> 3.4
> > >> > >> >>> >>>>>>> patch.
> > >> > >> >>> >>>>>>>>>>
> > >> > >> >>> >>>>>>>>>> -Flavio
> > >> > >> >>> >>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>
> > >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
> > >> > >> >>> >>>>>>>>>> <ph...@apache.org>
> > >> > >> >>> wrote:
> > >> > >> >>> >>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
> > while, a
> > >> > >> >>> >>>>>>>>>>> few
> > >> > >> >>> people
> > >> > >> >>> >>>> have
> > >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
> > >> > >> >>> >>>>>>>>>>> personally
> > >> > >> that
> > >> > >> >>> they
> > >> > >> >>> >>>>>>>>>>> would like to see some progress on the first
> > >> > >> >>> >>>>>>>>>>> 3.5
> > >> > release.
> > >> > >> Every
> > >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for
> > >> > >> >>> >>>>>>>>>>> perfection we'll
> > >> > >> never
> > >> > >> >>> get
> > >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great
> > >> > >> >>> >>>>>>>>>>> new features,
> > >> > >> >>> lots of
> > >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so
> > >> > >> >>> >>>>>>>>>>> that folks can
> > >> > >> use
> > >> > >> >>> it,
> > >> > >> >>> >>>>>>>>>>> test it, and give feedback.
> > >> > >> >>> >>>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except
> > >> > >> >>> >>>>>>>>>>> for the known
> > >> > >> >>> flakey
> > >> > >> >>> >>>> test
> > >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
> > >> > trunk.
> > >> > >> >>> >>>>>>>>>>> Note
> > >> > >> that
> > >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on
> > >> > >> >>> >>>>>>>>>>> jdk7
> > and
> > >> > jdk8.
> > >> > >> >>> >>>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan
> > >> > >> >>> >>>>>>>>>>> our
> > >> > >> releases:
> > >> > >> >>> >>>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a
> > >> > >> >>> >>>>>>>>>>> 3.5.x-stable
> > for
> > >> > >> >>> >>>>>>>>>>> some
> > >> > >> time.
> > >> > >> >>> >>>> What I
> > >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what
> > >> > >> >>> >>>>>>>>>>> we
> > did
> > >> > >> >>> >>>>>>>>>>> for
> > >> > >> 3.4.
> > >> > >> >>> >>>> (this is
> > >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
> > Hadoop 2
> > >> > >> release
> > >> > >> >>> >>>> cycle)
> > >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases,
> > >> > >> >>> >>>>>>>>>>> something people
> > >> > >> can run
> > >> > >> >>> >>>> and
> > >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers
> > >> > >> >>> >>>>>>>>>>> and
> > feel
> > >> > >> >>> comfortable
> > >> > >> >>> >>>> with
> > >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to
> beta.
> > >> > >> >>> >>>>>>>>>>> Once we
> > >> > >> get
> > >> > >> >>> >>>> some
> > >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
> > >> > and
> > >> > >> >>> >>>>>>>>>>> look at
> > >> > >> >>> making
> > >> > >> >>> >>>> it
> > >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become
> > >> > >> >>> >>>>>>>>>>> the
> > >> > >> >>> "current/stable"
> > >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> > >> > >> >>> >>>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> e.g.
> > >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3
> > >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers) 3.5.3-beta
> > >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
> > >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but
> > >> > >> >>> >>>>>>>>>>> also not
> > >> > >> "stable" vs
> > >> > >> >>> >>>> 3.4.x,
> > >> > >> >>> >>>>>>>>>>> maybe use it for production but we still
> > >> > >> >>> >>>>>>>>>>> expect things to
> > >> > >> shake
> > >> > >> >>> >>>> out)
> > >> > >> >>> >>>>>>>>>>> 3.5.7
> > >> > >> >>> >>>>>>>>>>> ....
> > >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
> > production
> > >> > >> >>> >>>>>>>>>>> use,
> > >> > >> >>> stable,
> > >> > >> >>> >>>>>>> etc...
> > >> > >> >>> >>>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of
> > >> > >> >>> >>>>>>>>>>> these something
> > >> > >> that
> > >> > >> >>> >>>> should
> > >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> > >> > >> >>> >>>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple
> > >> > >> >>> >>>>>>>>>>> days. If folks
> > >> > >> find
> > >> > >> >>> >>>> this a
> > >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to
> > >> > >> >>> >>>>>>>>>>> cut
> > an
> > >> RC.
> > >> > >> >>> >>>>>>>>>>>
> > >> > >> >>> >>>>>>>>>>> Patrick
> > >> > >> >>> >>>>>>>>>>
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>>>
> > >> > >> >>> >>>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >>>>>>
> > >> > >> >>> >
> > >> > >> >>>
> > >> > >>
> > >>
> > >>
> >
>

RE: ZooKeeper 3.5.0-alpha planning

Posted by Rakesh R <ra...@huawei.com>.
Hi Alex,

Yeah it is consistently passing in my machine also.


I have quickly gone through the testCurrentObserverIsParticipantInNewConfig failure logs in PreCommit-ZOOKEEPER-Build. It looks like 200000000 (n.config version) has not taken and still leader election is seeing 100000000 (n.config version). Unfortunately I didn't find the reason for not considering the updated config version.


Reference: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/2213/testReport/junit/org.apache.zookeeper.server.quorum/ReconfigRecoveryTest/testCurrentObserverIsParticipantInNewConfig

2014-07-22 06:38:00,330 [myid:1] - INFO  [QuorumPeer[myid=1]/127.0.0.1:11298:FastLeaderElection@922] - Notification time out: 51200
2014-07-22 06:38:00,330 [myid:1] - INFO  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2 (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1 (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my state)100000000 (n.config version)
2014-07-22 06:38:00,331 [myid:2] - INFO  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2 (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1 (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my state)100000000 (n.config version)
2014-07-22 06:38:00,330 [myid:2] - INFO  [QuorumPeer[myid=2]/127.0.0.1:11301:FastLeaderElection@922] - Notification time out: 51200
2014-07-22 06:38:00,331 [myid:0] - INFO  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2 (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1 (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my state)100000000 (n.config version)
2014-07-22 06:38:00,331 [myid:2] - INFO  [WorkerReceiver[myid=2]:FastLeaderElection@682] - Notification: 2 (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1 (n.round), LOOKING (n.state), 1 (n.sid), 0x1 (n.peerEPoch), LOOKING (my state)100000000 (n.config version)


2014-07-22 06:38:00,332 [myid:0] - INFO  [WorkerReceiver[myid=0]:FastLeaderElection@682] - Notification: 2 (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1 (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my state)100000000 (n.config version)
2014-07-22 06:38:00,332 [myid:1] - INFO  [WorkerReceiver[myid=1]:FastLeaderElection@682] - Notification: 2 (message format version), 1 (n.leader), 0x100000005 (n.zxid), 0x1 (n.round), LOOKING (n.state), 2 (n.sid), 0x1 (n.peerEPoch), LOOKING (my state)100000000 (n.config version)


-Rakesh

-----Original Message-----
From: Alexander Shraer [mailto:shralex@gmail.com] 
Sent: 22 July 2014 11:57
To: dev@zookeeper.apache.org
Subject: Re: ZooKeeper 3.5.0-alpha planning

I tried to look into it, but the test consistently passes locally on two machines.
I don't currently have access to the build machine, but I can try to ask for access.
Unless anyone has a better suggestion, we could remove the failing test in the meanwhile and open a JIRA to add it back...


On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org> wrote:

> I'm seeing alot of test failures in
> testCurrentObserverIsParticipantInNewConfig could someone take a look?
> Seems related to ZOOKEEPER-1807 recent commit.
>
>
> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=
> 14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-
> tabpanel#comment-14069024
>
> Patrick
>
> On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan 
> <ra...@gmail.com> wrote:
> > lgtm +1
> >
> >
> > On Mon, Jul 21, 2014 at 11:37 PM, FPJ 
> > <fp...@yahoo.com.invalid>
> wrote:
> >
> >> +1 for having an RC this week. Since this is an alpha release, I 
> >> +think
> 72
> >> biz hours is enough for the vote.
> >>
> >> -Flavio
> >>
> >> > -----Original Message-----
> >> > From: Patrick Hunt [mailto:phunt@apache.org]
> >> > Sent: 21 July 2014 18:55
> >> > To: DevZooKeeper
> >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> >> >
> >> > I fixed a number of issues. I also started a few threads with 
> >> > builds@
> >> > - the ulimit issue is still outstanding. Hongchao and I worked
> through a
> >> > number of findbugs issues, it's not closed yet but it's pretty close.
> >> >
> >> > I don't see why we can't create an RC and start voting this week
> though.
> >> > Anyone disagree?
> >> >
> >> > How long should we let the vote run, the std 72 biz hours or 
> >> > should we
> >> plan
> >> > for more to allow folks more time to test?
> >> >
> >> > Patrick
> >> >
> >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés 
> >> > <rg...@itevenworks.net> wrote:
> >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
> >> > >
> >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
> trying
> >> > >> to fix some of the jobs that were broken during the recent 
> >> > >> host upgrade.
> >> > >>
> >> > >
> >> > > How are things looking? Is it likely that we can have a 3.5.0 
> >> > > alpha release week or are we still blocked on Jenkins?
> >> > >
> >> > >
> >> > > -rgs
> >> > >
> >> > >
> >> > >
> >> > >
> >> > >
> >> > >
> >> > >> Patrick
> >> > >>
> >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki 
> >> > >> <mi...@cs.stanford.edu>
> >> > >> wrote:
> >> > >> > I'll check in ZOOKEEPER-1683.
> >> > >> >
> >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer 
> >> > >> > <sh...@gmail.com>
> >> > >> wrote:
> >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and 
> >> > >> >> all
> >> > >> subsequent
> >> > >> >> changes were formatting as suggested by Rakesh.
> >> > >> >>
> >> > >> >>
> >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt 
> >> > >> >> <phunt@apache.org
> >
> >> > wrote:
> >> > >> >>
> >> > >> >>> I'm concerned that the CI tests are all failing due to, 
> >> > >> >>> for
> e.g.
> >> > >> >>> findbugs issues. At the very least our build/test/ci 
> >> > >> >>> should be pretty clean - some flakeys is ok (the recent 
> >> > >> >>> startServer fix
> and
> >> > >> >>> some other flakeys that have been addressed go a long way 
> >> > >> >>> on
> that
> >> > >> >>> issue) but I think the findbugs problem should be cleaned 
> >> > >> >>> up before we cut a release. I started a separate thread to 
> >> > >> >>> discuss
> >> the
> >> > findbugs issue.
> >> > >> >>>
> >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> >> > >> >>>
> >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
> >> > >> >>>
> >> > >> >>> Patrick
> >> > >> >>>
> >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira 
> >> > >> >>> <fp...@yahoo.com.invalid> wrote:
> >> > >> >>> > My take:
> >> > >> >>> >
> >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can 
> >> > >> >>> > go
> in.
> >> > >> >>> > See
> >> > >> the
> >> > >> >>> jira for comments.
> >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
> >> > >> >>> > - I'd rather not have the first alpha depending on 
> >> > >> >>> > ZK-1919
> and
> >> > >> ZK-1910,
> >> > >> >>> we can leave it for the second alpha.
> >> > >> >>> >
> >> > >> >>> > If you agree with this, then we should be able to cut a 
> >> > >> >>> > candidate by
> >> > >> the
> >> > >> >>> end of this week.
> >> > >> >>> >
> >> > >> >>> > -Flavio
> >> > >> >>> >
> >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt 
> >> > >> >>> > <ph...@apache.org>
> >> wrote:
> >> > >> >>> >
> >> > >> >>> >> Per my previous note you can now see the c client test 
> >> > >> >>> >> log output
> >> > >> here
> >> > >> >>> >> in the "build artifacts" section:
> >> > >> >>> >>
> >> > >> >>>
> >> > >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeepe
> >> > >> r-
> >> > trunk
> >> > >> /2372/
> >> > >> >>> >>
> >> > >> >>> >> Patrick
> >> > >> >>> >>
> >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt 
> >> > >> >>> >> <ph...@apache.org>
> >> > >> wrote:
> >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear 
> >> > >> >>> >>> to me which
> >> > >> >>> >>> one(s?) is new?)
> >> > >> >>> >>>
> >> > >> >>> >>> Looks like the autoconf issue I reported is hitting 
> >> > >> >>> >>> the upgraded apache jenkins instances as well. I've 
> >> > >> >>> >>> updated the "archive" list
> >> > >> to
> >> > >> >>> >>> include the c tests stdout redirect. So while it won't 
> >> > >> >>> >>> go
> to
> >> > >> console
> >> > >> >>> >>> at least we can debug when there is a failure.
> >> > >> >>> >>>
> >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
> server
> >> > >> support
> >> > >> >>> >>> and it looks like that should be ready soon.
> >> > >> >>> >>>
> >> > >> >>> >>> Raul also requested that someone prioritize reviewing
> >> > >> "ZOOKEEPER-1919
> >> > >> >>> >>> Update the C implementation of removeWatches to have 
> >> > >> >>> >>> it
> >> > match
> >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
> >> Flavio/Michi?
> >> > >> >>> >>>
> >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client 
> >> > >> >>> >>> reconfig
> >> > >> test -
> >> > >> >>> >>> kudos on helping cleanup the build/test infra!
> >> > >> >>> >>>
> >> > >> >>> >>>
> >> > >> >>> >>> Based on previous comments it looks like we're pretty
> close.
> >> > >> >>> >>> Do
> >> > >> folks
> >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point? 
> >> > >> >>> >>> (with a few
> >> > >> pending
> >> > >> >>> >>> as above)
> >> > >> >>> >>>
> >> > >> >>> >>> Patrick
> >> > >> >>> >>>
> >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez 
> >> > >> >>> >>> Segalés <rg...@itevenworks.net> wrote:
> >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >> > >> >>> <fp...@yahoo.com.invalid>
> >> > >> >>> >>>> wrote:
> >> > >> >>> >>>>>
> >> > >> >>> >>>>> Just so that we don´t delay too much, what if we 
> >> > >> >>> >>>>> release
> an
> >> > >> >>> >>>>> alpha
> >> > >> >>> version
> >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3 
> >> > >> >>> >>>> weeks
> time?
> >> > >> >>> >>>>>
> >> > >> >>> >>>>
> >> > >> >>> >>>> +1
> >> > >> >>> >>>>
> >> > >> >>> >>>> -rgs
> >> > >> >>> >>>>
> >> > >> >>> >>>>> -Flavio
> >> > >> >>> >>>>>
> >> > >> >>> >>>>>
> >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
> Segalés <
> >> > >> >>> >>>> rgs@itevenworks.net> wrote:
> >> > >> >>> >>>>>
> >> > >> >>> >>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt 
> >> > >> >>> >>>>>> <ph...@apache.org>
> >> > wrote:
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 
> >> > >> >>> >>>>>>> in
> the
> >> > >> >>> >>>>>>> last
> >> > >> >>> check).
> >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and 
> >> > >> >>> >>>>>>> Camille is
> >> > >> threatening
> >> > >> >>> to
> >> > >> >>> >>>>>>> commit 1863. I see some great progress in general 
> >> > >> >>> >>>>>>> on
> the
> >> > >> >>> >>>>>>> patch availables queue, which is great to see.
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>> So here's something else we might consider - 
> >> > >> >>> >>>>>>> should we drop
> >> > >> jdk6
> >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle 
> >> > >> >>> >>>>>>> but I suspect
> >> > >> some
> >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move 
> >> > >> >>> >>>>>>> forward though,
> >> > >> >>> can't
> >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
> currently
> >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> >> > >> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>> Extra eyes/review for
> >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >> > >> >>> >>>>>> would be appreciated (otherwise anyone using 
> >> > >> >>> >>>>>> Observers with the
> >> > >> >>> upcoming
> >> > >> >>> >>>>>> alpha release will see there network usage go wild...).
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>> -rgs
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>> Patrick
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira 
> >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already, 
> >> > >> >>> >>>>>>>> but I need a +1
> >> > >> >>> >>>> there. I
> >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest 
> >> > >> >>> >>>>>>> failed in the
> >> > >> last QA
> >> > >> >>> run
> >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest, 
> >> > >> >>> >>>>>>> and
> in
> >> > >> >>> >>>>>>> fact
> >> > >> it
> >> > >> >>> fails
> >> > >> >>> >>>> in
> >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't 
> >> > >> >>> >>>>>>> seem
> to
> >> > >> >>> >>>>>>> be
> >> > >> >>> related
> >> > >> >>> >>>> to the
> >> > >> >>> >>>>>>> patch.
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say 
> >> > >> >>> >>>>>>>> anything concrete
> >> > >> about
> >> > >> >>> it.
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>> -Flavio
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
> >> > >> phunt@apache.org>
> >> > >> >>> >>>> wrote:
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> >> > >> reviewed/finalized
> >> > >> >>> before
> >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an 
> >> > >> >>> >>>>>>>>> RC
> >> > soonish...
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>> Patrick
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio 
> >> > >> >>> >>>>>>>>> Junqueira <fp...@yahoo.com.invalid> wrote:
> >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in.
> >> > >> >>> >>>>>>>>>> They are
> >> > >> both
> >> > >> >>> >>>> patch
> >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still 
> >> > >> >>> >>>>>>> open because we
> >> > >> >>> need a
> >> > >> >>> >>>> 3.4
> >> > >> >>> >>>>>>> patch.
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>> -Flavio
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt 
> >> > >> >>> >>>>>>>>>> <ph...@apache.org>
> >> > >> >>> wrote:
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
> while, a
> >> > >> >>> >>>>>>>>>>> few
> >> > >> >>> people
> >> > >> >>> >>>> have
> >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me 
> >> > >> >>> >>>>>>>>>>> personally
> >> > >> that
> >> > >> >>> they
> >> > >> >>> >>>>>>>>>>> would like to see some progress on the first 
> >> > >> >>> >>>>>>>>>>> 3.5
> >> > release.
> >> > >> Every
> >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for 
> >> > >> >>> >>>>>>>>>>> perfection we'll
> >> > >> never
> >> > >> >>> get
> >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great 
> >> > >> >>> >>>>>>>>>>> new features,
> >> > >> >>> lots of
> >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so 
> >> > >> >>> >>>>>>>>>>> that folks can
> >> > >> use
> >> > >> >>> it,
> >> > >> >>> >>>>>>>>>>> test it, and give feedback.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except 
> >> > >> >>> >>>>>>>>>>> for the known
> >> > >> >>> flakey
> >> > >> >>> >>>> test
> >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
> >> > trunk.
> >> > >> >>> >>>>>>>>>>> Note
> >> > >> that
> >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on 
> >> > >> >>> >>>>>>>>>>> jdk7
> and
> >> > jdk8.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan 
> >> > >> >>> >>>>>>>>>>> our
> >> > >> releases:
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a 
> >> > >> >>> >>>>>>>>>>> 3.5.x-stable
> for
> >> > >> >>> >>>>>>>>>>> some
> >> > >> time.
> >> > >> >>> >>>> What I
> >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what 
> >> > >> >>> >>>>>>>>>>> we
> did
> >> > >> >>> >>>>>>>>>>> for
> >> > >> 3.4.
> >> > >> >>> >>>> (this is
> >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
> Hadoop 2
> >> > >> release
> >> > >> >>> >>>> cycle)
> >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases, 
> >> > >> >>> >>>>>>>>>>> something people
> >> > >> can run
> >> > >> >>> >>>> and
> >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers 
> >> > >> >>> >>>>>>>>>>> and
> feel
> >> > >> >>> comfortable
> >> > >> >>> >>>> with
> >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta.
> >> > >> >>> >>>>>>>>>>> Once we
> >> > >> get
> >> > >> >>> >>>> some
> >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
> >> > and
> >> > >> >>> >>>>>>>>>>> look at
> >> > >> >>> making
> >> > >> >>> >>>> it
> >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become 
> >> > >> >>> >>>>>>>>>>> the
> >> > >> >>> "current/stable"
> >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> e.g.
> >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3 
> >> > >> >>> >>>>>>>>>>> blockers) 3.5.2-alpha (0 blockers) 3.5.3-beta 
> >> > >> >>> >>>>>>>>>>> (apis locked) 3.5.4-beta 3.5.5-beta
> >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but 
> >> > >> >>> >>>>>>>>>>> also not
> >> > >> "stable" vs
> >> > >> >>> >>>> 3.4.x,
> >> > >> >>> >>>>>>>>>>> maybe use it for production but we still 
> >> > >> >>> >>>>>>>>>>> expect things to
> >> > >> shake
> >> > >> >>> >>>> out)
> >> > >> >>> >>>>>>>>>>> 3.5.7
> >> > >> >>> >>>>>>>>>>> ....
> >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
> production
> >> > >> >>> >>>>>>>>>>> use,
> >> > >> >>> stable,
> >> > >> >>> >>>>>>> etc...
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of 
> >> > >> >>> >>>>>>>>>>> these something
> >> > >> that
> >> > >> >>> >>>> should
> >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple 
> >> > >> >>> >>>>>>>>>>> days. If folks
> >> > >> find
> >> > >> >>> >>>> this a
> >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to 
> >> > >> >>> >>>>>>>>>>> cut
> an
> >> RC.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Patrick
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >
> >> > >> >>>
> >> > >>
> >>
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
I tried to look into it, but the test consistently passes locally on two
machines.
I don't currently have access to the build machine, but I can try to ask
for access.
Unless anyone has a better suggestion, we could remove the failing test in
the meanwhile
and open a JIRA to add it back...


On Mon, Jul 21, 2014 at 10:09 PM, Patrick Hunt <ph...@apache.org> wrote:

> I'm seeing alot of test failures in
> testCurrentObserverIsParticipantInNewConfig could someone take a look?
> Seems related to ZOOKEEPER-1807 recent commit.
>
>
> https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14069024
>
> Patrick
>
> On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
> <ra...@gmail.com> wrote:
> > lgtm +1
> >
> >
> > On Mon, Jul 21, 2014 at 11:37 PM, FPJ <fp...@yahoo.com.invalid>
> wrote:
> >
> >> +1 for having an RC this week. Since this is an alpha release, I think
> 72
> >> biz hours is enough for the vote.
> >>
> >> -Flavio
> >>
> >> > -----Original Message-----
> >> > From: Patrick Hunt [mailto:phunt@apache.org]
> >> > Sent: 21 July 2014 18:55
> >> > To: DevZooKeeper
> >> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> >> >
> >> > I fixed a number of issues. I also started a few threads with builds@
> >> > - the ulimit issue is still outstanding. Hongchao and I worked
> through a
> >> > number of findbugs issues, it's not closed yet but it's pretty close.
> >> >
> >> > I don't see why we can't create an RC and start voting this week
> though.
> >> > Anyone disagree?
> >> >
> >> > How long should we let the vote run, the std 72 biz hours or should we
> >> plan
> >> > for more to allow folks more time to test?
> >> >
> >> > Patrick
> >> >
> >> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
> >> > <rg...@itevenworks.net> wrote:
> >> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
> >> > >
> >> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm
> trying
> >> > >> to fix some of the jobs that were broken during the recent host
> >> > >> upgrade.
> >> > >>
> >> > >
> >> > > How are things looking? Is it likely that we can have a 3.5.0 alpha
> >> > > release week or are we still blocked on Jenkins?
> >> > >
> >> > >
> >> > > -rgs
> >> > >
> >> > >
> >> > >
> >> > >
> >> > >
> >> > >
> >> > >> Patrick
> >> > >>
> >> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> >> > >> <mi...@cs.stanford.edu>
> >> > >> wrote:
> >> > >> > I'll check in ZOOKEEPER-1683.
> >> > >> >
> >> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> >> > >> > <sh...@gmail.com>
> >> > >> wrote:
> >> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all
> >> > >> subsequent
> >> > >> >> changes were formatting as suggested by Rakesh.
> >> > >> >>
> >> > >> >>
> >> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <phunt@apache.org
> >
> >> > wrote:
> >> > >> >>
> >> > >> >>> I'm concerned that the CI tests are all failing due to, for
> e.g.
> >> > >> >>> findbugs issues. At the very least our build/test/ci should be
> >> > >> >>> pretty clean - some flakeys is ok (the recent startServer fix
> and
> >> > >> >>> some other flakeys that have been addressed go a long way on
> that
> >> > >> >>> issue) but I think the findbugs problem should be cleaned up
> >> > >> >>> before we cut a release. I started a separate thread to discuss
> >> the
> >> > findbugs issue.
> >> > >> >>>
> >> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> >> > >> >>>
> >> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
> >> > >> >>>
> >> > >> >>> Patrick
> >> > >> >>>
> >> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> >> > >> >>> <fp...@yahoo.com.invalid> wrote:
> >> > >> >>> > My take:
> >> > >> >>> >
> >> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can go
> in.
> >> > >> >>> > See
> >> > >> the
> >> > >> >>> jira for comments.
> >> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
> >> > >> >>> > - I'd rather not have the first alpha depending on ZK-1919
> and
> >> > >> ZK-1910,
> >> > >> >>> we can leave it for the second alpha.
> >> > >> >>> >
> >> > >> >>> > If you agree with this, then we should be able to cut a
> >> > >> >>> > candidate by
> >> > >> the
> >> > >> >>> end of this week.
> >> > >> >>> >
> >> > >> >>> > -Flavio
> >> > >> >>> >
> >> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org>
> >> wrote:
> >> > >> >>> >
> >> > >> >>> >> Per my previous note you can now see the c client test log
> >> > >> >>> >> output
> >> > >> here
> >> > >> >>> >> in the "build artifacts" section:
> >> > >> >>> >>
> >> > >> >>>
> >> > >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-
> >> > trunk
> >> > >> /2372/
> >> > >> >>> >>
> >> > >> >>> >> Patrick
> >> > >> >>> >>
> >> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> >> > >> >>> >> <ph...@apache.org>
> >> > >> wrote:
> >> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me
> >> > >> >>> >>> which
> >> > >> >>> >>> one(s?) is new?)
> >> > >> >>> >>>
> >> > >> >>> >>> Looks like the autoconf issue I reported is hitting the
> >> > >> >>> >>> upgraded apache jenkins instances as well. I've updated the
> >> > >> >>> >>> "archive" list
> >> > >> to
> >> > >> >>> >>> include the c tests stdout redirect. So while it won't go
> to
> >> > >> console
> >> > >> >>> >>> at least we can debug when there is a failure.
> >> > >> >>> >>>
> >> > >> >>> >>> Raul has been helping Bill with reviews for the jetty
> server
> >> > >> support
> >> > >> >>> >>> and it looks like that should be ready soon.
> >> > >> >>> >>>
> >> > >> >>> >>> Raul also requested that someone prioritize reviewing
> >> > >> "ZOOKEEPER-1919
> >> > >> >>> >>> Update the C implementation of removeWatches to have it
> >> > match
> >> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
> >> Flavio/Michi?
> >> > >> >>> >>>
> >> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
> >> > >> >>> >>> reconfig
> >> > >> test -
> >> > >> >>> >>> kudos on helping cleanup the build/test infra!
> >> > >> >>> >>>
> >> > >> >>> >>>
> >> > >> >>> >>> Based on previous comments it looks like we're pretty
> close.
> >> > >> >>> >>> Do
> >> > >> folks
> >> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a
> >> > >> >>> >>> few
> >> > >> pending
> >> > >> >>> >>> as above)
> >> > >> >>> >>>
> >> > >> >>> >>> Patrick
> >> > >> >>> >>>
> >> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
> >> > >> >>> >>> <rg...@itevenworks.net> wrote:
> >> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >> > >> >>> <fp...@yahoo.com.invalid>
> >> > >> >>> >>>> wrote:
> >> > >> >>> >>>>>
> >> > >> >>> >>>>> Just so that we don´t delay too much, what if we release
> an
> >> > >> >>> >>>>> alpha
> >> > >> >>> version
> >> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3 weeks
> time?
> >> > >> >>> >>>>>
> >> > >> >>> >>>>
> >> > >> >>> >>>> +1
> >> > >> >>> >>>>
> >> > >> >>> >>>> -rgs
> >> > >> >>> >>>>
> >> > >> >>> >>>>> -Flavio
> >> > >> >>> >>>>>
> >> > >> >>> >>>>>
> >> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez
> Segalés <
> >> > >> >>> >>>> rgs@itevenworks.net> wrote:
> >> > >> >>> >>>>>
> >> > >> >>> >>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org>
> >> > wrote:
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in
> the
> >> > >> >>> >>>>>>> last
> >> > >> >>> check).
> >> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is
> >> > >> threatening
> >> > >> >>> to
> >> > >> >>> >>>>>>> commit 1863. I see some great progress in general on
> the
> >> > >> >>> >>>>>>> patch availables queue, which is great to see.
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>> So here's something else we might consider - should we
> >> > >> >>> >>>>>>> drop
> >> > >> jdk6
> >> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I
> >> > >> >>> >>>>>>> suspect
> >> > >> some
> >> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move forward
> >> > >> >>> >>>>>>> though,
> >> > >> >>> can't
> >> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are
> currently
> >> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> >> > >> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>> Extra eyes/review for
> >> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >> > >> >>> >>>>>> would be appreciated (otherwise anyone using Observers
> >> > >> >>> >>>>>> with the
> >> > >> >>> upcoming
> >> > >> >>> >>>>>> alpha release will see there network usage go wild...).
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>> -rgs
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>> Patrick
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> >> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already, but I
> >> > >> >>> >>>>>>>> need a +1
> >> > >> >>> >>>> there. I
> >> > >> >>> >>>>>>> think Michi hasn't checked in because LETest failed in
> >> > >> >>> >>>>>>> the
> >> > >> last QA
> >> > >> >>> run
> >> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest, and
> in
> >> > >> >>> >>>>>>> fact
> >> > >> it
> >> > >> >>> fails
> >> > >> >>> >>>> in
> >> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't seem
> to
> >> > >> >>> >>>>>>> be
> >> > >> >>> related
> >> > >> >>> >>>> to the
> >> > >> >>> >>>>>>> patch.
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything
> >> > >> >>> >>>>>>>> concrete
> >> > >> about
> >> > >> >>> it.
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>> -Flavio
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
> >> > >> phunt@apache.org>
> >> > >> >>> >>>> wrote:
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> >> > >> reviewed/finalized
> >> > >> >>> before
> >> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an RC
> >> > soonish...
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>> Patrick
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> >> > >> >>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in.
> >> > >> >>> >>>>>>>>>> They are
> >> > >> both
> >> > >> >>> >>>> patch
> >> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still open
> >> > >> >>> >>>>>>> because we
> >> > >> >>> need a
> >> > >> >>> >>>> 3.4
> >> > >> >>> >>>>>>> patch.
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>> -Flavio
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
> >> > >> >>> >>>>>>>>>> <ph...@apache.org>
> >> > >> >>> wrote:
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a
> while, a
> >> > >> >>> >>>>>>>>>>> few
> >> > >> >>> people
> >> > >> >>> >>>> have
> >> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
> >> > >> >>> >>>>>>>>>>> personally
> >> > >> that
> >> > >> >>> they
> >> > >> >>> >>>>>>>>>>> would like to see some progress on the first 3.5
> >> > release.
> >> > >> Every
> >> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for perfection
> >> > >> >>> >>>>>>>>>>> we'll
> >> > >> never
> >> > >> >>> get
> >> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new
> >> > >> >>> >>>>>>>>>>> features,
> >> > >> >>> lots of
> >> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so that
> >> > >> >>> >>>>>>>>>>> folks can
> >> > >> use
> >> > >> >>> it,
> >> > >> >>> >>>>>>>>>>> test it, and give feedback.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the
> >> > >> >>> >>>>>>>>>>> known
> >> > >> >>> flakey
> >> > >> >>> >>>> test
> >> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
> >> > trunk.
> >> > >> >>> >>>>>>>>>>> Note
> >> > >> that
> >> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7
> and
> >> > jdk8.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan our
> >> > >> releases:
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable
> for
> >> > >> >>> >>>>>>>>>>> some
> >> > >> time.
> >> > >> >>> >>>> What I
> >> > >> >>> >>>>>>>>>>> think we should do instead is similar to what we
> did
> >> > >> >>> >>>>>>>>>>> for
> >> > >> 3.4.
> >> > >> >>> >>>> (this is
> >> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their
> Hadoop 2
> >> > >> release
> >> > >> >>> >>>> cycle)
> >> > >> >>> >>>>>>>>>>> Start with a series of alpha releases, something
> >> > >> >>> >>>>>>>>>>> people
> >> > >> can run
> >> > >> >>> >>>> and
> >> > >> >>> >>>>>>>>>>> test with, once we address all the blockers and
> feel
> >> > >> >>> comfortable
> >> > >> >>> >>>> with
> >> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta.
> >> > >> >>> >>>>>>>>>>> Once we
> >> > >> get
> >> > >> >>> >>>> some
> >> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
> >> > and
> >> > >> >>> >>>>>>>>>>> look at
> >> > >> >>> making
> >> > >> >>> >>>> it
> >> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become the
> >> > >> >>> "current/stable"
> >> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> e.g.
> >> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3 blockers)
> >> > >> >>> >>>>>>>>>>> 3.5.2-alpha (0 blockers) 3.5.3-beta (apis locked)
> >> > >> >>> >>>>>>>>>>> 3.5.4-beta 3.5.5-beta
> >> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not
> >> > >> "stable" vs
> >> > >> >>> >>>> 3.4.x,
> >> > >> >>> >>>>>>>>>>> maybe use it for production but we still expect
> >> > >> >>> >>>>>>>>>>> things to
> >> > >> shake
> >> > >> >>> >>>> out)
> >> > >> >>> >>>>>>>>>>> 3.5.7
> >> > >> >>> >>>>>>>>>>> ....
> >> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for
> production
> >> > >> >>> >>>>>>>>>>> use,
> >> > >> >>> stable,
> >> > >> >>> >>>>>>> etc...
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of these
> >> > >> >>> >>>>>>>>>>> something
> >> > >> that
> >> > >> >>> >>>> should
> >> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If
> >> > >> >>> >>>>>>>>>>> folks
> >> > >> find
> >> > >> >>> >>>> this a
> >> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut
> an
> >> RC.
> >> > >> >>> >>>>>>>>>>>
> >> > >> >>> >>>>>>>>>>> Patrick
> >> > >> >>> >>>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>>>
> >> > >> >>> >>>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >>>>>>
> >> > >> >>> >
> >> > >> >>>
> >> > >>
> >>
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
I'm seeing alot of test failures in
testCurrentObserverIsParticipantInNewConfig could someone take a look?
Seems related to ZOOKEEPER-1807 recent commit.

https://issues.apache.org/jira/browse/ZOOKEEPER-1807?focusedCommentId=14069024&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14069024

Patrick

On Mon, Jul 21, 2014 at 11:12 AM, Rakesh Radhakrishnan
<ra...@gmail.com> wrote:
> lgtm +1
>
>
> On Mon, Jul 21, 2014 at 11:37 PM, FPJ <fp...@yahoo.com.invalid> wrote:
>
>> +1 for having an RC this week. Since this is an alpha release, I think 72
>> biz hours is enough for the vote.
>>
>> -Flavio
>>
>> > -----Original Message-----
>> > From: Patrick Hunt [mailto:phunt@apache.org]
>> > Sent: 21 July 2014 18:55
>> > To: DevZooKeeper
>> > Subject: Re: ZooKeeper 3.5.0-alpha planning
>> >
>> > I fixed a number of issues. I also started a few threads with builds@
>> > - the ulimit issue is still outstanding. Hongchao and I worked through a
>> > number of findbugs issues, it's not closed yet but it's pretty close.
>> >
>> > I don't see why we can't create an RC and start voting this week though.
>> > Anyone disagree?
>> >
>> > How long should we let the vote run, the std 72 biz hours or should we
>> plan
>> > for more to allow folks more time to test?
>> >
>> > Patrick
>> >
>> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
>> > <rg...@itevenworks.net> wrote:
>> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
>> > >
>> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm trying
>> > >> to fix some of the jobs that were broken during the recent host
>> > >> upgrade.
>> > >>
>> > >
>> > > How are things looking? Is it likely that we can have a 3.5.0 alpha
>> > > release week or are we still blocked on Jenkins?
>> > >
>> > >
>> > > -rgs
>> > >
>> > >
>> > >
>> > >
>> > >
>> > >
>> > >> Patrick
>> > >>
>> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
>> > >> <mi...@cs.stanford.edu>
>> > >> wrote:
>> > >> > I'll check in ZOOKEEPER-1683.
>> > >> >
>> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
>> > >> > <sh...@gmail.com>
>> > >> wrote:
>> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all
>> > >> subsequent
>> > >> >> changes were formatting as suggested by Rakesh.
>> > >> >>
>> > >> >>
>> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org>
>> > wrote:
>> > >> >>
>> > >> >>> I'm concerned that the CI tests are all failing due to, for e.g.
>> > >> >>> findbugs issues. At the very least our build/test/ci should be
>> > >> >>> pretty clean - some flakeys is ok (the recent startServer fix and
>> > >> >>> some other flakeys that have been addressed go a long way on that
>> > >> >>> issue) but I think the findbugs problem should be cleaned up
>> > >> >>> before we cut a release. I started a separate thread to discuss
>> the
>> > findbugs issue.
>> > >> >>>
>> > >> >>> Otw we seem to be in ok shape - 1863 is in.
>> > >> >>>
>> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
>> > >> >>>
>> > >> >>> Patrick
>> > >> >>>
>> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> > >> >>> <fp...@yahoo.com.invalid> wrote:
>> > >> >>> > My take:
>> > >> >>> >
>> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can go in.
>> > >> >>> > See
>> > >> the
>> > >> >>> jira for comments.
>> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
>> > >> >>> > - I'd rather not have the first alpha depending on ZK-1919 and
>> > >> ZK-1910,
>> > >> >>> we can leave it for the second alpha.
>> > >> >>> >
>> > >> >>> > If you agree with this, then we should be able to cut a
>> > >> >>> > candidate by
>> > >> the
>> > >> >>> end of this week.
>> > >> >>> >
>> > >> >>> > -Flavio
>> > >> >>> >
>> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org>
>> wrote:
>> > >> >>> >
>> > >> >>> >> Per my previous note you can now see the c client test log
>> > >> >>> >> output
>> > >> here
>> > >> >>> >> in the "build artifacts" section:
>> > >> >>> >>
>> > >> >>>
>> > >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-
>> > trunk
>> > >> /2372/
>> > >> >>> >>
>> > >> >>> >> Patrick
>> > >> >>> >>
>> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
>> > >> >>> >> <ph...@apache.org>
>> > >> wrote:
>> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me
>> > >> >>> >>> which
>> > >> >>> >>> one(s?) is new?)
>> > >> >>> >>>
>> > >> >>> >>> Looks like the autoconf issue I reported is hitting the
>> > >> >>> >>> upgraded apache jenkins instances as well. I've updated the
>> > >> >>> >>> "archive" list
>> > >> to
>> > >> >>> >>> include the c tests stdout redirect. So while it won't go to
>> > >> console
>> > >> >>> >>> at least we can debug when there is a failure.
>> > >> >>> >>>
>> > >> >>> >>> Raul has been helping Bill with reviews for the jetty server
>> > >> support
>> > >> >>> >>> and it looks like that should be ready soon.
>> > >> >>> >>>
>> > >> >>> >>> Raul also requested that someone prioritize reviewing
>> > >> "ZOOKEEPER-1919
>> > >> >>> >>> Update the C implementation of removeWatches to have it
>> > match
>> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
>> Flavio/Michi?
>> > >> >>> >>>
>> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
>> > >> >>> >>> reconfig
>> > >> test -
>> > >> >>> >>> kudos on helping cleanup the build/test infra!
>> > >> >>> >>>
>> > >> >>> >>>
>> > >> >>> >>> Based on previous comments it looks like we're pretty close.
>> > >> >>> >>> Do
>> > >> folks
>> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a
>> > >> >>> >>> few
>> > >> pending
>> > >> >>> >>> as above)
>> > >> >>> >>>
>> > >> >>> >>> Patrick
>> > >> >>> >>>
>> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
>> > >> >>> >>> <rg...@itevenworks.net> wrote:
>> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> > >> >>> <fp...@yahoo.com.invalid>
>> > >> >>> >>>> wrote:
>> > >> >>> >>>>>
>> > >> >>> >>>>> Just so that we don´t delay too much, what if we release an
>> > >> >>> >>>>> alpha
>> > >> >>> version
>> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
>> > >> >>> >>>>>
>> > >> >>> >>>>
>> > >> >>> >>>> +1
>> > >> >>> >>>>
>> > >> >>> >>>> -rgs
>> > >> >>> >>>>
>> > >> >>> >>>>> -Flavio
>> > >> >>> >>>>>
>> > >> >>> >>>>>
>> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
>> > >> >>> >>>> rgs@itevenworks.net> wrote:
>> > >> >>> >>>>>
>> > >> >>> >>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org>
>> > wrote:
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the
>> > >> >>> >>>>>>> last
>> > >> >>> check).
>> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is
>> > >> threatening
>> > >> >>> to
>> > >> >>> >>>>>>> commit 1863. I see some great progress in general on the
>> > >> >>> >>>>>>> patch availables queue, which is great to see.
>> > >> >>> >>>>>>>
>> > >> >>> >>>>>>> So here's something else we might consider - should we
>> > >> >>> >>>>>>> drop
>> > >> jdk6
>> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I
>> > >> >>> >>>>>>> suspect
>> > >> some
>> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move forward
>> > >> >>> >>>>>>> though,
>> > >> >>> can't
>> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are currently
>> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>> > >> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> > >> >>> >>>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>> Extra eyes/review for
>> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> > >> >>> >>>>>> would be appreciated (otherwise anyone using Observers
>> > >> >>> >>>>>> with the
>> > >> >>> upcoming
>> > >> >>> >>>>>> alpha release will see there network usage go wild...).
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>> -rgs
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>> Patrick
>> > >> >>> >>>>>>>
>> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already, but I
>> > >> >>> >>>>>>>> need a +1
>> > >> >>> >>>> there. I
>> > >> >>> >>>>>>> think Michi hasn't checked in because LETest failed in
>> > >> >>> >>>>>>> the
>> > >> last QA
>> > >> >>> run
>> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest, and in
>> > >> >>> >>>>>>> fact
>> > >> it
>> > >> >>> fails
>> > >> >>> >>>> in
>> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't seem to
>> > >> >>> >>>>>>> be
>> > >> >>> related
>> > >> >>> >>>> to the
>> > >> >>> >>>>>>> patch.
>> > >> >>> >>>>>>>>
>> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything
>> > >> >>> >>>>>>>> concrete
>> > >> about
>> > >> >>> it.
>> > >> >>> >>>>>>>>
>> > >> >>> >>>>>>>> -Flavio
>> > >> >>> >>>>>>>>
>> > >> >>> >>>>>>>>
>> > >> >>> >>>>>>>>
>> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
>> > >> phunt@apache.org>
>> > >> >>> >>>> wrote:
>> > >> >>> >>>>>>>>
>> > >> >>> >>>>>>>>
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
>> > >> reviewed/finalized
>> > >> >>> before
>> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an RC
>> > soonish...
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>>> Patrick
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>> > >> >>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
>> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>> > >> >>> >>>>>>>>>>
>> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in.
>> > >> >>> >>>>>>>>>> They are
>> > >> both
>> > >> >>> >>>> patch
>> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still open
>> > >> >>> >>>>>>> because we
>> > >> >>> need a
>> > >> >>> >>>> 3.4
>> > >> >>> >>>>>>> patch.
>> > >> >>> >>>>>>>>>>
>> > >> >>> >>>>>>>>>> -Flavio
>> > >> >>> >>>>>>>>>>
>> > >> >>> >>>>>>>>>>
>> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
>> > >> >>> >>>>>>>>>> <ph...@apache.org>
>> > >> >>> wrote:
>> > >> >>> >>>>>>>>>>
>> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a
>> > >> >>> >>>>>>>>>>> few
>> > >> >>> people
>> > >> >>> >>>> have
>> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
>> > >> >>> >>>>>>>>>>> personally
>> > >> that
>> > >> >>> they
>> > >> >>> >>>>>>>>>>> would like to see some progress on the first 3.5
>> > release.
>> > >> Every
>> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for perfection
>> > >> >>> >>>>>>>>>>> we'll
>> > >> never
>> > >> >>> get
>> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new
>> > >> >>> >>>>>>>>>>> features,
>> > >> >>> lots of
>> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so that
>> > >> >>> >>>>>>>>>>> folks can
>> > >> use
>> > >> >>> it,
>> > >> >>> >>>>>>>>>>> test it, and give feedback.
>> > >> >>> >>>>>>>>>>>
>> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the
>> > >> >>> >>>>>>>>>>> known
>> > >> >>> flakey
>> > >> >>> >>>> test
>> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
>> > trunk.
>> > >> >>> >>>>>>>>>>> Note
>> > >> that
>> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and
>> > jdk8.
>> > >> >>> >>>>>>>>>>>
>> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan our
>> > >> releases:
>> > >> >>> >>>>>>>>>>>
>> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for
>> > >> >>> >>>>>>>>>>> some
>> > >> time.
>> > >> >>> >>>> What I
>> > >> >>> >>>>>>>>>>> think we should do instead is similar to what we did
>> > >> >>> >>>>>>>>>>> for
>> > >> 3.4.
>> > >> >>> >>>> (this is
>> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2
>> > >> release
>> > >> >>> >>>> cycle)
>> > >> >>> >>>>>>>>>>> Start with a series of alpha releases, something
>> > >> >>> >>>>>>>>>>> people
>> > >> can run
>> > >> >>> >>>> and
>> > >> >>> >>>>>>>>>>> test with, once we address all the blockers and feel
>> > >> >>> comfortable
>> > >> >>> >>>> with
>> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta.
>> > >> >>> >>>>>>>>>>> Once we
>> > >> get
>> > >> >>> >>>> some
>> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
>> > and
>> > >> >>> >>>>>>>>>>> look at
>> > >> >>> making
>> > >> >>> >>>> it
>> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become the
>> > >> >>> "current/stable"
>> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> > >> >>> >>>>>>>>>>>
>> > >> >>> >>>>>>>>>>> e.g.
>> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3 blockers)
>> > >> >>> >>>>>>>>>>> 3.5.2-alpha (0 blockers) 3.5.3-beta (apis locked)
>> > >> >>> >>>>>>>>>>> 3.5.4-beta 3.5.5-beta
>> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not
>> > >> "stable" vs
>> > >> >>> >>>> 3.4.x,
>> > >> >>> >>>>>>>>>>> maybe use it for production but we still expect
>> > >> >>> >>>>>>>>>>> things to
>> > >> shake
>> > >> >>> >>>> out)
>> > >> >>> >>>>>>>>>>> 3.5.7
>> > >> >>> >>>>>>>>>>> ....
>> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production
>> > >> >>> >>>>>>>>>>> use,
>> > >> >>> stable,
>> > >> >>> >>>>>>> etc...
>> > >> >>> >>>>>>>>>>>
>> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of these
>> > >> >>> >>>>>>>>>>> something
>> > >> that
>> > >> >>> >>>> should
>> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> > >> >>> >>>>>>>>>>>
>> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If
>> > >> >>> >>>>>>>>>>> folks
>> > >> find
>> > >> >>> >>>> this a
>> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an
>> RC.
>> > >> >>> >>>>>>>>>>>
>> > >> >>> >>>>>>>>>>> Patrick
>> > >> >>> >>>>>>>>>>
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>>>
>> > >> >>> >>>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >>>>>>
>> > >> >>> >
>> > >> >>>
>> > >>
>>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Rakesh Radhakrishnan <ra...@gmail.com>.
lgtm +1


On Mon, Jul 21, 2014 at 11:37 PM, FPJ <fp...@yahoo.com.invalid> wrote:

> +1 for having an RC this week. Since this is an alpha release, I think 72
> biz hours is enough for the vote.
>
> -Flavio
>
> > -----Original Message-----
> > From: Patrick Hunt [mailto:phunt@apache.org]
> > Sent: 21 July 2014 18:55
> > To: DevZooKeeper
> > Subject: Re: ZooKeeper 3.5.0-alpha planning
> >
> > I fixed a number of issues. I also started a few threads with builds@
> > - the ulimit issue is still outstanding. Hongchao and I worked through a
> > number of findbugs issues, it's not closed yet but it's pretty close.
> >
> > I don't see why we can't create an RC and start voting this week though.
> > Anyone disagree?
> >
> > How long should we let the vote run, the std 72 biz hours or should we
> plan
> > for more to allow folks more time to test?
> >
> > Patrick
> >
> > On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
> > <rg...@itevenworks.net> wrote:
> > > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
> > >
> > >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm trying
> > >> to fix some of the jobs that were broken during the recent host
> > >> upgrade.
> > >>
> > >
> > > How are things looking? Is it likely that we can have a 3.5.0 alpha
> > > release week or are we still blocked on Jenkins?
> > >
> > >
> > > -rgs
> > >
> > >
> > >
> > >
> > >
> > >
> > >> Patrick
> > >>
> > >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> > >> <mi...@cs.stanford.edu>
> > >> wrote:
> > >> > I'll check in ZOOKEEPER-1683.
> > >> >
> > >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> > >> > <sh...@gmail.com>
> > >> wrote:
> > >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all
> > >> subsequent
> > >> >> changes were formatting as suggested by Rakesh.
> > >> >>
> > >> >>
> > >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org>
> > wrote:
> > >> >>
> > >> >>> I'm concerned that the CI tests are all failing due to, for e.g.
> > >> >>> findbugs issues. At the very least our build/test/ci should be
> > >> >>> pretty clean - some flakeys is ok (the recent startServer fix and
> > >> >>> some other flakeys that have been addressed go a long way on that
> > >> >>> issue) but I think the findbugs problem should be cleaned up
> > >> >>> before we cut a release. I started a separate thread to discuss
> the
> > findbugs issue.
> > >> >>>
> > >> >>> Otw we seem to be in ok shape - 1863 is in.
> > >> >>>
> > >> >>> Anyone have a chance to give feedback to Raul on 1919?
> > >> >>>
> > >> >>> Patrick
> > >> >>>
> > >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> > >> >>> <fp...@yahoo.com.invalid> wrote:
> > >> >>> > My take:
> > >> >>> >
> > >> >>> > - ZK-1863 is pending review. It is a blocker and it can go in.
> > >> >>> > See
> > >> the
> > >> >>> jira for comments.
> > >> >>> > - We can try to have ZK-1807 in for the first alpha.
> > >> >>> > - I'd rather not have the first alpha depending on ZK-1919 and
> > >> ZK-1910,
> > >> >>> we can leave it for the second alpha.
> > >> >>> >
> > >> >>> > If you agree with this, then we should be able to cut a
> > >> >>> > candidate by
> > >> the
> > >> >>> end of this week.
> > >> >>> >
> > >> >>> > -Flavio
> > >> >>> >
> > >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org>
> wrote:
> > >> >>> >
> > >> >>> >> Per my previous note you can now see the c client test log
> > >> >>> >> output
> > >> here
> > >> >>> >> in the "build artifacts" section:
> > >> >>> >>
> > >> >>>
> > >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-
> > trunk
> > >> /2372/
> > >> >>> >>
> > >> >>> >> Patrick
> > >> >>> >>
> > >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> > >> >>> >> <ph...@apache.org>
> > >> wrote:
> > >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me
> > >> >>> >>> which
> > >> >>> >>> one(s?) is new?)
> > >> >>> >>>
> > >> >>> >>> Looks like the autoconf issue I reported is hitting the
> > >> >>> >>> upgraded apache jenkins instances as well. I've updated the
> > >> >>> >>> "archive" list
> > >> to
> > >> >>> >>> include the c tests stdout redirect. So while it won't go to
> > >> console
> > >> >>> >>> at least we can debug when there is a failure.
> > >> >>> >>>
> > >> >>> >>> Raul has been helping Bill with reviews for the jetty server
> > >> support
> > >> >>> >>> and it looks like that should be ready soon.
> > >> >>> >>>
> > >> >>> >>> Raul also requested that someone prioritize reviewing
> > >> "ZOOKEEPER-1919
> > >> >>> >>> Update the C implementation of removeWatches to have it
> > match
> > >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0.
> Flavio/Michi?
> > >> >>> >>>
> > >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
> > >> >>> >>> reconfig
> > >> test -
> > >> >>> >>> kudos on helping cleanup the build/test infra!
> > >> >>> >>>
> > >> >>> >>>
> > >> >>> >>> Based on previous comments it looks like we're pretty close.
> > >> >>> >>> Do
> > >> folks
> > >> >>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a
> > >> >>> >>> few
> > >> pending
> > >> >>> >>> as above)
> > >> >>> >>>
> > >> >>> >>> Patrick
> > >> >>> >>>
> > >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
> > >> >>> >>> <rg...@itevenworks.net> wrote:
> > >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> > >> >>> <fp...@yahoo.com.invalid>
> > >> >>> >>>> wrote:
> > >> >>> >>>>>
> > >> >>> >>>>> Just so that we don´t delay too much, what if we release an
> > >> >>> >>>>> alpha
> > >> >>> version
> > >> >>> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
> > >> >>> >>>>>
> > >> >>> >>>>
> > >> >>> >>>> +1
> > >> >>> >>>>
> > >> >>> >>>> -rgs
> > >> >>> >>>>
> > >> >>> >>>>> -Flavio
> > >> >>> >>>>>
> > >> >>> >>>>>
> > >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
> > >> >>> >>>> rgs@itevenworks.net> wrote:
> > >> >>> >>>>>
> > >> >>> >>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org>
> > wrote:
> > >> >>> >>>>>>
> > >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the
> > >> >>> >>>>>>> last
> > >> >>> check).
> > >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is
> > >> threatening
> > >> >>> to
> > >> >>> >>>>>>> commit 1863. I see some great progress in general on the
> > >> >>> >>>>>>> patch availables queue, which is great to see.
> > >> >>> >>>>>>>
> > >> >>> >>>>>>> So here's something else we might consider - should we
> > >> >>> >>>>>>> drop
> > >> jdk6
> > >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I
> > >> >>> >>>>>>> suspect
> > >> some
> > >> >>> >>>>>>> folks are still using ZK with 6. We gotta move forward
> > >> >>> >>>>>>> though,
> > >> >>> can't
> > >> >>> >>>>>>> support it forever. Thoughts? Note that we are currently
> > >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> > >> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> > >> >>> >>>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>> Extra eyes/review for
> > >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> > >> >>> >>>>>> would be appreciated (otherwise anyone using Observers
> > >> >>> >>>>>> with the
> > >> >>> upcoming
> > >> >>> >>>>>> alpha release will see there network usage go wild...).
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>> -rgs
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>>> Patrick
> > >> >>> >>>>>>>
> > >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> > >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> > >> >>> >>>>>>>> According to me, ZK-1810 should be in already, but I
> > >> >>> >>>>>>>> need a +1
> > >> >>> >>>> there. I
> > >> >>> >>>>>>> think Michi hasn't checked in because LETest failed in
> > >> >>> >>>>>>> the
> > >> last QA
> > >> >>> run
> > >> >>> >>>>>>> there. However, that patch doesn't affect LETest, and in
> > >> >>> >>>>>>> fact
> > >> it
> > >> >>> fails
> > >> >>> >>>> in
> > >> >>> >>>>>>> trunk intermittently, so the test failure doesn't seem to
> > >> >>> >>>>>>> be
> > >> >>> related
> > >> >>> >>>> to the
> > >> >>> >>>>>>> patch.
> > >> >>> >>>>>>>>
> > >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything
> > >> >>> >>>>>>>> concrete
> > >> about
> > >> >>> it.
> > >> >>> >>>>>>>>
> > >> >>> >>>>>>>> -Flavio
> > >> >>> >>>>>>>>
> > >> >>> >>>>>>>>
> > >> >>> >>>>>>>>
> > >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
> > >> phunt@apache.org>
> > >> >>> >>>> wrote:
> > >> >>> >>>>>>>>
> > >> >>> >>>>>>>>
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> > >> reviewed/finalized
> > >> >>> before
> > >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an RC
> > soonish...
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>>> Patrick
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> > >> >>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
> > >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> > >> >>> >>>>>>>>>>
> > >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in.
> > >> >>> >>>>>>>>>> They are
> > >> both
> > >> >>> >>>> patch
> > >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still open
> > >> >>> >>>>>>> because we
> > >> >>> need a
> > >> >>> >>>> 3.4
> > >> >>> >>>>>>> patch.
> > >> >>> >>>>>>>>>>
> > >> >>> >>>>>>>>>> -Flavio
> > >> >>> >>>>>>>>>>
> > >> >>> >>>>>>>>>>
> > >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
> > >> >>> >>>>>>>>>> <ph...@apache.org>
> > >> >>> wrote:
> > >> >>> >>>>>>>>>>
> > >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a
> > >> >>> >>>>>>>>>>> few
> > >> >>> people
> > >> >>> >>>> have
> > >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
> > >> >>> >>>>>>>>>>> personally
> > >> that
> > >> >>> they
> > >> >>> >>>>>>>>>>> would like to see some progress on the first 3.5
> > release.
> > >> Every
> > >> >>> >>>>>>>>>>> release is a compromise, if we wait for perfection
> > >> >>> >>>>>>>>>>> we'll
> > >> never
> > >> >>> get
> > >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new
> > >> >>> >>>>>>>>>>> features,
> > >> >>> lots of
> > >> >>> >>>>>>>>>>> hard work, let's get it out in a release so that
> > >> >>> >>>>>>>>>>> folks can
> > >> use
> > >> >>> it,
> > >> >>> >>>>>>>>>>> test it, and give feedback.
> > >> >>> >>>>>>>>>>>
> > >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the
> > >> >>> >>>>>>>>>>> known
> > >> >>> flakey
> > >> >>> >>>> test
> > >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
> > trunk.
> > >> >>> >>>>>>>>>>> Note
> > >> that
> > >> >>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and
> > jdk8.
> > >> >>> >>>>>>>>>>>
> > >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan our
> > >> releases:
> > >> >>> >>>>>>>>>>>
> > >> >>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for
> > >> >>> >>>>>>>>>>> some
> > >> time.
> > >> >>> >>>> What I
> > >> >>> >>>>>>>>>>> think we should do instead is similar to what we did
> > >> >>> >>>>>>>>>>> for
> > >> 3.4.
> > >> >>> >>>> (this is
> > >> >>> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2
> > >> release
> > >> >>> >>>> cycle)
> > >> >>> >>>>>>>>>>> Start with a series of alpha releases, something
> > >> >>> >>>>>>>>>>> people
> > >> can run
> > >> >>> >>>> and
> > >> >>> >>>>>>>>>>> test with, once we address all the blockers and feel
> > >> >>> comfortable
> > >> >>> >>>> with
> > >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta.
> > >> >>> >>>>>>>>>>> Once we
> > >> get
> > >> >>> >>>> some
> > >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
> > and
> > >> >>> >>>>>>>>>>> look at
> > >> >>> making
> > >> >>> >>>> it
> > >> >>> >>>>>>>>>>> "stable'. At some later point it will become the
> > >> >>> "current/stable"
> > >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> > >> >>> >>>>>>>>>>>
> > >> >>> >>>>>>>>>>> e.g.
> > >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3 blockers)
> > >> >>> >>>>>>>>>>> 3.5.2-alpha (0 blockers) 3.5.3-beta (apis locked)
> > >> >>> >>>>>>>>>>> 3.5.4-beta 3.5.5-beta
> > >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not
> > >> "stable" vs
> > >> >>> >>>> 3.4.x,
> > >> >>> >>>>>>>>>>> maybe use it for production but we still expect
> > >> >>> >>>>>>>>>>> things to
> > >> shake
> > >> >>> >>>> out)
> > >> >>> >>>>>>>>>>> 3.5.7
> > >> >>> >>>>>>>>>>> ....
> > >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production
> > >> >>> >>>>>>>>>>> use,
> > >> >>> stable,
> > >> >>> >>>>>>> etc...
> > >> >>> >>>>>>>>>>>
> > >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of these
> > >> >>> >>>>>>>>>>> something
> > >> that
> > >> >>> >>>> should
> > >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> > >> >>> >>>>>>>>>>>
> > >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If
> > >> >>> >>>>>>>>>>> folks
> > >> find
> > >> >>> >>>> this a
> > >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an
> RC.
> > >> >>> >>>>>>>>>>>
> > >> >>> >>>>>>>>>>> Patrick
> > >> >>> >>>>>>>>>>
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>>>
> > >> >>> >>>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >>>>>>
> > >> >>> >
> > >> >>>
> > >>
>
>

RE: ZooKeeper 3.5.0-alpha planning

Posted by FPJ <fp...@yahoo.com.INVALID>.
+1 for having an RC this week. Since this is an alpha release, I think 72 biz hours is enough for the vote. 

-Flavio

> -----Original Message-----
> From: Patrick Hunt [mailto:phunt@apache.org]
> Sent: 21 July 2014 18:55
> To: DevZooKeeper
> Subject: Re: ZooKeeper 3.5.0-alpha planning
> 
> I fixed a number of issues. I also started a few threads with builds@
> - the ulimit issue is still outstanding. Hongchao and I worked through a
> number of findbugs issues, it's not closed yet but it's pretty close.
> 
> I don't see why we can't create an RC and start voting this week though.
> Anyone disagree?
> 
> How long should we let the vote run, the std 72 biz hours or should we plan
> for more to allow folks more time to test?
> 
> Patrick
> 
> On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
> <rg...@itevenworks.net> wrote:
> > On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
> >
> >> You may notice some back/forth on Apache Jenkins ZK jobs - I'm trying
> >> to fix some of the jobs that were broken during the recent host
> >> upgrade.
> >>
> >
> > How are things looking? Is it likely that we can have a 3.5.0 alpha
> > release week or are we still blocked on Jenkins?
> >
> >
> > -rgs
> >
> >
> >
> >
> >
> >
> >> Patrick
> >>
> >> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki
> >> <mi...@cs.stanford.edu>
> >> wrote:
> >> > I'll check in ZOOKEEPER-1683.
> >> >
> >> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer
> >> > <sh...@gmail.com>
> >> wrote:
> >> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all
> >> subsequent
> >> >> changes were formatting as suggested by Rakesh.
> >> >>
> >> >>
> >> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org>
> wrote:
> >> >>
> >> >>> I'm concerned that the CI tests are all failing due to, for e.g.
> >> >>> findbugs issues. At the very least our build/test/ci should be
> >> >>> pretty clean - some flakeys is ok (the recent startServer fix and
> >> >>> some other flakeys that have been addressed go a long way on that
> >> >>> issue) but I think the findbugs problem should be cleaned up
> >> >>> before we cut a release. I started a separate thread to discuss the
> findbugs issue.
> >> >>>
> >> >>> Otw we seem to be in ok shape - 1863 is in.
> >> >>>
> >> >>> Anyone have a chance to give feedback to Raul on 1919?
> >> >>>
> >> >>> Patrick
> >> >>>
> >> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> >> >>> <fp...@yahoo.com.invalid> wrote:
> >> >>> > My take:
> >> >>> >
> >> >>> > - ZK-1863 is pending review. It is a blocker and it can go in.
> >> >>> > See
> >> the
> >> >>> jira for comments.
> >> >>> > - We can try to have ZK-1807 in for the first alpha.
> >> >>> > - I'd rather not have the first alpha depending on ZK-1919 and
> >> ZK-1910,
> >> >>> we can leave it for the second alpha.
> >> >>> >
> >> >>> > If you agree with this, then we should be able to cut a
> >> >>> > candidate by
> >> the
> >> >>> end of this week.
> >> >>> >
> >> >>> > -Flavio
> >> >>> >
> >> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:
> >> >>> >
> >> >>> >> Per my previous note you can now see the c client test log
> >> >>> >> output
> >> here
> >> >>> >> in the "build artifacts" section:
> >> >>> >>
> >> >>>
> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-
> trunk
> >> /2372/
> >> >>> >>
> >> >>> >> Patrick
> >> >>> >>
> >> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt
> >> >>> >> <ph...@apache.org>
> >> wrote:
> >> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me
> >> >>> >>> which
> >> >>> >>> one(s?) is new?)
> >> >>> >>>
> >> >>> >>> Looks like the autoconf issue I reported is hitting the
> >> >>> >>> upgraded apache jenkins instances as well. I've updated the
> >> >>> >>> "archive" list
> >> to
> >> >>> >>> include the c tests stdout redirect. So while it won't go to
> >> console
> >> >>> >>> at least we can debug when there is a failure.
> >> >>> >>>
> >> >>> >>> Raul has been helping Bill with reviews for the jetty server
> >> support
> >> >>> >>> and it looks like that should be ready soon.
> >> >>> >>>
> >> >>> >>> Raul also requested that someone prioritize reviewing
> >> "ZOOKEEPER-1919
> >> >>> >>> Update the C implementation of removeWatches to have it
> match
> >> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
> >> >>> >>>
> >> >>> >>> Hongchao got a patch in to cleanup the flakey c client
> >> >>> >>> reconfig
> >> test -
> >> >>> >>> kudos on helping cleanup the build/test infra!
> >> >>> >>>
> >> >>> >>>
> >> >>> >>> Based on previous comments it looks like we're pretty close.
> >> >>> >>> Do
> >> folks
> >> >>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a
> >> >>> >>> few
> >> pending
> >> >>> >>> as above)
> >> >>> >>>
> >> >>> >>> Patrick
> >> >>> >>>
> >> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
> >> >>> >>> <rg...@itevenworks.net> wrote:
> >> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >> >>> <fp...@yahoo.com.invalid>
> >> >>> >>>> wrote:
> >> >>> >>>>>
> >> >>> >>>>> Just so that we don´t delay too much, what if we release an
> >> >>> >>>>> alpha
> >> >>> version
> >> >>> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
> >> >>> >>>>>
> >> >>> >>>>
> >> >>> >>>> +1
> >> >>> >>>>
> >> >>> >>>> -rgs
> >> >>> >>>>
> >> >>> >>>>> -Flavio
> >> >>> >>>>>
> >> >>> >>>>>
> >> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
> >> >>> >>>> rgs@itevenworks.net> wrote:
> >> >>> >>>>>
> >> >>> >>>>>
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org>
> wrote:
> >> >>> >>>>>>
> >> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the
> >> >>> >>>>>>> last
> >> >>> check).
> >> >>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is
> >> threatening
> >> >>> to
> >> >>> >>>>>>> commit 1863. I see some great progress in general on the
> >> >>> >>>>>>> patch availables queue, which is great to see.
> >> >>> >>>>>>>
> >> >>> >>>>>>> So here's something else we might consider - should we
> >> >>> >>>>>>> drop
> >> jdk6
> >> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I
> >> >>> >>>>>>> suspect
> >> some
> >> >>> >>>>>>> folks are still using ZK with 6. We gotta move forward
> >> >>> >>>>>>> though,
> >> >>> can't
> >> >>> >>>>>>> support it forever. Thoughts? Note that we are currently
> >> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> >> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >> >>> >>>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>> Extra eyes/review for
> >> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >> >>> >>>>>> would be appreciated (otherwise anyone using Observers
> >> >>> >>>>>> with the
> >> >>> upcoming
> >> >>> >>>>>> alpha release will see there network usage go wild...).
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>> -rgs
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>>> Patrick
> >> >>> >>>>>>>
> >> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> >> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> >>> >>>>>>>> According to me, ZK-1810 should be in already, but I
> >> >>> >>>>>>>> need a +1
> >> >>> >>>> there. I
> >> >>> >>>>>>> think Michi hasn't checked in because LETest failed in
> >> >>> >>>>>>> the
> >> last QA
> >> >>> run
> >> >>> >>>>>>> there. However, that patch doesn't affect LETest, and in
> >> >>> >>>>>>> fact
> >> it
> >> >>> fails
> >> >>> >>>> in
> >> >>> >>>>>>> trunk intermittently, so the test failure doesn't seem to
> >> >>> >>>>>>> be
> >> >>> related
> >> >>> >>>> to the
> >> >>> >>>>>>> patch.
> >> >>> >>>>>>>>
> >> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything
> >> >>> >>>>>>>> concrete
> >> about
> >> >>> it.
> >> >>> >>>>>>>>
> >> >>> >>>>>>>> -Flavio
> >> >>> >>>>>>>>
> >> >>> >>>>>>>>
> >> >>> >>>>>>>>
> >> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
> >> phunt@apache.org>
> >> >>> >>>> wrote:
> >> >>> >>>>>>>>
> >> >>> >>>>>>>>
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> >> reviewed/finalized
> >> >>> before
> >> >>> >>>>>>>>> the end of the week? I'd like to try cutting an RC
> soonish...
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>>> Patrick
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> >> >>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
> >> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> >> >>> >>>>>>>>>>
> >> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in.
> >> >>> >>>>>>>>>> They are
> >> both
> >> >>> >>>> patch
> >> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still open
> >> >>> >>>>>>> because we
> >> >>> need a
> >> >>> >>>> 3.4
> >> >>> >>>>>>> patch.
> >> >>> >>>>>>>>>>
> >> >>> >>>>>>>>>> -Flavio
> >> >>> >>>>>>>>>>
> >> >>> >>>>>>>>>>
> >> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt
> >> >>> >>>>>>>>>> <ph...@apache.org>
> >> >>> wrote:
> >> >>> >>>>>>>>>>
> >> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a
> >> >>> >>>>>>>>>>> few
> >> >>> people
> >> >>> >>>> have
> >> >>> >>>>>>>>>>> mentioned on the list as well as contacted me
> >> >>> >>>>>>>>>>> personally
> >> that
> >> >>> they
> >> >>> >>>>>>>>>>> would like to see some progress on the first 3.5
> release.
> >> Every
> >> >>> >>>>>>>>>>> release is a compromise, if we wait for perfection
> >> >>> >>>>>>>>>>> we'll
> >> never
> >> >>> get
> >> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new
> >> >>> >>>>>>>>>>> features,
> >> >>> lots of
> >> >>> >>>>>>>>>>> hard work, let's get it out in a release so that
> >> >>> >>>>>>>>>>> folks can
> >> use
> >> >>> it,
> >> >>> >>>>>>>>>>> test it, and give feedback.
> >> >>> >>>>>>>>>>>
> >> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the
> >> >>> >>>>>>>>>>> known
> >> >>> flakey
> >> >>> >>>> test
> >> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to
> trunk.
> >> >>> >>>>>>>>>>> Note
> >> that
> >> >>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and
> jdk8.
> >> >>> >>>>>>>>>>>
> >> >>> >>>>>>>>>>> Here's my thinking again on how we should plan our
> >> releases:
> >> >>> >>>>>>>>>>>
> >> >>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for
> >> >>> >>>>>>>>>>> some
> >> time.
> >> >>> >>>> What I
> >> >>> >>>>>>>>>>> think we should do instead is similar to what we did
> >> >>> >>>>>>>>>>> for
> >> 3.4.
> >> >>> >>>> (this is
> >> >>> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2
> >> release
> >> >>> >>>> cycle)
> >> >>> >>>>>>>>>>> Start with a series of alpha releases, something
> >> >>> >>>>>>>>>>> people
> >> can run
> >> >>> >>>> and
> >> >>> >>>>>>>>>>> test with, once we address all the blockers and feel
> >> >>> comfortable
> >> >>> >>>> with
> >> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta.
> >> >>> >>>>>>>>>>> Once we
> >> get
> >> >>> >>>> some
> >> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker
> and
> >> >>> >>>>>>>>>>> look at
> >> >>> making
> >> >>> >>>> it
> >> >>> >>>>>>>>>>> "stable'. At some later point it will become the
> >> >>> "current/stable"
> >> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >> >>> >>>>>>>>>>>
> >> >>> >>>>>>>>>>> e.g.
> >> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers) 3.5.1-alpha (3 blockers)
> >> >>> >>>>>>>>>>> 3.5.2-alpha (0 blockers) 3.5.3-beta (apis locked)
> >> >>> >>>>>>>>>>> 3.5.4-beta 3.5.5-beta
> >> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not
> >> "stable" vs
> >> >>> >>>> 3.4.x,
> >> >>> >>>>>>>>>>> maybe use it for production but we still expect
> >> >>> >>>>>>>>>>> things to
> >> shake
> >> >>> >>>> out)
> >> >>> >>>>>>>>>>> 3.5.7
> >> >>> >>>>>>>>>>> ....
> >> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production
> >> >>> >>>>>>>>>>> use,
> >> >>> stable,
> >> >>> >>>>>>> etc...
> >> >>> >>>>>>>>>>>
> >> >>> >>>>>>>>>>> There are 8 blockers currently, are any of these
> >> >>> >>>>>>>>>>> something
> >> that
> >> >>> >>>> should
> >> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >> >>> >>>>>>>>>>>
> >> >>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If
> >> >>> >>>>>>>>>>> folks
> >> find
> >> >>> >>>> this a
> >> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
> >> >>> >>>>>>>>>>>
> >> >>> >>>>>>>>>>> Patrick
> >> >>> >>>>>>>>>>
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>>>
> >> >>> >>>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >>>>>>
> >> >>> >
> >> >>>
> >>


Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
I fixed a number of issues. I also started a few threads with builds@
- the ulimit issue is still outstanding. Hongchao and I worked through
a number of findbugs issues, it's not closed yet but it's pretty
close.

I don't see why we can't create an RC and start voting this week
though. Anyone disagree?

How long should we let the vote run, the std 72 biz hours or should we
plan for more to allow folks more time to test?

Patrick

On Mon, Jul 21, 2014 at 10:29 AM, Raúl Gutiérrez Segalés
<rg...@itevenworks.net> wrote:
> On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:
>
>> You may notice some back/forth on Apache Jenkins ZK jobs - I'm trying
>> to fix some of the jobs that were broken during the recent host
>> upgrade.
>>
>
> How are things looking? Is it likely that we can have a 3.5.0 alpha release
> week or are
> we still blocked on Jenkins?
>
>
> -rgs
>
>
>
>
>
>
>> Patrick
>>
>> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki <mi...@cs.stanford.edu>
>> wrote:
>> > I'll check in ZOOKEEPER-1683.
>> >
>> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer <sh...@gmail.com>
>> wrote:
>> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all
>> subsequent
>> >> changes were formatting as suggested by Rakesh.
>> >>
>> >>
>> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org> wrote:
>> >>
>> >>> I'm concerned that the CI tests are all failing due to, for e.g.
>> >>> findbugs issues. At the very least our build/test/ci should be pretty
>> >>> clean - some flakeys is ok (the recent startServer fix and some other
>> >>> flakeys that have been addressed go a long way on that issue) but I
>> >>> think the findbugs problem should be cleaned up before we cut a
>> >>> release. I started a separate thread to discuss the findbugs issue.
>> >>>
>> >>> Otw we seem to be in ok shape - 1863 is in.
>> >>>
>> >>> Anyone have a chance to give feedback to Raul on 1919?
>> >>>
>> >>> Patrick
>> >>>
>> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> >>> <fp...@yahoo.com.invalid> wrote:
>> >>> > My take:
>> >>> >
>> >>> > - ZK-1863 is pending review. It is a blocker and it can go in. See
>> the
>> >>> jira for comments.
>> >>> > - We can try to have ZK-1807 in for the first alpha.
>> >>> > - I'd rather not have the first alpha depending on ZK-1919 and
>> ZK-1910,
>> >>> we can leave it for the second alpha.
>> >>> >
>> >>> > If you agree with this, then we should be able to cut a candidate by
>> the
>> >>> end of this week.
>> >>> >
>> >>> > -Flavio
>> >>> >
>> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:
>> >>> >
>> >>> >> Per my previous note you can now see the c client test log output
>> here
>> >>> >> in the "build artifacts" section:
>> >>> >>
>> >>>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/
>> >>> >>
>> >>> >> Patrick
>> >>> >>
>> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org>
>> wrote:
>> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
>> >>> >>> one(s?) is new?)
>> >>> >>>
>> >>> >>> Looks like the autoconf issue I reported is hitting the upgraded
>> >>> >>> apache jenkins instances as well. I've updated the "archive" list
>> to
>> >>> >>> include the c tests stdout redirect. So while it won't go to
>> console
>> >>> >>> at least we can debug when there is a failure.
>> >>> >>>
>> >>> >>> Raul has been helping Bill with reviews for the jetty server
>> support
>> >>> >>> and it looks like that should be ready soon.
>> >>> >>>
>> >>> >>> Raul also requested that someone prioritize reviewing
>> "ZOOKEEPER-1919
>> >>> >>> Update the C implementation of removeWatches to have it match
>> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
>> >>> >>>
>> >>> >>> Hongchao got a patch in to cleanup the flakey c client reconfig
>> test -
>> >>> >>> kudos on helping cleanup the build/test infra!
>> >>> >>>
>> >>> >>>
>> >>> >>> Based on previous comments it looks like we're pretty close. Do
>> folks
>> >>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a few
>> pending
>> >>> >>> as above)
>> >>> >>>
>> >>> >>> Patrick
>> >>> >>>
>> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
>> >>> >>> <rg...@itevenworks.net> wrote:
>> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> >>> <fp...@yahoo.com.invalid>
>> >>> >>>> wrote:
>> >>> >>>>>
>> >>> >>>>> Just so that we don´t delay too much, what if we release an alpha
>> >>> version
>> >>> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
>> >>> >>>>>
>> >>> >>>>
>> >>> >>>> +1
>> >>> >>>>
>> >>> >>>> -rgs
>> >>> >>>>
>> >>> >>>>> -Flavio
>> >>> >>>>>
>> >>> >>>>>
>> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
>> >>> >>>> rgs@itevenworks.net> wrote:
>> >>> >>>>>
>> >>> >>>>>
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>> >>> >>>>>>
>> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last
>> >>> check).
>> >>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is
>> threatening
>> >>> to
>> >>> >>>>>>> commit 1863. I see some great progress in general on the patch
>> >>> >>>>>>> availables queue, which is great to see.
>> >>> >>>>>>>
>> >>> >>>>>>> So here's something else we might consider - should we drop
>> jdk6
>> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I suspect
>> some
>> >>> >>>>>>> folks are still using ZK with 6. We gotta move forward though,
>> >>> can't
>> >>> >>>>>>> support it forever. Thoughts? Note that we are currently
>> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >>> >>>>>>>
>> >>> >>>>>>
>> >>> >>>>>> Extra eyes/review for
>> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >>> >>>>>> would be appreciated (otherwise anyone using Observers with the
>> >>> upcoming
>> >>> >>>>>> alpha release will see there network usage go wild...).
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >>>>>> -rgs
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >>>>>>> Patrick
>> >>> >>>>>>>
>> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >>> >>>>>>>> According to me, ZK-1810 should be in already, but I need a +1
>> >>> >>>> there. I
>> >>> >>>>>>> think Michi hasn't checked in because LETest failed in the
>> last QA
>> >>> run
>> >>> >>>>>>> there. However, that patch doesn't affect LETest, and in fact
>> it
>> >>> fails
>> >>> >>>> in
>> >>> >>>>>>> trunk intermittently, so the test failure doesn't seem to be
>> >>> related
>> >>> >>>> to the
>> >>> >>>>>>> patch.
>> >>> >>>>>>>>
>> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything concrete
>> about
>> >>> it.
>> >>> >>>>>>>>
>> >>> >>>>>>>> -Flavio
>> >>> >>>>>>>>
>> >>> >>>>>>>>
>> >>> >>>>>>>>
>> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
>> phunt@apache.org>
>> >>> >>>> wrote:
>> >>> >>>>>>>>
>> >>> >>>>>>>>
>> >>> >>>>>>>>>
>> >>> >>>>>>>>>
>> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
>> reviewed/finalized
>> >>> before
>> >>> >>>>>>>>> the end of the week? I'd like to try cutting an RC soonish...
>> >>> >>>>>>>>>
>> >>> >>>>>>>>> Patrick
>> >>> >>>>>>>>>
>> >>> >>>>>>>>>
>> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>> >>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>> >>> >>>>>>>>>>
>> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are
>> both
>> >>> >>>> patch
>> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still open because we
>> >>> need a
>> >>> >>>> 3.4
>> >>> >>>>>>> patch.
>> >>> >>>>>>>>>>
>> >>> >>>>>>>>>> -Flavio
>> >>> >>>>>>>>>>
>> >>> >>>>>>>>>>
>> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org>
>> >>> wrote:
>> >>> >>>>>>>>>>
>> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a few
>> >>> people
>> >>> >>>> have
>> >>> >>>>>>>>>>> mentioned on the list as well as contacted me personally
>> that
>> >>> they
>> >>> >>>>>>>>>>> would like to see some progress on the first 3.5 release.
>> Every
>> >>> >>>>>>>>>>> release is a compromise, if we wait for perfection we'll
>> never
>> >>> get
>> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new features,
>> >>> lots of
>> >>> >>>>>>>>>>> hard work, let's get it out in a release so that folks can
>> use
>> >>> it,
>> >>> >>>>>>>>>>> test it, and give feedback.
>> >>> >>>>>>>>>>>
>> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the known
>> >>> flakey
>> >>> >>>> test
>> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note
>> that
>> >>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
>> >>> >>>>>>>>>>>
>> >>> >>>>>>>>>>> Here's my thinking again on how we should plan our
>> releases:
>> >>> >>>>>>>>>>>
>> >>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for some
>> time.
>> >>> >>>> What I
>> >>> >>>>>>>>>>> think we should do instead is similar to what we did for
>> 3.4.
>> >>> >>>> (this is
>> >>> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2
>> release
>> >>> >>>> cycle)
>> >>> >>>>>>>>>>> Start with a series of alpha releases, something people
>> can run
>> >>> >>>> and
>> >>> >>>>>>>>>>> test with, once we address all the blockers and feel
>> >>> comfortable
>> >>> >>>> with
>> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta. Once we
>> get
>> >>> >>>> some
>> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker and look at
>> >>> making
>> >>> >>>> it
>> >>> >>>>>>>>>>> "stable'. At some later point it will become the
>> >>> "current/stable"
>> >>> >>>>>>>>>>> release, taking over from 3.4.x.
>> >>> >>>>>>>>>>>
>> >>> >>>>>>>>>>> e.g.
>> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
>> >>> >>>>>>>>>>> 3.5.1-alpha (3 blockers)
>> >>> >>>>>>>>>>> 3.5.2-alpha (0 blockers)
>> >>> >>>>>>>>>>> 3.5.3-beta (apis locked)
>> >>> >>>>>>>>>>> 3.5.4-beta
>> >>> >>>>>>>>>>> 3.5.5-beta
>> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not
>> "stable" vs
>> >>> >>>> 3.4.x,
>> >>> >>>>>>>>>>> maybe use it for production but we still expect things to
>> shake
>> >>> >>>> out)
>> >>> >>>>>>>>>>> 3.5.7
>> >>> >>>>>>>>>>> ....
>> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production use,
>> >>> stable,
>> >>> >>>>>>> etc...
>> >>> >>>>>>>>>>>
>> >>> >>>>>>>>>>> There are 8 blockers currently, are any of these something
>> that
>> >>> >>>> should
>> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> >>> >>>>>>>>>>>
>> >>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If folks
>> find
>> >>> >>>> this a
>> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
>> >>> >>>>>>>>>>>
>> >>> >>>>>>>>>>> Patrick
>> >>> >>>>>>>>>>
>> >>> >>>>>>>>>
>> >>> >>>>>>>>>
>> >>> >>>>>>>>>
>> >>> >>>>>>>
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >>>>>>
>> >>> >
>> >>>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 18 July 2014 10:32, Patrick Hunt <ph...@apache.org> wrote:

> You may notice some back/forth on Apache Jenkins ZK jobs - I'm trying
> to fix some of the jobs that were broken during the recent host
> upgrade.
>

How are things looking? Is it likely that we can have a 3.5.0 alpha release
week or are
we still blocked on Jenkins?


-rgs






> Patrick
>
> On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki <mi...@cs.stanford.edu>
> wrote:
> > I'll check in ZOOKEEPER-1683.
> >
> > On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer <sh...@gmail.com>
> wrote:
> >> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all
> subsequent
> >> changes were formatting as suggested by Rakesh.
> >>
> >>
> >> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org> wrote:
> >>
> >>> I'm concerned that the CI tests are all failing due to, for e.g.
> >>> findbugs issues. At the very least our build/test/ci should be pretty
> >>> clean - some flakeys is ok (the recent startServer fix and some other
> >>> flakeys that have been addressed go a long way on that issue) but I
> >>> think the findbugs problem should be cleaned up before we cut a
> >>> release. I started a separate thread to discuss the findbugs issue.
> >>>
> >>> Otw we seem to be in ok shape - 1863 is in.
> >>>
> >>> Anyone have a chance to give feedback to Raul on 1919?
> >>>
> >>> Patrick
> >>>
> >>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> >>> <fp...@yahoo.com.invalid> wrote:
> >>> > My take:
> >>> >
> >>> > - ZK-1863 is pending review. It is a blocker and it can go in. See
> the
> >>> jira for comments.
> >>> > - We can try to have ZK-1807 in for the first alpha.
> >>> > - I'd rather not have the first alpha depending on ZK-1919 and
> ZK-1910,
> >>> we can leave it for the second alpha.
> >>> >
> >>> > If you agree with this, then we should be able to cut a candidate by
> the
> >>> end of this week.
> >>> >
> >>> > -Flavio
> >>> >
> >>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:
> >>> >
> >>> >> Per my previous note you can now see the c client test log output
> here
> >>> >> in the "build artifacts" section:
> >>> >>
> >>>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/
> >>> >>
> >>> >> Patrick
> >>> >>
> >>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org>
> wrote:
> >>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
> >>> >>> one(s?) is new?)
> >>> >>>
> >>> >>> Looks like the autoconf issue I reported is hitting the upgraded
> >>> >>> apache jenkins instances as well. I've updated the "archive" list
> to
> >>> >>> include the c tests stdout redirect. So while it won't go to
> console
> >>> >>> at least we can debug when there is a failure.
> >>> >>>
> >>> >>> Raul has been helping Bill with reviews for the jetty server
> support
> >>> >>> and it looks like that should be ready soon.
> >>> >>>
> >>> >>> Raul also requested that someone prioritize reviewing
> "ZOOKEEPER-1919
> >>> >>> Update the C implementation of removeWatches to have it match
> >>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
> >>> >>>
> >>> >>> Hongchao got a patch in to cleanup the flakey c client reconfig
> test -
> >>> >>> kudos on helping cleanup the build/test infra!
> >>> >>>
> >>> >>>
> >>> >>> Based on previous comments it looks like we're pretty close. Do
> folks
> >>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a few
> pending
> >>> >>> as above)
> >>> >>>
> >>> >>> Patrick
> >>> >>>
> >>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
> >>> >>> <rg...@itevenworks.net> wrote:
> >>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> >>> <fp...@yahoo.com.invalid>
> >>> >>>> wrote:
> >>> >>>>>
> >>> >>>>> Just so that we don´t delay too much, what if we release an alpha
> >>> version
> >>> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
> >>> >>>>>
> >>> >>>>
> >>> >>>> +1
> >>> >>>>
> >>> >>>> -rgs
> >>> >>>>
> >>> >>>>> -Flavio
> >>> >>>>>
> >>> >>>>>
> >>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
> >>> >>>> rgs@itevenworks.net> wrote:
> >>> >>>>>
> >>> >>>>>
> >>> >>>>>>
> >>> >>>>>>
> >>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
> >>> >>>>>>
> >>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last
> >>> check).
> >>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is
> threatening
> >>> to
> >>> >>>>>>> commit 1863. I see some great progress in general on the patch
> >>> >>>>>>> availables queue, which is great to see.
> >>> >>>>>>>
> >>> >>>>>>> So here's something else we might consider - should we drop
> jdk6
> >>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I suspect
> some
> >>> >>>>>>> folks are still using ZK with 6. We gotta move forward though,
> >>> can't
> >>> >>>>>>> support it forever. Thoughts? Note that we are currently
> >>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> >>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >>> >>>>>>>
> >>> >>>>>>
> >>> >>>>>> Extra eyes/review for
> >>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >>> >>>>>> would be appreciated (otherwise anyone using Observers with the
> >>> upcoming
> >>> >>>>>> alpha release will see there network usage go wild...).
> >>> >>>>>>
> >>> >>>>>>
> >>> >>>>>> -rgs
> >>> >>>>>>
> >>> >>>>>>
> >>> >>>>>>
> >>> >>>>>>
> >>> >>>>>>
> >>> >>>>>>> Patrick
> >>> >>>>>>>
> >>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> >>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >>> >>>>>>>> According to me, ZK-1810 should be in already, but I need a +1
> >>> >>>> there. I
> >>> >>>>>>> think Michi hasn't checked in because LETest failed in the
> last QA
> >>> run
> >>> >>>>>>> there. However, that patch doesn't affect LETest, and in fact
> it
> >>> fails
> >>> >>>> in
> >>> >>>>>>> trunk intermittently, so the test failure doesn't seem to be
> >>> related
> >>> >>>> to the
> >>> >>>>>>> patch.
> >>> >>>>>>>>
> >>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything concrete
> about
> >>> it.
> >>> >>>>>>>>
> >>> >>>>>>>> -Flavio
> >>> >>>>>>>>
> >>> >>>>>>>>
> >>> >>>>>>>>
> >>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <
> phunt@apache.org>
> >>> >>>> wrote:
> >>> >>>>>>>>
> >>> >>>>>>>>
> >>> >>>>>>>>>
> >>> >>>>>>>>>
> >>> >>>>>>>>> Hi Flavio, do you think those jiras can get
> reviewed/finalized
> >>> before
> >>> >>>>>>>>> the end of the week? I'd like to try cutting an RC soonish...
> >>> >>>>>>>>>
> >>> >>>>>>>>> Patrick
> >>> >>>>>>>>>
> >>> >>>>>>>>>
> >>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> >>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
> >>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> >>> >>>>>>>>>>
> >>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are
> both
> >>> >>>> patch
> >>> >>>>>>> available. ZK-1870 is in trunk, but it is still open because we
> >>> need a
> >>> >>>> 3.4
> >>> >>>>>>> patch.
> >>> >>>>>>>>>>
> >>> >>>>>>>>>> -Flavio
> >>> >>>>>>>>>>
> >>> >>>>>>>>>>
> >>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org>
> >>> wrote:
> >>> >>>>>>>>>>
> >>> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a few
> >>> people
> >>> >>>> have
> >>> >>>>>>>>>>> mentioned on the list as well as contacted me personally
> that
> >>> they
> >>> >>>>>>>>>>> would like to see some progress on the first 3.5 release.
> Every
> >>> >>>>>>>>>>> release is a compromise, if we wait for perfection we'll
> never
> >>> get
> >>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new features,
> >>> lots of
> >>> >>>>>>>>>>> hard work, let's get it out in a release so that folks can
> use
> >>> it,
> >>> >>>>>>>>>>> test it, and give feedback.
> >>> >>>>>>>>>>>
> >>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the known
> >>> flakey
> >>> >>>> test
> >>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note
> that
> >>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
> >>> >>>>>>>>>>>
> >>> >>>>>>>>>>> Here's my thinking again on how we should plan our
> releases:
> >>> >>>>>>>>>>>
> >>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for some
> time.
> >>> >>>> What I
> >>> >>>>>>>>>>> think we should do instead is similar to what we did for
> 3.4.
> >>> >>>> (this is
> >>> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2
> release
> >>> >>>> cycle)
> >>> >>>>>>>>>>> Start with a series of alpha releases, something people
> can run
> >>> >>>> and
> >>> >>>>>>>>>>> test with, once we address all the blockers and feel
> >>> comfortable
> >>> >>>> with
> >>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta. Once we
> get
> >>> >>>> some
> >>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker and look at
> >>> making
> >>> >>>> it
> >>> >>>>>>>>>>> "stable'. At some later point it will become the
> >>> "current/stable"
> >>> >>>>>>>>>>> release, taking over from 3.4.x.
> >>> >>>>>>>>>>>
> >>> >>>>>>>>>>> e.g.
> >>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
> >>> >>>>>>>>>>> 3.5.1-alpha (3 blockers)
> >>> >>>>>>>>>>> 3.5.2-alpha (0 blockers)
> >>> >>>>>>>>>>> 3.5.3-beta (apis locked)
> >>> >>>>>>>>>>> 3.5.4-beta
> >>> >>>>>>>>>>> 3.5.5-beta
> >>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not
> "stable" vs
> >>> >>>> 3.4.x,
> >>> >>>>>>>>>>> maybe use it for production but we still expect things to
> shake
> >>> >>>> out)
> >>> >>>>>>>>>>> 3.5.7
> >>> >>>>>>>>>>> ....
> >>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production use,
> >>> stable,
> >>> >>>>>>> etc...
> >>> >>>>>>>>>>>
> >>> >>>>>>>>>>> There are 8 blockers currently, are any of these something
> that
> >>> >>>> should
> >>> >>>>>>>>>>> hold up 3.5.0-alpha?
> >>> >>>>>>>>>>>
> >>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If folks
> find
> >>> >>>> this a
> >>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
> >>> >>>>>>>>>>>
> >>> >>>>>>>>>>> Patrick
> >>> >>>>>>>>>>
> >>> >>>>>>>>>
> >>> >>>>>>>>>
> >>> >>>>>>>>>
> >>> >>>>>>>
> >>> >>>>>>
> >>> >>>>>>
> >>> >>>>>>
> >>> >
> >>>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
You may notice some back/forth on Apache Jenkins ZK jobs - I'm trying
to fix some of the jobs that were broken during the recent host
upgrade.

Patrick

On Thu, Jul 17, 2014 at 1:47 PM, Michi Mutsuzaki <mi...@cs.stanford.edu> wrote:
> I'll check in ZOOKEEPER-1683.
>
> On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer <sh...@gmail.com> wrote:
>> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all subsequent
>> changes were formatting as suggested by Rakesh.
>>
>>
>> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> I'm concerned that the CI tests are all failing due to, for e.g.
>>> findbugs issues. At the very least our build/test/ci should be pretty
>>> clean - some flakeys is ok (the recent startServer fix and some other
>>> flakeys that have been addressed go a long way on that issue) but I
>>> think the findbugs problem should be cleaned up before we cut a
>>> release. I started a separate thread to discuss the findbugs issue.
>>>
>>> Otw we seem to be in ok shape - 1863 is in.
>>>
>>> Anyone have a chance to give feedback to Raul on 1919?
>>>
>>> Patrick
>>>
>>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>>> <fp...@yahoo.com.invalid> wrote:
>>> > My take:
>>> >
>>> > - ZK-1863 is pending review. It is a blocker and it can go in. See the
>>> jira for comments.
>>> > - We can try to have ZK-1807 in for the first alpha.
>>> > - I'd rather not have the first alpha depending on ZK-1919 and ZK-1910,
>>> we can leave it for the second alpha.
>>> >
>>> > If you agree with this, then we should be able to cut a candidate by the
>>> end of this week.
>>> >
>>> > -Flavio
>>> >
>>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:
>>> >
>>> >> Per my previous note you can now see the c client test log output here
>>> >> in the "build artifacts" section:
>>> >>
>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/
>>> >>
>>> >> Patrick
>>> >>
>>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org> wrote:
>>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
>>> >>> one(s?) is new?)
>>> >>>
>>> >>> Looks like the autoconf issue I reported is hitting the upgraded
>>> >>> apache jenkins instances as well. I've updated the "archive" list to
>>> >>> include the c tests stdout redirect. So while it won't go to console
>>> >>> at least we can debug when there is a failure.
>>> >>>
>>> >>> Raul has been helping Bill with reviews for the jetty server support
>>> >>> and it looks like that should be ready soon.
>>> >>>
>>> >>> Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
>>> >>> Update the C implementation of removeWatches to have it match
>>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
>>> >>>
>>> >>> Hongchao got a patch in to cleanup the flakey c client reconfig test -
>>> >>> kudos on helping cleanup the build/test infra!
>>> >>>
>>> >>>
>>> >>> Based on previous comments it looks like we're pretty close. Do folks
>>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a few pending
>>> >>> as above)
>>> >>>
>>> >>> Patrick
>>> >>>
>>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
>>> >>> <rg...@itevenworks.net> wrote:
>>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>>> <fp...@yahoo.com.invalid>
>>> >>>> wrote:
>>> >>>>>
>>> >>>>> Just so that we don´t delay too much, what if we release an alpha
>>> version
>>> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
>>> >>>>>
>>> >>>>
>>> >>>> +1
>>> >>>>
>>> >>>> -rgs
>>> >>>>
>>> >>>>> -Flavio
>>> >>>>>
>>> >>>>>
>>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
>>> >>>> rgs@itevenworks.net> wrote:
>>> >>>>>
>>> >>>>>
>>> >>>>>>
>>> >>>>>>
>>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>>> >>>>>>
>>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last
>>> check).
>>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is threatening
>>> to
>>> >>>>>>> commit 1863. I see some great progress in general on the patch
>>> >>>>>>> availables queue, which is great to see.
>>> >>>>>>>
>>> >>>>>>> So here's something else we might consider - should we drop jdk6
>>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I suspect some
>>> >>>>>>> folks are still using ZK with 6. We gotta move forward though,
>>> can't
>>> >>>>>>> support it forever. Thoughts? Note that we are currently
>>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>> >>>>>>>
>>> >>>>>>
>>> >>>>>> Extra eyes/review for
>>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>> >>>>>> would be appreciated (otherwise anyone using Observers with the
>>> upcoming
>>> >>>>>> alpha release will see there network usage go wild...).
>>> >>>>>>
>>> >>>>>>
>>> >>>>>> -rgs
>>> >>>>>>
>>> >>>>>>
>>> >>>>>>
>>> >>>>>>
>>> >>>>>>
>>> >>>>>>> Patrick
>>> >>>>>>>
>>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>>> >>>>>>>> According to me, ZK-1810 should be in already, but I need a +1
>>> >>>> there. I
>>> >>>>>>> think Michi hasn't checked in because LETest failed in the last QA
>>> run
>>> >>>>>>> there. However, that patch doesn't affect LETest, and in fact it
>>> fails
>>> >>>> in
>>> >>>>>>> trunk intermittently, so the test failure doesn't seem to be
>>> related
>>> >>>> to the
>>> >>>>>>> patch.
>>> >>>>>>>>
>>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything concrete about
>>> it.
>>> >>>>>>>>
>>> >>>>>>>> -Flavio
>>> >>>>>>>>
>>> >>>>>>>>
>>> >>>>>>>>
>>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
>>> >>>> wrote:
>>> >>>>>>>>
>>> >>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>> Hi Flavio, do you think those jiras can get reviewed/finalized
>>> before
>>> >>>>>>>>> the end of the week? I'd like to try cutting an RC soonish...
>>> >>>>>>>>>
>>> >>>>>>>>> Patrick
>>> >>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
>>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>>> >>>>>>>>>>
>>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
>>> >>>> patch
>>> >>>>>>> available. ZK-1870 is in trunk, but it is still open because we
>>> need a
>>> >>>> 3.4
>>> >>>>>>> patch.
>>> >>>>>>>>>>
>>> >>>>>>>>>> -Flavio
>>> >>>>>>>>>>
>>> >>>>>>>>>>
>>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org>
>>> wrote:
>>> >>>>>>>>>>
>>> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a few
>>> people
>>> >>>> have
>>> >>>>>>>>>>> mentioned on the list as well as contacted me personally that
>>> they
>>> >>>>>>>>>>> would like to see some progress on the first 3.5 release. Every
>>> >>>>>>>>>>> release is a compromise, if we wait for perfection we'll never
>>> get
>>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new features,
>>> lots of
>>> >>>>>>>>>>> hard work, let's get it out in a release so that folks can use
>>> it,
>>> >>>>>>>>>>> test it, and give feedback.
>>> >>>>>>>>>>>
>>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the known
>>> flakey
>>> >>>> test
>>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>> >>>>>>>>>>>
>>> >>>>>>>>>>> Here's my thinking again on how we should plan our releases:
>>> >>>>>>>>>>>
>>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for some time.
>>> >>>> What I
>>> >>>>>>>>>>> think we should do instead is similar to what we did for 3.4.
>>> >>>> (this is
>>> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2 release
>>> >>>> cycle)
>>> >>>>>>>>>>> Start with a series of alpha releases, something people can run
>>> >>>> and
>>> >>>>>>>>>>> test with, once we address all the blockers and feel
>>> comfortable
>>> >>>> with
>>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta. Once we get
>>> >>>> some
>>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker and look at
>>> making
>>> >>>> it
>>> >>>>>>>>>>> "stable'. At some later point it will become the
>>> "current/stable"
>>> >>>>>>>>>>> release, taking over from 3.4.x.
>>> >>>>>>>>>>>
>>> >>>>>>>>>>> e.g.
>>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
>>> >>>>>>>>>>> 3.5.1-alpha (3 blockers)
>>> >>>>>>>>>>> 3.5.2-alpha (0 blockers)
>>> >>>>>>>>>>> 3.5.3-beta (apis locked)
>>> >>>>>>>>>>> 3.5.4-beta
>>> >>>>>>>>>>> 3.5.5-beta
>>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
>>> >>>> 3.4.x,
>>> >>>>>>>>>>> maybe use it for production but we still expect things to shake
>>> >>>> out)
>>> >>>>>>>>>>> 3.5.7
>>> >>>>>>>>>>> ....
>>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production use,
>>> stable,
>>> >>>>>>> etc...
>>> >>>>>>>>>>>
>>> >>>>>>>>>>> There are 8 blockers currently, are any of these something that
>>> >>>> should
>>> >>>>>>>>>>> hold up 3.5.0-alpha?
>>> >>>>>>>>>>>
>>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If folks find
>>> >>>> this a
>>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
>>> >>>>>>>>>>>
>>> >>>>>>>>>>> Patrick
>>> >>>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>>>
>>> >>>>>>>
>>> >>>>>>
>>> >>>>>>
>>> >>>>>>
>>> >
>>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Michi Mutsuzaki <mi...@cs.stanford.edu>.
I'll check in ZOOKEEPER-1683.

On Thu, Jul 17, 2014 at 11:20 AM, Alexander Shraer <sh...@gmail.com> wrote:
> can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all subsequent
> changes were formatting as suggested by Rakesh.
>
>
> On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org> wrote:
>
>> I'm concerned that the CI tests are all failing due to, for e.g.
>> findbugs issues. At the very least our build/test/ci should be pretty
>> clean - some flakeys is ok (the recent startServer fix and some other
>> flakeys that have been addressed go a long way on that issue) but I
>> think the findbugs problem should be cleaned up before we cut a
>> release. I started a separate thread to discuss the findbugs issue.
>>
>> Otw we seem to be in ok shape - 1863 is in.
>>
>> Anyone have a chance to give feedback to Raul on 1919?
>>
>> Patrick
>>
>> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
>> <fp...@yahoo.com.invalid> wrote:
>> > My take:
>> >
>> > - ZK-1863 is pending review. It is a blocker and it can go in. See the
>> jira for comments.
>> > - We can try to have ZK-1807 in for the first alpha.
>> > - I'd rather not have the first alpha depending on ZK-1919 and ZK-1910,
>> we can leave it for the second alpha.
>> >
>> > If you agree with this, then we should be able to cut a candidate by the
>> end of this week.
>> >
>> > -Flavio
>> >
>> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:
>> >
>> >> Per my previous note you can now see the c client test log output here
>> >> in the "build artifacts" section:
>> >>
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/
>> >>
>> >> Patrick
>> >>
>> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org> wrote:
>> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
>> >>> one(s?) is new?)
>> >>>
>> >>> Looks like the autoconf issue I reported is hitting the upgraded
>> >>> apache jenkins instances as well. I've updated the "archive" list to
>> >>> include the c tests stdout redirect. So while it won't go to console
>> >>> at least we can debug when there is a failure.
>> >>>
>> >>> Raul has been helping Bill with reviews for the jetty server support
>> >>> and it looks like that should be ready soon.
>> >>>
>> >>> Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
>> >>> Update the C implementation of removeWatches to have it match
>> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
>> >>>
>> >>> Hongchao got a patch in to cleanup the flakey c client reconfig test -
>> >>> kudos on helping cleanup the build/test infra!
>> >>>
>> >>>
>> >>> Based on previous comments it looks like we're pretty close. Do folks
>> >>> feel comfortable with a 3.5.0 alpha at this point? (with a few pending
>> >>> as above)
>> >>>
>> >>> Patrick
>> >>>
>> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
>> >>> <rg...@itevenworks.net> wrote:
>> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
>> <fp...@yahoo.com.invalid>
>> >>>> wrote:
>> >>>>>
>> >>>>> Just so that we don´t delay too much, what if we release an alpha
>> version
>> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
>> >>>>>
>> >>>>
>> >>>> +1
>> >>>>
>> >>>> -rgs
>> >>>>
>> >>>>> -Flavio
>> >>>>>
>> >>>>>
>> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
>> >>>> rgs@itevenworks.net> wrote:
>> >>>>>
>> >>>>>
>> >>>>>>
>> >>>>>>
>> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>> >>>>>>
>> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last
>> check).
>> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is threatening
>> to
>> >>>>>>> commit 1863. I see some great progress in general on the patch
>> >>>>>>> availables queue, which is great to see.
>> >>>>>>>
>> >>>>>>> So here's something else we might consider - should we drop jdk6
>> >>>>>>> support from 3.5. It's long since EOL by Oracle but I suspect some
>> >>>>>>> folks are still using ZK with 6. We gotta move forward though,
>> can't
>> >>>>>>> support it forever. Thoughts? Note that we are currently
>> >>>>>>> building/testing trunk against jdk6, 7 and 8.
>> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >>>>>>>
>> >>>>>>
>> >>>>>> Extra eyes/review for
>> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >>>>>> would be appreciated (otherwise anyone using Observers with the
>> upcoming
>> >>>>>> alpha release will see there network usage go wild...).
>> >>>>>>
>> >>>>>>
>> >>>>>> -rgs
>> >>>>>>
>> >>>>>>
>> >>>>>>
>> >>>>>>
>> >>>>>>
>> >>>>>>> Patrick
>> >>>>>>>
>> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> >>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >>>>>>>> According to me, ZK-1810 should be in already, but I need a +1
>> >>>> there. I
>> >>>>>>> think Michi hasn't checked in because LETest failed in the last QA
>> run
>> >>>>>>> there. However, that patch doesn't affect LETest, and in fact it
>> fails
>> >>>> in
>> >>>>>>> trunk intermittently, so the test failure doesn't seem to be
>> related
>> >>>> to the
>> >>>>>>> patch.
>> >>>>>>>>
>> >>>>>>>> I haven't checked ZK-1863, so I can't say anything concrete about
>> it.
>> >>>>>>>>
>> >>>>>>>> -Flavio
>> >>>>>>>>
>> >>>>>>>>
>> >>>>>>>>
>> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
>> >>>> wrote:
>> >>>>>>>>
>> >>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>> Hi Flavio, do you think those jiras can get reviewed/finalized
>> before
>> >>>>>>>>> the end of the week? I'd like to try cutting an RC soonish...
>> >>>>>>>>>
>> >>>>>>>>> Patrick
>> >>>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
>> >>>>>>>>>> +1 for the plan of releasing alpha versions.
>> >>>>>>>>>>
>> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
>> >>>> patch
>> >>>>>>> available. ZK-1870 is in trunk, but it is still open because we
>> need a
>> >>>> 3.4
>> >>>>>>> patch.
>> >>>>>>>>>>
>> >>>>>>>>>> -Flavio
>> >>>>>>>>>>
>> >>>>>>>>>>
>> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org>
>> wrote:
>> >>>>>>>>>>
>> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a few
>> people
>> >>>> have
>> >>>>>>>>>>> mentioned on the list as well as contacted me personally that
>> they
>> >>>>>>>>>>> would like to see some progress on the first 3.5 release. Every
>> >>>>>>>>>>> release is a compromise, if we wait for perfection we'll never
>> get
>> >>>>>>>>>>> anything out the door. 3.5 has tons of great new features,
>> lots of
>> >>>>>>>>>>> hard work, let's get it out in a release so that folks can use
>> it,
>> >>>>>>>>>>> test it, and give feedback.
>> >>>>>>>>>>>
>> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the known
>> flakey
>> >>>> test
>> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
>> >>>>>>>>>>>
>> >>>>>>>>>>> Here's my thinking again on how we should plan our releases:
>> >>>>>>>>>>>
>> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for some time.
>> >>>> What I
>> >>>>>>>>>>> think we should do instead is similar to what we did for 3.4.
>> >>>> (this is
>> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2 release
>> >>>> cycle)
>> >>>>>>>>>>> Start with a series of alpha releases, something people can run
>> >>>> and
>> >>>>>>>>>>> test with, once we address all the blockers and feel
>> comfortable
>> >>>> with
>> >>>>>>>>>>> the apis & remaining jiras we then switch to beta. Once we get
>> >>>> some
>> >>>>>>>>>>> good feedback we remove the alpha/beta moniker and look at
>> making
>> >>>> it
>> >>>>>>>>>>> "stable'. At some later point it will become the
>> "current/stable"
>> >>>>>>>>>>> release, taking over from 3.4.x.
>> >>>>>>>>>>>
>> >>>>>>>>>>> e.g.
>> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
>> >>>>>>>>>>> 3.5.1-alpha (3 blockers)
>> >>>>>>>>>>> 3.5.2-alpha (0 blockers)
>> >>>>>>>>>>> 3.5.3-beta (apis locked)
>> >>>>>>>>>>> 3.5.4-beta
>> >>>>>>>>>>> 3.5.5-beta
>> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
>> >>>> 3.4.x,
>> >>>>>>>>>>> maybe use it for production but we still expect things to shake
>> >>>> out)
>> >>>>>>>>>>> 3.5.7
>> >>>>>>>>>>> ....
>> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production use,
>> stable,
>> >>>>>>> etc...
>> >>>>>>>>>>>
>> >>>>>>>>>>> There are 8 blockers currently, are any of these something that
>> >>>> should
>> >>>>>>>>>>> hold up 3.5.0-alpha?
>> >>>>>>>>>>>
>> >>>>>>>>>>> I'll hold open the discussion for a couple days. If folks find
>> >>>> this a
>> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
>> >>>>>>>>>>>
>> >>>>>>>>>>> Patrick
>> >>>>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>>
>> >>>>>>>
>> >>>>>>
>> >>>>>>
>> >>>>>>
>> >
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
can we also have ZOOKEEPER-1683 in ? Camille gave a +1 and all subsequent
changes were formatting as suggested by Rakesh.


On Thu, Jul 17, 2014 at 9:48 AM, Patrick Hunt <ph...@apache.org> wrote:

> I'm concerned that the CI tests are all failing due to, for e.g.
> findbugs issues. At the very least our build/test/ci should be pretty
> clean - some flakeys is ok (the recent startServer fix and some other
> flakeys that have been addressed go a long way on that issue) but I
> think the findbugs problem should be cleaned up before we cut a
> release. I started a separate thread to discuss the findbugs issue.
>
> Otw we seem to be in ok shape - 1863 is in.
>
> Anyone have a chance to give feedback to Raul on 1919?
>
> Patrick
>
> On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
> <fp...@yahoo.com.invalid> wrote:
> > My take:
> >
> > - ZK-1863 is pending review. It is a blocker and it can go in. See the
> jira for comments.
> > - We can try to have ZK-1807 in for the first alpha.
> > - I'd rather not have the first alpha depending on ZK-1919 and ZK-1910,
> we can leave it for the second alpha.
> >
> > If you agree with this, then we should be able to cut a candidate by the
> end of this week.
> >
> > -Flavio
> >
> > On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:
> >
> >> Per my previous note you can now see the c client test log output here
> >> in the "build artifacts" section:
> >>
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/
> >>
> >> Patrick
> >>
> >> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org> wrote:
> >>> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
> >>> one(s?) is new?)
> >>>
> >>> Looks like the autoconf issue I reported is hitting the upgraded
> >>> apache jenkins instances as well. I've updated the "archive" list to
> >>> include the c tests stdout redirect. So while it won't go to console
> >>> at least we can debug when there is a failure.
> >>>
> >>> Raul has been helping Bill with reviews for the jetty server support
> >>> and it looks like that should be ready soon.
> >>>
> >>> Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
> >>> Update the C implementation of removeWatches to have it match
> >>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
> >>>
> >>> Hongchao got a patch in to cleanup the flakey c client reconfig test -
> >>> kudos on helping cleanup the build/test infra!
> >>>
> >>>
> >>> Based on previous comments it looks like we're pretty close. Do folks
> >>> feel comfortable with a 3.5.0 alpha at this point? (with a few pending
> >>> as above)
> >>>
> >>> Patrick
> >>>
> >>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
> >>> <rg...@itevenworks.net> wrote:
> >>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira"
> <fp...@yahoo.com.invalid>
> >>>> wrote:
> >>>>>
> >>>>> Just so that we don´t delay too much, what if we release an alpha
> version
> >>>> without 1863 and 1807, and do another one in 2-3 weeks time?
> >>>>>
> >>>>
> >>>> +1
> >>>>
> >>>> -rgs
> >>>>
> >>>>> -Flavio
> >>>>>
> >>>>>
> >>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
> >>>> rgs@itevenworks.net> wrote:
> >>>>>
> >>>>>
> >>>>>>
> >>>>>>
> >>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
> >>>>>>
> >>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last
> check).
> >>>>>>> 1810 is waiting on feedback from Michi, and Camille is threatening
> to
> >>>>>>> commit 1863. I see some great progress in general on the patch
> >>>>>>> availables queue, which is great to see.
> >>>>>>>
> >>>>>>> So here's something else we might consider - should we drop jdk6
> >>>>>>> support from 3.5. It's long since EOL by Oracle but I suspect some
> >>>>>>> folks are still using ZK with 6. We gotta move forward though,
> can't
> >>>>>>> support it forever. Thoughts? Note that we are currently
> >>>>>>> building/testing trunk against jdk6, 7 and 8.
> >>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >>>>>>>
> >>>>>>
> >>>>>> Extra eyes/review for
> >>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >>>>>> would be appreciated (otherwise anyone using Observers with the
> upcoming
> >>>>>> alpha release will see there network usage go wild...).
> >>>>>>
> >>>>>>
> >>>>>> -rgs
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>>> Patrick
> >>>>>>>
> >>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> >>>>>>> <fp...@yahoo.com.invalid> wrote:
> >>>>>>>> According to me, ZK-1810 should be in already, but I need a +1
> >>>> there. I
> >>>>>>> think Michi hasn't checked in because LETest failed in the last QA
> run
> >>>>>>> there. However, that patch doesn't affect LETest, and in fact it
> fails
> >>>> in
> >>>>>>> trunk intermittently, so the test failure doesn't seem to be
> related
> >>>> to the
> >>>>>>> patch.
> >>>>>>>>
> >>>>>>>> I haven't checked ZK-1863, so I can't say anything concrete about
> it.
> >>>>>>>>
> >>>>>>>> -Flavio
> >>>>>>>>
> >>>>>>>>
> >>>>>>>>
> >>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
> >>>> wrote:
> >>>>>>>>
> >>>>>>>>
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> Hi Flavio, do you think those jiras can get reviewed/finalized
> before
> >>>>>>>>> the end of the week? I'd like to try cutting an RC soonish...
> >>>>>>>>>
> >>>>>>>>> Patrick
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> >>>>>>>>> <fp...@yahoo.com.invalid> wrote:
> >>>>>>>>>> +1 for the plan of releasing alpha versions.
> >>>>>>>>>>
> >>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
> >>>> patch
> >>>>>>> available. ZK-1870 is in trunk, but it is still open because we
> need a
> >>>> 3.4
> >>>>>>> patch.
> >>>>>>>>>>
> >>>>>>>>>> -Flavio
> >>>>>>>>>>
> >>>>>>>>>>
> >>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org>
> wrote:
> >>>>>>>>>>
> >>>>>>>>>>> Hey folks, we've been talking about it for a while, a few
> people
> >>>> have
> >>>>>>>>>>> mentioned on the list as well as contacted me personally that
> they
> >>>>>>>>>>> would like to see some progress on the first 3.5 release. Every
> >>>>>>>>>>> release is a compromise, if we wait for perfection we'll never
> get
> >>>>>>>>>>> anything out the door. 3.5 has tons of great new features,
> lots of
> >>>>>>>>>>> hard work, let's get it out in a release so that folks can use
> it,
> >>>>>>>>>>> test it, and give feedback.
> >>>>>>>>>>>
> >>>>>>>>>>> Jenkins jobs have been pretty stable except for the known
> flakey
> >>>> test
> >>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
> >>>>>>>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
> >>>>>>>>>>>
> >>>>>>>>>>> Here's my thinking again on how we should plan our releases:
> >>>>>>>>>>>
> >>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for some time.
> >>>> What I
> >>>>>>>>>>> think we should do instead is similar to what we did for 3.4.
> >>>> (this is
> >>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2 release
> >>>> cycle)
> >>>>>>>>>>> Start with a series of alpha releases, something people can run
> >>>> and
> >>>>>>>>>>> test with, once we address all the blockers and feel
> comfortable
> >>>> with
> >>>>>>>>>>> the apis & remaining jiras we then switch to beta. Once we get
> >>>> some
> >>>>>>>>>>> good feedback we remove the alpha/beta moniker and look at
> making
> >>>> it
> >>>>>>>>>>> "stable'. At some later point it will become the
> "current/stable"
> >>>>>>>>>>> release, taking over from 3.4.x.
> >>>>>>>>>>>
> >>>>>>>>>>> e.g.
> >>>>>>>>>>> 3.5.0-alpha (8 blockers)
> >>>>>>>>>>> 3.5.1-alpha (3 blockers)
> >>>>>>>>>>> 3.5.2-alpha (0 blockers)
> >>>>>>>>>>> 3.5.3-beta (apis locked)
> >>>>>>>>>>> 3.5.4-beta
> >>>>>>>>>>> 3.5.5-beta
> >>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
> >>>> 3.4.x,
> >>>>>>>>>>> maybe use it for production but we still expect things to shake
> >>>> out)
> >>>>>>>>>>> 3.5.7
> >>>>>>>>>>> ....
> >>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production use,
> stable,
> >>>>>>> etc...
> >>>>>>>>>>>
> >>>>>>>>>>> There are 8 blockers currently, are any of these something that
> >>>> should
> >>>>>>>>>>> hold up 3.5.0-alpha?
> >>>>>>>>>>>
> >>>>>>>>>>> I'll hold open the discussion for a couple days. If folks find
> >>>> this a
> >>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
> >>>>>>>>>>>
> >>>>>>>>>>> Patrick
> >>>>>>>>>>
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>>>
> >>>>>>>
> >>>>>>
> >>>>>>
> >>>>>>
> >
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
I'm concerned that the CI tests are all failing due to, for e.g.
findbugs issues. At the very least our build/test/ci should be pretty
clean - some flakeys is ok (the recent startServer fix and some other
flakeys that have been addressed go a long way on that issue) but I
think the findbugs problem should be cleaned up before we cut a
release. I started a separate thread to discuss the findbugs issue.

Otw we seem to be in ok shape - 1863 is in.

Anyone have a chance to give feedback to Raul on 1919?

Patrick

On Tue, Jul 15, 2014 at 10:34 AM, Flavio Junqueira
<fp...@yahoo.com.invalid> wrote:
> My take:
>
> - ZK-1863 is pending review. It is a blocker and it can go in. See the jira for comments.
> - We can try to have ZK-1807 in for the first alpha.
> - I'd rather not have the first alpha depending on ZK-1919 and ZK-1910, we can leave it for the second alpha.
>
> If you agree with this, then we should be able to cut a candidate by the end of this week.
>
> -Flavio
>
> On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:
>
>> Per my previous note you can now see the c client test log output here
>> in the "build artifacts" section:
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/
>>
>> Patrick
>>
>> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org> wrote:
>>> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
>>> one(s?) is new?)
>>>
>>> Looks like the autoconf issue I reported is hitting the upgraded
>>> apache jenkins instances as well. I've updated the "archive" list to
>>> include the c tests stdout redirect. So while it won't go to console
>>> at least we can debug when there is a failure.
>>>
>>> Raul has been helping Bill with reviews for the jetty server support
>>> and it looks like that should be ready soon.
>>>
>>> Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
>>> Update the C implementation of removeWatches to have it match
>>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
>>>
>>> Hongchao got a patch in to cleanup the flakey c client reconfig test -
>>> kudos on helping cleanup the build/test infra!
>>>
>>>
>>> Based on previous comments it looks like we're pretty close. Do folks
>>> feel comfortable with a 3.5.0 alpha at this point? (with a few pending
>>> as above)
>>>
>>> Patrick
>>>
>>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
>>> <rg...@itevenworks.net> wrote:
>>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira" <fp...@yahoo.com.invalid>
>>>> wrote:
>>>>>
>>>>> Just so that we don´t delay too much, what if we release an alpha version
>>>> without 1863 and 1807, and do another one in 2-3 weeks time?
>>>>>
>>>>
>>>> +1
>>>>
>>>> -rgs
>>>>
>>>>> -Flavio
>>>>>
>>>>>
>>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
>>>> rgs@itevenworks.net> wrote:
>>>>>
>>>>>
>>>>>>
>>>>>>
>>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>>>>>>
>>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
>>>>>>> 1810 is waiting on feedback from Michi, and Camille is threatening to
>>>>>>> commit 1863. I see some great progress in general on the patch
>>>>>>> availables queue, which is great to see.
>>>>>>>
>>>>>>> So here's something else we might consider - should we drop jdk6
>>>>>>> support from 3.5. It's long since EOL by Oracle but I suspect some
>>>>>>> folks are still using ZK with 6. We gotta move forward though, can't
>>>>>>> support it forever. Thoughts? Note that we are currently
>>>>>>> building/testing trunk against jdk6, 7 and 8.
>>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>>>>>>
>>>>>>
>>>>>> Extra eyes/review for
>>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>>>>> would be appreciated (otherwise anyone using Observers with the upcoming
>>>>>> alpha release will see there network usage go wild...).
>>>>>>
>>>>>>
>>>>>> -rgs
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>> Patrick
>>>>>>>
>>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>>>>>>> <fp...@yahoo.com.invalid> wrote:
>>>>>>>> According to me, ZK-1810 should be in already, but I need a +1
>>>> there. I
>>>>>>> think Michi hasn't checked in because LETest failed in the last QA run
>>>>>>> there. However, that patch doesn't affect LETest, and in fact it fails
>>>> in
>>>>>>> trunk intermittently, so the test failure doesn't seem to be related
>>>> to the
>>>>>>> patch.
>>>>>>>>
>>>>>>>> I haven't checked ZK-1863, so I can't say anything concrete about it.
>>>>>>>>
>>>>>>>> -Flavio
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
>>>> wrote:
>>>>>>>>
>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Hi Flavio, do you think those jiras can get reviewed/finalized before
>>>>>>>>> the end of the week? I'd like to try cutting an RC soonish...
>>>>>>>>>
>>>>>>>>> Patrick
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>>>>>>>>> <fp...@yahoo.com.invalid> wrote:
>>>>>>>>>> +1 for the plan of releasing alpha versions.
>>>>>>>>>>
>>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
>>>> patch
>>>>>>> available. ZK-1870 is in trunk, but it is still open because we need a
>>>> 3.4
>>>>>>> patch.
>>>>>>>>>>
>>>>>>>>>> -Flavio
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>>>>>>>>>
>>>>>>>>>>> Hey folks, we've been talking about it for a while, a few people
>>>> have
>>>>>>>>>>> mentioned on the list as well as contacted me personally that they
>>>>>>>>>>> would like to see some progress on the first 3.5 release. Every
>>>>>>>>>>> release is a compromise, if we wait for perfection we'll never get
>>>>>>>>>>> anything out the door. 3.5 has tons of great new features, lots of
>>>>>>>>>>> hard work, let's get it out in a release so that folks can use it,
>>>>>>>>>>> test it, and give feedback.
>>>>>>>>>>>
>>>>>>>>>>> Jenkins jobs have been pretty stable except for the known flakey
>>>> test
>>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>>>>>>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>>>>>>>>>>
>>>>>>>>>>> Here's my thinking again on how we should plan our releases:
>>>>>>>>>>>
>>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for some time.
>>>> What I
>>>>>>>>>>> think we should do instead is similar to what we did for 3.4.
>>>> (this is
>>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2 release
>>>> cycle)
>>>>>>>>>>> Start with a series of alpha releases, something people can run
>>>> and
>>>>>>>>>>> test with, once we address all the blockers and feel comfortable
>>>> with
>>>>>>>>>>> the apis & remaining jiras we then switch to beta. Once we get
>>>> some
>>>>>>>>>>> good feedback we remove the alpha/beta moniker and look at making
>>>> it
>>>>>>>>>>> "stable'. At some later point it will become the "current/stable"
>>>>>>>>>>> release, taking over from 3.4.x.
>>>>>>>>>>>
>>>>>>>>>>> e.g.
>>>>>>>>>>> 3.5.0-alpha (8 blockers)
>>>>>>>>>>> 3.5.1-alpha (3 blockers)
>>>>>>>>>>> 3.5.2-alpha (0 blockers)
>>>>>>>>>>> 3.5.3-beta (apis locked)
>>>>>>>>>>> 3.5.4-beta
>>>>>>>>>>> 3.5.5-beta
>>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
>>>> 3.4.x,
>>>>>>>>>>> maybe use it for production but we still expect things to shake
>>>> out)
>>>>>>>>>>> 3.5.7
>>>>>>>>>>> ....
>>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>>>>>>> etc...
>>>>>>>>>>>
>>>>>>>>>>> There are 8 blockers currently, are any of these something that
>>>> should
>>>>>>>>>>> hold up 3.5.0-alpha?
>>>>>>>>>>>
>>>>>>>>>>> I'll hold open the discussion for a couple days. If folks find
>>>> this a
>>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
>>>>>>>>>>>
>>>>>>>>>>> Patrick
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Flavio Junqueira <fp...@yahoo.com.INVALID>.
My take:

- ZK-1863 is pending review. It is a blocker and it can go in. See the jira for comments.
- We can try to have ZK-1807 in for the first alpha.
- I'd rather not have the first alpha depending on ZK-1919 and ZK-1910, we can leave it for the second alpha.

If you agree with this, then we should be able to cut a candidate by the end of this week.

-Flavio

On 15 Jul 2014, at 17:26, Patrick Hunt <ph...@apache.org> wrote:

> Per my previous note you can now see the c client test log output here
> in the "build artifacts" section:
> https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/
> 
> Patrick
> 
> On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org> wrote:
>> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
>> one(s?) is new?)
>> 
>> Looks like the autoconf issue I reported is hitting the upgraded
>> apache jenkins instances as well. I've updated the "archive" list to
>> include the c tests stdout redirect. So while it won't go to console
>> at least we can debug when there is a failure.
>> 
>> Raul has been helping Bill with reviews for the jetty server support
>> and it looks like that should be ready soon.
>> 
>> Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
>> Update the C implementation of removeWatches to have it match
>> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
>> 
>> Hongchao got a patch in to cleanup the flakey c client reconfig test -
>> kudos on helping cleanup the build/test infra!
>> 
>> 
>> Based on previous comments it looks like we're pretty close. Do folks
>> feel comfortable with a 3.5.0 alpha at this point? (with a few pending
>> as above)
>> 
>> Patrick
>> 
>> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
>> <rg...@itevenworks.net> wrote:
>>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira" <fp...@yahoo.com.invalid>
>>> wrote:
>>>> 
>>>> Just so that we don´t delay too much, what if we release an alpha version
>>> without 1863 and 1807, and do another one in 2-3 weeks time?
>>>> 
>>> 
>>> +1
>>> 
>>> -rgs
>>> 
>>>> -Flavio
>>>> 
>>>> 
>>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
>>> rgs@itevenworks.net> wrote:
>>>> 
>>>> 
>>>>> 
>>>>> 
>>>>> On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>>>>> 
>>>>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
>>>>>> 1810 is waiting on feedback from Michi, and Camille is threatening to
>>>>>> commit 1863. I see some great progress in general on the patch
>>>>>> availables queue, which is great to see.
>>>>>> 
>>>>>> So here's something else we might consider - should we drop jdk6
>>>>>> support from 3.5. It's long since EOL by Oracle but I suspect some
>>>>>> folks are still using ZK with 6. We gotta move forward though, can't
>>>>>> support it forever. Thoughts? Note that we are currently
>>>>>> building/testing trunk against jdk6, 7 and 8.
>>>>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>>>>> 
>>>>> 
>>>>> Extra eyes/review for
>>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>>>> would be appreciated (otherwise anyone using Observers with the upcoming
>>>>> alpha release will see there network usage go wild...).
>>>>> 
>>>>> 
>>>>> -rgs
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>> 
>>>>>> Patrick
>>>>>> 
>>>>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>>>>>> <fp...@yahoo.com.invalid> wrote:
>>>>>>> According to me, ZK-1810 should be in already, but I need a +1
>>> there. I
>>>>>> think Michi hasn't checked in because LETest failed in the last QA run
>>>>>> there. However, that patch doesn't affect LETest, and in fact it fails
>>> in
>>>>>> trunk intermittently, so the test failure doesn't seem to be related
>>> to the
>>>>>> patch.
>>>>>>> 
>>>>>>> I haven't checked ZK-1863, so I can't say anything concrete about it.
>>>>>>> 
>>>>>>> -Flavio
>>>>>>> 
>>>>>>> 
>>>>>>> 
>>>>>>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
>>> wrote:
>>>>>>> 
>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>>>> Hi Flavio, do you think those jiras can get reviewed/finalized before
>>>>>>>> the end of the week? I'd like to try cutting an RC soonish...
>>>>>>>> 
>>>>>>>> Patrick
>>>>>>>> 
>>>>>>>> 
>>>>>>>> On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>>>>>>>> <fp...@yahoo.com.invalid> wrote:
>>>>>>>>> +1 for the plan of releasing alpha versions.
>>>>>>>>> 
>>>>>>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
>>> patch
>>>>>> available. ZK-1870 is in trunk, but it is still open because we need a
>>> 3.4
>>>>>> patch.
>>>>>>>>> 
>>>>>>>>> -Flavio
>>>>>>>>> 
>>>>>>>>> 
>>>>>>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>>>>>>>> 
>>>>>>>>>> Hey folks, we've been talking about it for a while, a few people
>>> have
>>>>>>>>>> mentioned on the list as well as contacted me personally that they
>>>>>>>>>> would like to see some progress on the first 3.5 release. Every
>>>>>>>>>> release is a compromise, if we wait for perfection we'll never get
>>>>>>>>>> anything out the door. 3.5 has tons of great new features, lots of
>>>>>>>>>> hard work, let's get it out in a release so that folks can use it,
>>>>>>>>>> test it, and give feedback.
>>>>>>>>>> 
>>>>>>>>>> Jenkins jobs have been pretty stable except for the known flakey
>>> test
>>>>>>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>>>>>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>>>>>>>>> 
>>>>>>>>>> Here's my thinking again on how we should plan our releases:
>>>>>>>>>> 
>>>>>>>>>> I don't think we'll be able to do a 3.5.x-stable for some time.
>>> What I
>>>>>>>>>> think we should do instead is similar to what we did for 3.4.
>>> (this is
>>>>>>>>>> also similar to what Hadoop did during their Hadoop 2 release
>>> cycle)
>>>>>>>>>> Start with a series of alpha releases, something people can run
>>> and
>>>>>>>>>> test with, once we address all the blockers and feel comfortable
>>> with
>>>>>>>>>> the apis & remaining jiras we then switch to beta. Once we get
>>> some
>>>>>>>>>> good feedback we remove the alpha/beta moniker and look at making
>>> it
>>>>>>>>>> "stable'. At some later point it will become the "current/stable"
>>>>>>>>>> release, taking over from 3.4.x.
>>>>>>>>>> 
>>>>>>>>>> e.g.
>>>>>>>>>> 3.5.0-alpha (8 blockers)
>>>>>>>>>> 3.5.1-alpha (3 blockers)
>>>>>>>>>> 3.5.2-alpha (0 blockers)
>>>>>>>>>> 3.5.3-beta (apis locked)
>>>>>>>>>> 3.5.4-beta
>>>>>>>>>> 3.5.5-beta
>>>>>>>>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
>>> 3.4.x,
>>>>>>>>>> maybe use it for production but we still expect things to shake
>>> out)
>>>>>>>>>> 3.5.7
>>>>>>>>>> ....
>>>>>>>>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>>>>>> etc...
>>>>>>>>>> 
>>>>>>>>>> There are 8 blockers currently, are any of these something that
>>> should
>>>>>>>>>> hold up 3.5.0-alpha?
>>>>>>>>>> 
>>>>>>>>>> I'll hold open the discussion for a couple days. If folks find
>>> this a
>>>>>>>>>> reasonable plan I'll start the ball rolling to cut an RC.
>>>>>>>>>> 
>>>>>>>>>> Patrick
>>>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>>>> 
>>>>>> 
>>>>> 
>>>>> 
>>>>> 


Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Per my previous note you can now see the c client test log output here
in the "build artifacts" section:
https://builds.apache.org/view/S-Z/view/ZooKeeper/job/ZooKeeper-trunk/2372/

Patrick

On Mon, Jul 14, 2014 at 7:36 PM, Patrick Hunt <ph...@apache.org> wrote:
> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
> one(s?) is new?)
>
> Looks like the autoconf issue I reported is hitting the upgraded
> apache jenkins instances as well. I've updated the "archive" list to
> include the c tests stdout redirect. So while it won't go to console
> at least we can debug when there is a failure.
>
> Raul has been helping Bill with reviews for the jetty server support
> and it looks like that should be ready soon.
>
> Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
> Update the C implementation of removeWatches to have it match
> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
>
> Hongchao got a patch in to cleanup the flakey c client reconfig test -
> kudos on helping cleanup the build/test infra!
>
>
> Based on previous comments it looks like we're pretty close. Do folks
> feel comfortable with a 3.5.0 alpha at this point? (with a few pending
> as above)
>
> Patrick
>
> On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
> <rg...@itevenworks.net> wrote:
>> On Jul 11, 2014 6:37 AM, "Flavio Junqueira" <fp...@yahoo.com.invalid>
>> wrote:
>>>
>>> Just so that we don´t delay too much, what if we release an alpha version
>> without 1863 and 1807, and do another one in 2-3 weeks time?
>>>
>>
>> +1
>>
>> -rgs
>>
>>> -Flavio
>>>
>>>
>>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
>> rgs@itevenworks.net> wrote:
>>>
>>>
>>> >
>>> >
>>> >On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>>> >
>>> >> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
>>> >> 1810 is waiting on feedback from Michi, and Camille is threatening to
>>> >> commit 1863. I see some great progress in general on the patch
>>> >> availables queue, which is great to see.
>>> >>
>>> >> So here's something else we might consider - should we drop jdk6
>>> >> support from 3.5. It's long since EOL by Oracle but I suspect some
>>> >> folks are still using ZK with 6. We gotta move forward though, can't
>>> >> support it forever. Thoughts? Note that we are currently
>>> >> building/testing trunk against jdk6, 7 and 8.
>>> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>> >>
>>> >
>>> >Extra eyes/review for
>> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>> >would be appreciated (otherwise anyone using Observers with the upcoming
>>> >alpha release will see there network usage go wild...).
>>> >
>>> >
>>> >-rgs
>>> >
>>> >
>>> >
>>> >
>>> >
>>> >> Patrick
>>> >>
>>> >> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>>> >> <fp...@yahoo.com.invalid> wrote:
>>> >> > According to me, ZK-1810 should be in already, but I need a +1
>> there. I
>>> >> think Michi hasn't checked in because LETest failed in the last QA run
>>> >> there. However, that patch doesn't affect LETest, and in fact it fails
>> in
>>> >> trunk intermittently, so the test failure doesn't seem to be related
>> to the
>>> >> patch.
>>> >> >
>>> >> > I haven't checked ZK-1863, so I can't say anything concrete about it.
>>> >> >
>>> >> > -Flavio
>>> >> >
>>> >> >
>>> >> >
>>> >> > On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
>> wrote:
>>> >> >
>>> >> >
>>> >> >>
>>> >> >>
>>> >> >>Hi Flavio, do you think those jiras can get reviewed/finalized before
>>> >> >>the end of the week? I'd like to try cutting an RC soonish...
>>> >> >>
>>> >> >>Patrick
>>> >> >>
>>> >> >>
>>> >> >>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>>> >> >><fp...@yahoo.com.invalid> wrote:
>>> >> >>> +1 for the plan of releasing alpha versions.
>>> >> >>>
>>> >> >>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
>> patch
>>> >> available. ZK-1870 is in trunk, but it is still open because we need a
>> 3.4
>>> >> patch.
>>> >> >>>
>>> >> >>> -Flavio
>>> >> >>>
>>> >> >>>
>>> >> >>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>> >> >>>
>>> >> >>>> Hey folks, we've been talking about it for a while, a few people
>> have
>>> >> >>>> mentioned on the list as well as contacted me personally that they
>>> >> >>>> would like to see some progress on the first 3.5 release. Every
>>> >> >>>> release is a compromise, if we wait for perfection we'll never get
>>> >> >>>> anything out the door. 3.5 has tons of great new features, lots of
>>> >> >>>> hard work, let's get it out in a release so that folks can use it,
>>> >> >>>> test it, and give feedback.
>>> >> >>>>
>>> >> >>>> Jenkins jobs have been pretty stable except for the known flakey
>> test
>>> >> >>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>> >> >>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>> >> >>>>
>>> >> >>>> Here's my thinking again on how we should plan our releases:
>>> >> >>>>
>>> >> >>>> I don't think we'll be able to do a 3.5.x-stable for some time.
>> What I
>>> >> >>>> think we should do instead is similar to what we did for 3.4.
>> (this is
>>> >> >>>> also similar to what Hadoop did during their Hadoop 2 release
>> cycle)
>>> >> >>>> Start with a series of alpha releases, something people can run
>> and
>>> >> >>>> test with, once we address all the blockers and feel comfortable
>> with
>>> >> >>>> the apis & remaining jiras we then switch to beta. Once we get
>> some
>>> >> >>>> good feedback we remove the alpha/beta moniker and look at making
>> it
>>> >> >>>> "stable'. At some later point it will become the "current/stable"
>>> >> >>>> release, taking over from 3.4.x.
>>> >> >>>>
>>> >> >>>> e.g.
>>> >> >>>> 3.5.0-alpha (8 blockers)
>>> >> >>>> 3.5.1-alpha (3 blockers)
>>> >> >>>> 3.5.2-alpha (0 blockers)
>>> >> >>>> 3.5.3-beta (apis locked)
>>> >> >>>> 3.5.4-beta
>>> >> >>>> 3.5.5-beta
>>> >> >>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
>> 3.4.x,
>>> >> >>>> maybe use it for production but we still expect things to shake
>> out)
>>> >> >>>> 3.5.7
>>> >> >>>> ....
>>> >> >>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>>> >> etc...
>>> >> >>>>
>>> >> >>>> There are 8 blockers currently, are any of these something that
>> should
>>> >> >>>> hold up 3.5.0-alpha?
>>> >> >>>>
>>> >> >>>> I'll hold open the discussion for a couple days. If folks find
>> this a
>>> >> >>>> reasonable plan I'll start the ball rolling to cut an RC.
>>> >> >>>>
>>> >> >>>> Patrick
>>> >> >>>
>>> >> >>
>>> >> >>
>>> >> >>
>>> >>
>>> >
>>> >
>>> >

Re: ZooKeeper 3.5.0-alpha planning

Posted by Alexander Shraer <sh...@gmail.com>.
Hi Guys,

Sorry for not making more progress on ZK-1807.  I support Raul's proposal -
how about getting the last patch
in - the code change basically reverts to what was there before ZK-107
changes, so solves the observer spamming. Besides that it adds two more
tests for reconfiguration. We can keep the JIRA open.

Alex




On Mon, Jul 14, 2014 at 8:00 PM, Raúl Gutiérrez Segalés <rgs@itevenworks.net
> wrote:

> On 14 July 2014 19:36, Patrick Hunt <ph...@apache.org> wrote:
>
> > Update: we're back to 8 blockers on 3.5.0 (not clear to me which
> > one(s?) is new?)
> >
> > Looks like the autoconf issue I reported is hitting the upgraded
> > apache jenkins instances as well. I've updated the "archive" list to
> > include the c tests stdout redirect. So while it won't go to console
> > at least we can debug when there is a failure.
> >
> > Raul has been helping Bill with reviews for the jetty server support
> > and it looks like that should be ready soon.
> >
> > Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
> > Update the C implementation of removeWatches to have it match
> > ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
> >
> > Hongchao got a patch in to cleanup the flakey c client reconfig test -
> > kudos on helping cleanup the build/test infra!
> >
> >
> > Based on previous comments it looks like we're pretty close. Do folks
> > feel comfortable with a 3.5.0 alpha at this point? (with a few pending
> > as above)
> >
>
> +1, though it would be really nice to have the proposed patch for
> ZOOKEEPER-1807 merged since otherwise I'll need to apply patches to the
> alpha release to be able to test it in my environment (with many
> observers).
>
>
> Cheers,
> -rgs
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 14 July 2014 19:36, Patrick Hunt <ph...@apache.org> wrote:

> Update: we're back to 8 blockers on 3.5.0 (not clear to me which
> one(s?) is new?)
>
> Looks like the autoconf issue I reported is hitting the upgraded
> apache jenkins instances as well. I've updated the "archive" list to
> include the c tests stdout redirect. So while it won't go to console
> at least we can debug when there is a failure.
>
> Raul has been helping Bill with reviews for the jetty server support
> and it looks like that should be ready soon.
>
> Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
> Update the C implementation of removeWatches to have it match
> ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?
>
> Hongchao got a patch in to cleanup the flakey c client reconfig test -
> kudos on helping cleanup the build/test infra!
>
>
> Based on previous comments it looks like we're pretty close. Do folks
> feel comfortable with a 3.5.0 alpha at this point? (with a few pending
> as above)
>

+1, though it would be really nice to have the proposed patch for
ZOOKEEPER-1807 merged since otherwise I'll need to apply patches to the
alpha release to be able to test it in my environment (with many observers).


Cheers,
-rgs

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Update: we're back to 8 blockers on 3.5.0 (not clear to me which
one(s?) is new?)

Looks like the autoconf issue I reported is hitting the upgraded
apache jenkins instances as well. I've updated the "archive" list to
include the c tests stdout redirect. So while it won't go to console
at least we can debug when there is a failure.

Raul has been helping Bill with reviews for the jetty server support
and it looks like that should be ready soon.

Raul also requested that someone prioritize reviewing "ZOOKEEPER-1919
Update the C implementation of removeWatches to have it match
ZOOKEEPER-1910" so that we can include it in 3.5.0. Flavio/Michi?

Hongchao got a patch in to cleanup the flakey c client reconfig test -
kudos on helping cleanup the build/test infra!


Based on previous comments it looks like we're pretty close. Do folks
feel comfortable with a 3.5.0 alpha at this point? (with a few pending
as above)

Patrick

On Fri, Jul 11, 2014 at 9:24 AM, Raúl Gutiérrez Segalés
<rg...@itevenworks.net> wrote:
> On Jul 11, 2014 6:37 AM, "Flavio Junqueira" <fp...@yahoo.com.invalid>
> wrote:
>>
>> Just so that we don´t delay too much, what if we release an alpha version
> without 1863 and 1807, and do another one in 2-3 weeks time?
>>
>
> +1
>
> -rgs
>
>> -Flavio
>>
>>
>> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
> rgs@itevenworks.net> wrote:
>>
>>
>> >
>> >
>> >On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>> >
>> >> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
>> >> 1810 is waiting on feedback from Michi, and Camille is threatening to
>> >> commit 1863. I see some great progress in general on the patch
>> >> availables queue, which is great to see.
>> >>
>> >> So here's something else we might consider - should we drop jdk6
>> >> support from 3.5. It's long since EOL by Oracle but I suspect some
>> >> folks are still using ZK with 6. We gotta move forward though, can't
>> >> support it forever. Thoughts? Note that we are currently
>> >> building/testing trunk against jdk6, 7 and 8.
>> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>> >>
>> >
>> >Extra eyes/review for
> https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>> >would be appreciated (otherwise anyone using Observers with the upcoming
>> >alpha release will see there network usage go wild...).
>> >
>> >
>> >-rgs
>> >
>> >
>> >
>> >
>> >
>> >> Patrick
>> >>
>> >> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> >> <fp...@yahoo.com.invalid> wrote:
>> >> > According to me, ZK-1810 should be in already, but I need a +1
> there. I
>> >> think Michi hasn't checked in because LETest failed in the last QA run
>> >> there. However, that patch doesn't affect LETest, and in fact it fails
> in
>> >> trunk intermittently, so the test failure doesn't seem to be related
> to the
>> >> patch.
>> >> >
>> >> > I haven't checked ZK-1863, so I can't say anything concrete about it.
>> >> >
>> >> > -Flavio
>> >> >
>> >> >
>> >> >
>> >> > On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
> wrote:
>> >> >
>> >> >
>> >> >>
>> >> >>
>> >> >>Hi Flavio, do you think those jiras can get reviewed/finalized before
>> >> >>the end of the week? I'd like to try cutting an RC soonish...
>> >> >>
>> >> >>Patrick
>> >> >>
>> >> >>
>> >> >>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>> >> >><fp...@yahoo.com.invalid> wrote:
>> >> >>> +1 for the plan of releasing alpha versions.
>> >> >>>
>> >> >>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
> patch
>> >> available. ZK-1870 is in trunk, but it is still open because we need a
> 3.4
>> >> patch.
>> >> >>>
>> >> >>> -Flavio
>> >> >>>
>> >> >>>
>> >> >>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>> >> >>>
>> >> >>>> Hey folks, we've been talking about it for a while, a few people
> have
>> >> >>>> mentioned on the list as well as contacted me personally that they
>> >> >>>> would like to see some progress on the first 3.5 release. Every
>> >> >>>> release is a compromise, if we wait for perfection we'll never get
>> >> >>>> anything out the door. 3.5 has tons of great new features, lots of
>> >> >>>> hard work, let's get it out in a release so that folks can use it,
>> >> >>>> test it, and give feedback.
>> >> >>>>
>> >> >>>> Jenkins jobs have been pretty stable except for the known flakey
> test
>> >> >>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>> >> >>>> jenkins has also been verifying the code on jdk7 and jdk8.
>> >> >>>>
>> >> >>>> Here's my thinking again on how we should plan our releases:
>> >> >>>>
>> >> >>>> I don't think we'll be able to do a 3.5.x-stable for some time.
> What I
>> >> >>>> think we should do instead is similar to what we did for 3.4.
> (this is
>> >> >>>> also similar to what Hadoop did during their Hadoop 2 release
> cycle)
>> >> >>>> Start with a series of alpha releases, something people can run
> and
>> >> >>>> test with, once we address all the blockers and feel comfortable
> with
>> >> >>>> the apis & remaining jiras we then switch to beta. Once we get
> some
>> >> >>>> good feedback we remove the alpha/beta moniker and look at making
> it
>> >> >>>> "stable'. At some later point it will become the "current/stable"
>> >> >>>> release, taking over from 3.4.x.
>> >> >>>>
>> >> >>>> e.g.
>> >> >>>> 3.5.0-alpha (8 blockers)
>> >> >>>> 3.5.1-alpha (3 blockers)
>> >> >>>> 3.5.2-alpha (0 blockers)
>> >> >>>> 3.5.3-beta (apis locked)
>> >> >>>> 3.5.4-beta
>> >> >>>> 3.5.5-beta
>> >> >>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
> 3.4.x,
>> >> >>>> maybe use it for production but we still expect things to shake
> out)
>> >> >>>> 3.5.7
>> >> >>>> ....
>> >> >>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>> >> etc...
>> >> >>>>
>> >> >>>> There are 8 blockers currently, are any of these something that
> should
>> >> >>>> hold up 3.5.0-alpha?
>> >> >>>>
>> >> >>>> I'll hold open the discussion for a couple days. If folks find
> this a
>> >> >>>> reasonable plan I'll start the ball rolling to cut an RC.
>> >> >>>>
>> >> >>>> Patrick
>> >> >>>
>> >> >>
>> >> >>
>> >> >>
>> >>
>> >
>> >
>> >

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On Jul 11, 2014 6:37 AM, "Flavio Junqueira" <fp...@yahoo.com.invalid>
wrote:
>
> Just so that we don´t delay too much, what if we release an alpha version
without 1863 and 1807, and do another one in 2-3 weeks time?
>

+1

-rgs

> -Flavio
>
>
> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <
rgs@itevenworks.net> wrote:
>
>
> >
> >
> >On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
> >
> >> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
> >> 1810 is waiting on feedback from Michi, and Camille is threatening to
> >> commit 1863. I see some great progress in general on the patch
> >> availables queue, which is great to see.
> >>
> >> So here's something else we might consider - should we drop jdk6
> >> support from 3.5. It's long since EOL by Oracle but I suspect some
> >> folks are still using ZK with 6. We gotta move forward though, can't
> >> support it forever. Thoughts? Note that we are currently
> >> building/testing trunk against jdk6, 7 and 8.
> >> https://builds.apache.org/view/S-Z/view/ZooKeeper/
> >>
> >
> >Extra eyes/review for
https://issues.apache.org/jira/browse/ZOOKEEPER-1807
> >would be appreciated (otherwise anyone using Observers with the upcoming
> >alpha release will see there network usage go wild...).
> >
> >
> >-rgs
> >
> >
> >
> >
> >
> >> Patrick
> >>
> >> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> >> <fp...@yahoo.com.invalid> wrote:
> >> > According to me, ZK-1810 should be in already, but I need a +1
there. I
> >> think Michi hasn't checked in because LETest failed in the last QA run
> >> there. However, that patch doesn't affect LETest, and in fact it fails
in
> >> trunk intermittently, so the test failure doesn't seem to be related
to the
> >> patch.
> >> >
> >> > I haven't checked ZK-1863, so I can't say anything concrete about it.
> >> >
> >> > -Flavio
> >> >
> >> >
> >> >
> >> > On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org>
wrote:
> >> >
> >> >
> >> >>
> >> >>
> >> >>Hi Flavio, do you think those jiras can get reviewed/finalized before
> >> >>the end of the week? I'd like to try cutting an RC soonish...
> >> >>
> >> >>Patrick
> >> >>
> >> >>
> >> >>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> >> >><fp...@yahoo.com.invalid> wrote:
> >> >>> +1 for the plan of releasing alpha versions.
> >> >>>
> >> >>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both
patch
> >> available. ZK-1870 is in trunk, but it is still open because we need a
3.4
> >> patch.
> >> >>>
> >> >>> -Flavio
> >> >>>
> >> >>>
> >> >>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
> >> >>>
> >> >>>> Hey folks, we've been talking about it for a while, a few people
have
> >> >>>> mentioned on the list as well as contacted me personally that they
> >> >>>> would like to see some progress on the first 3.5 release. Every
> >> >>>> release is a compromise, if we wait for perfection we'll never get
> >> >>>> anything out the door. 3.5 has tons of great new features, lots of
> >> >>>> hard work, let's get it out in a release so that folks can use it,
> >> >>>> test it, and give feedback.
> >> >>>>
> >> >>>> Jenkins jobs have been pretty stable except for the known flakey
test
> >> >>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
> >> >>>> jenkins has also been verifying the code on jdk7 and jdk8.
> >> >>>>
> >> >>>> Here's my thinking again on how we should plan our releases:
> >> >>>>
> >> >>>> I don't think we'll be able to do a 3.5.x-stable for some time.
What I
> >> >>>> think we should do instead is similar to what we did for 3.4.
(this is
> >> >>>> also similar to what Hadoop did during their Hadoop 2 release
cycle)
> >> >>>> Start with a series of alpha releases, something people can run
and
> >> >>>> test with, once we address all the blockers and feel comfortable
with
> >> >>>> the apis & remaining jiras we then switch to beta. Once we get
some
> >> >>>> good feedback we remove the alpha/beta moniker and look at making
it
> >> >>>> "stable'. At some later point it will become the "current/stable"
> >> >>>> release, taking over from 3.4.x.
> >> >>>>
> >> >>>> e.g.
> >> >>>> 3.5.0-alpha (8 blockers)
> >> >>>> 3.5.1-alpha (3 blockers)
> >> >>>> 3.5.2-alpha (0 blockers)
> >> >>>> 3.5.3-beta (apis locked)
> >> >>>> 3.5.4-beta
> >> >>>> 3.5.5-beta
> >> >>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs
3.4.x,
> >> >>>> maybe use it for production but we still expect things to shake
out)
> >> >>>> 3.5.7
> >> >>>> ....
> >> >>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
> >> etc...
> >> >>>>
> >> >>>> There are 8 blockers currently, are any of these something that
should
> >> >>>> hold up 3.5.0-alpha?
> >> >>>>
> >> >>>> I'll hold open the discussion for a couple days. If folks find
this a
> >> >>>> reasonable plan I'll start the ball rolling to cut an RC.
> >> >>>>
> >> >>>> Patrick
> >> >>>
> >> >>
> >> >>
> >> >>
> >>
> >
> >
> >

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
I think that would be fine. It would also allow other people to more
easily try out the release and provide additional feedback.

Patrick

On Fri, Jul 11, 2014 at 6:37 AM, Flavio Junqueira
<fp...@yahoo.com.invalid> wrote:
> Just so that we don´t delay too much, what if we release an alpha version without 1863 and 1807, and do another one in 2-3 weeks time?
>
> -Flavio
>
>
> On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <rg...@itevenworks.net> wrote:
>
>
>>
>>
>>On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
>>> 1810 is waiting on feedback from Michi, and Camille is threatening to
>>> commit 1863. I see some great progress in general on the patch
>>> availables queue, which is great to see.
>>>
>>> So here's something else we might consider - should we drop jdk6
>>> support from 3.5. It's long since EOL by Oracle but I suspect some
>>> folks are still using ZK with 6. We gotta move forward though, can't
>>> support it forever. Thoughts? Note that we are currently
>>> building/testing trunk against jdk6, 7 and 8.
>>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>>
>>
>>Extra eyes/review for https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>>would be appreciated (otherwise anyone using Observers with the upcoming
>>alpha release will see there network usage go wild...).
>>
>>
>>-rgs
>>
>>
>>
>>
>>
>>> Patrick
>>>
>>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>>> <fp...@yahoo.com.invalid> wrote:
>>> > According to me, ZK-1810 should be in already, but I need a +1 there. I
>>> think Michi hasn't checked in because LETest failed in the last QA run
>>> there. However, that patch doesn't affect LETest, and in fact it fails in
>>> trunk intermittently, so the test failure doesn't seem to be related to the
>>> patch.
>>> >
>>> > I haven't checked ZK-1863, so I can't say anything concrete about it.
>>> >
>>> > -Flavio
>>> >
>>> >
>>> >
>>> > On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>>> >
>>> >
>>> >>
>>> >>
>>> >>Hi Flavio, do you think those jiras can get reviewed/finalized before
>>> >>the end of the week? I'd like to try cutting an RC soonish...
>>> >>
>>> >>Patrick
>>> >>
>>> >>
>>> >>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>>> >><fp...@yahoo.com.invalid> wrote:
>>> >>> +1 for the plan of releasing alpha versions.
>>> >>>
>>> >>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch
>>> available. ZK-1870 is in trunk, but it is still open because we need a 3.4
>>> patch.
>>> >>>
>>> >>> -Flavio
>>> >>>
>>> >>>
>>> >>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>> >>>
>>> >>>> Hey folks, we've been talking about it for a while, a few people have
>>> >>>> mentioned on the list as well as contacted me personally that they
>>> >>>> would like to see some progress on the first 3.5 release. Every
>>> >>>> release is a compromise, if we wait for perfection we'll never get
>>> >>>> anything out the door. 3.5 has tons of great new features, lots of
>>> >>>> hard work, let's get it out in a release so that folks can use it,
>>> >>>> test it, and give feedback.
>>> >>>>
>>> >>>> Jenkins jobs have been pretty stable except for the known flakey test
>>> >>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>> >>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>> >>>>
>>> >>>> Here's my thinking again on how we should plan our releases:
>>> >>>>
>>> >>>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>>> >>>> think we should do instead is similar to what we did for 3.4. (this is
>>> >>>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>>> >>>> Start with a series of alpha releases, something people can run and
>>> >>>> test with, once we address all the blockers and feel comfortable with
>>> >>>> the apis & remaining jiras we then switch to beta. Once we get some
>>> >>>> good feedback we remove the alpha/beta moniker and look at making it
>>> >>>> "stable'. At some later point it will become the "current/stable"
>>> >>>> release, taking over from 3.4.x.
>>> >>>>
>>> >>>> e.g.
>>> >>>> 3.5.0-alpha (8 blockers)
>>> >>>> 3.5.1-alpha (3 blockers)
>>> >>>> 3.5.2-alpha (0 blockers)
>>> >>>> 3.5.3-beta (apis locked)
>>> >>>> 3.5.4-beta
>>> >>>> 3.5.5-beta
>>> >>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>>> >>>> maybe use it for production but we still expect things to shake out)
>>> >>>> 3.5.7
>>> >>>> ....
>>> >>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>>> etc...
>>> >>>>
>>> >>>> There are 8 blockers currently, are any of these something that should
>>> >>>> hold up 3.5.0-alpha?
>>> >>>>
>>> >>>> I'll hold open the discussion for a couple days. If folks find this a
>>> >>>> reasonable plan I'll start the ball rolling to cut an RC.
>>> >>>>
>>> >>>> Patrick
>>> >>>
>>> >>
>>> >>
>>> >>
>>>
>>
>>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Flavio Junqueira <fp...@yahoo.com.INVALID>.
Just so that we don´t delay too much, what if we release an alpha version without 1863 and 1807, and do another one in 2-3 weeks time?

-Flavio


On Thursday, July 3, 2014 6:12 AM, Raúl Gutiérrez Segalés <rg...@itevenworks.net> wrote:
 

>
>
>On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:
>
>> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
>> 1810 is waiting on feedback from Michi, and Camille is threatening to
>> commit 1863. I see some great progress in general on the patch
>> availables queue, which is great to see.
>>
>> So here's something else we might consider - should we drop jdk6
>> support from 3.5. It's long since EOL by Oracle but I suspect some
>> folks are still using ZK with 6. We gotta move forward though, can't
>> support it forever. Thoughts? Note that we are currently
>> building/testing trunk against jdk6, 7 and 8.
>> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>>
>
>Extra eyes/review for https://issues.apache.org/jira/browse/ZOOKEEPER-1807
>would be appreciated (otherwise anyone using Observers with the upcoming
>alpha release will see there network usage go wild...).
>
>
>-rgs
>
>
>
>
>
>> Patrick
>>
>> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
>> <fp...@yahoo.com.invalid> wrote:
>> > According to me, ZK-1810 should be in already, but I need a +1 there. I
>> think Michi hasn't checked in because LETest failed in the last QA run
>> there. However, that patch doesn't affect LETest, and in fact it fails in
>> trunk intermittently, so the test failure doesn't seem to be related to the
>> patch.
>> >
>> > I haven't checked ZK-1863, so I can't say anything concrete about it.
>> >
>> > -Flavio
>> >
>> >
>> >
>> > On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>> >
>> >
>> >>
>> >>
>> >>Hi Flavio, do you think those jiras can get reviewed/finalized before
>> >>the end of the week? I'd like to try cutting an RC soonish...
>> >>
>> >>Patrick
>> >>
>> >>
>> >>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>> >><fp...@yahoo.com.invalid> wrote:
>> >>> +1 for the plan of releasing alpha versions.
>> >>>
>> >>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch
>> available. ZK-1870 is in trunk, but it is still open because we need a 3.4
>> patch.
>> >>>
>> >>> -Flavio
>> >>>
>> >>>
>> >>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>> >>>
>> >>>> Hey folks, we've been talking about it for a while, a few people have
>> >>>> mentioned on the list as well as contacted me personally that they
>> >>>> would like to see some progress on the first 3.5 release. Every
>> >>>> release is a compromise, if we wait for perfection we'll never get
>> >>>> anything out the door. 3.5 has tons of great new features, lots of
>> >>>> hard work, let's get it out in a release so that folks can use it,
>> >>>> test it, and give feedback.
>> >>>>
>> >>>> Jenkins jobs have been pretty stable except for the known flakey test
>> >>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>> >>>> jenkins has also been verifying the code on jdk7 and jdk8.
>> >>>>
>> >>>> Here's my thinking again on how we should plan our releases:
>> >>>>
>> >>>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>> >>>> think we should do instead is similar to what we did for 3.4. (this is
>> >>>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>> >>>> Start with a series of alpha releases, something people can run and
>> >>>> test with, once we address all the blockers and feel comfortable with
>> >>>> the apis & remaining jiras we then switch to beta. Once we get some
>> >>>> good feedback we remove the alpha/beta moniker and look at making it
>> >>>> "stable'. At some later point it will become the "current/stable"
>> >>>> release, taking over from 3.4.x.
>> >>>>
>> >>>> e.g.
>> >>>> 3.5.0-alpha (8 blockers)
>> >>>> 3.5.1-alpha (3 blockers)
>> >>>> 3.5.2-alpha (0 blockers)
>> >>>> 3.5.3-beta (apis locked)
>> >>>> 3.5.4-beta
>> >>>> 3.5.5-beta
>> >>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>> >>>> maybe use it for production but we still expect things to shake out)
>> >>>> 3.5.7
>> >>>> ....
>> >>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
>> etc...
>> >>>>
>> >>>> There are 8 blockers currently, are any of these something that should
>> >>>> hold up 3.5.0-alpha?
>> >>>>
>> >>>> I'll hold open the discussion for a couple days. If folks find this a
>> >>>> reasonable plan I'll start the ball rolling to cut an RC.
>> >>>>
>> >>>> Patrick
>> >>>
>> >>
>> >>
>> >>
>>
>
>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 2 July 2014 21:19, Patrick Hunt <ph...@apache.org> wrote:

> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
> 1810 is waiting on feedback from Michi, and Camille is threatening to
> commit 1863. I see some great progress in general on the patch
> availables queue, which is great to see.
>
> So here's something else we might consider - should we drop jdk6
> support from 3.5. It's long since EOL by Oracle but I suspect some
> folks are still using ZK with 6. We gotta move forward though, can't
> support it forever. Thoughts? Note that we are currently
> building/testing trunk against jdk6, 7 and 8.
> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>

Extra eyes/review for https://issues.apache.org/jira/browse/ZOOKEEPER-1807
would be appreciated (otherwise anyone using Observers with the upcoming
alpha release will see there network usage go wild...).


-rgs




> Patrick
>
> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> <fp...@yahoo.com.invalid> wrote:
> > According to me, ZK-1810 should be in already, but I need a +1 there. I
> think Michi hasn't checked in because LETest failed in the last QA run
> there. However, that patch doesn't affect LETest, and in fact it fails in
> trunk intermittently, so the test failure doesn't seem to be related to the
> patch.
> >
> > I haven't checked ZK-1863, so I can't say anything concrete about it.
> >
> > -Flavio
> >
> >
> >
> > On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org> wrote:
> >
> >
> >>
> >>
> >>Hi Flavio, do you think those jiras can get reviewed/finalized before
> >>the end of the week? I'd like to try cutting an RC soonish...
> >>
> >>Patrick
> >>
> >>
> >>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
> >><fp...@yahoo.com.invalid> wrote:
> >>> +1 for the plan of releasing alpha versions.
> >>>
> >>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch
> available. ZK-1870 is in trunk, but it is still open because we need a 3.4
> patch.
> >>>
> >>> -Flavio
> >>>
> >>>
> >>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
> >>>
> >>>> Hey folks, we've been talking about it for a while, a few people have
> >>>> mentioned on the list as well as contacted me personally that they
> >>>> would like to see some progress on the first 3.5 release. Every
> >>>> release is a compromise, if we wait for perfection we'll never get
> >>>> anything out the door. 3.5 has tons of great new features, lots of
> >>>> hard work, let's get it out in a release so that folks can use it,
> >>>> test it, and give feedback.
> >>>>
> >>>> Jenkins jobs have been pretty stable except for the known flakey test
> >>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
> >>>> jenkins has also been verifying the code on jdk7 and jdk8.
> >>>>
> >>>> Here's my thinking again on how we should plan our releases:
> >>>>
> >>>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
> >>>> think we should do instead is similar to what we did for 3.4. (this is
> >>>> also similar to what Hadoop did during their Hadoop 2 release cycle)
> >>>> Start with a series of alpha releases, something people can run and
> >>>> test with, once we address all the blockers and feel comfortable with
> >>>> the apis & remaining jiras we then switch to beta. Once we get some
> >>>> good feedback we remove the alpha/beta moniker and look at making it
> >>>> "stable'. At some later point it will become the "current/stable"
> >>>> release, taking over from 3.4.x.
> >>>>
> >>>> e.g.
> >>>> 3.5.0-alpha (8 blockers)
> >>>> 3.5.1-alpha (3 blockers)
> >>>> 3.5.2-alpha (0 blockers)
> >>>> 3.5.3-beta (apis locked)
> >>>> 3.5.4-beta
> >>>> 3.5.5-beta
> >>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
> >>>> maybe use it for production but we still expect things to shake out)
> >>>> 3.5.7
> >>>> ....
> >>>> 3.5.x - ready to replace 3.4 releases for production use, stable,
> etc...
> >>>>
> >>>> There are 8 blockers currently, are any of these something that should
> >>>> hold up 3.5.0-alpha?
> >>>>
> >>>> I'll hold open the discussion for a couple days. If folks find this a
> >>>> reasonable plan I'll start the ball rolling to cut an RC.
> >>>>
> >>>> Patrick
> >>>
> >>
> >>
> >>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Michi Mutsuzaki <mi...@cs.stanford.edu>.
I just canceled the patch for 1810 because it doesn't apply anymore.
It's good to go once the patch is rebased.

On Wed, Jul 2, 2014 at 9:19 PM, Patrick Hunt <ph...@apache.org> wrote:
> Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
> 1810 is waiting on feedback from Michi, and Camille is threatening to
> commit 1863. I see some great progress in general on the patch
> availables queue, which is great to see.
>
> So here's something else we might consider - should we drop jdk6
> support from 3.5. It's long since EOL by Oracle but I suspect some
> folks are still using ZK with 6. We gotta move forward though, can't
> support it forever. Thoughts? Note that we are currently
> building/testing trunk against jdk6, 7 and 8.
> https://builds.apache.org/view/S-Z/view/ZooKeeper/
>
> Patrick
>
> On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
> <fp...@yahoo.com.invalid> wrote:
>> According to me, ZK-1810 should be in already, but I need a +1 there. I think Michi hasn't checked in because LETest failed in the last QA run there. However, that patch doesn't affect LETest, and in fact it fails in trunk intermittently, so the test failure doesn't seem to be related to the patch.
>>
>> I haven't checked ZK-1863, so I can't say anything concrete about it.
>>
>> -Flavio
>>
>>
>>
>> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>>
>>
>>>
>>>
>>>Hi Flavio, do you think those jiras can get reviewed/finalized before
>>>the end of the week? I'd like to try cutting an RC soonish...
>>>
>>>Patrick
>>>
>>>
>>>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>>><fp...@yahoo.com.invalid> wrote:
>>>> +1 for the plan of releasing alpha versions.
>>>>
>>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch available. ZK-1870 is in trunk, but it is still open because we need a 3.4 patch.
>>>>
>>>> -Flavio
>>>>
>>>>
>>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>>>
>>>>> Hey folks, we've been talking about it for a while, a few people have
>>>>> mentioned on the list as well as contacted me personally that they
>>>>> would like to see some progress on the first 3.5 release. Every
>>>>> release is a compromise, if we wait for perfection we'll never get
>>>>> anything out the door. 3.5 has tons of great new features, lots of
>>>>> hard work, let's get it out in a release so that folks can use it,
>>>>> test it, and give feedback.
>>>>>
>>>>> Jenkins jobs have been pretty stable except for the known flakey test
>>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>>>>
>>>>> Here's my thinking again on how we should plan our releases:
>>>>>
>>>>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>>>>> think we should do instead is similar to what we did for 3.4. (this is
>>>>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>>>>> Start with a series of alpha releases, something people can run and
>>>>> test with, once we address all the blockers and feel comfortable with
>>>>> the apis & remaining jiras we then switch to beta. Once we get some
>>>>> good feedback we remove the alpha/beta moniker and look at making it
>>>>> "stable'. At some later point it will become the "current/stable"
>>>>> release, taking over from 3.4.x.
>>>>>
>>>>> e.g.
>>>>> 3.5.0-alpha (8 blockers)
>>>>> 3.5.1-alpha (3 blockers)
>>>>> 3.5.2-alpha (0 blockers)
>>>>> 3.5.3-beta (apis locked)
>>>>> 3.5.4-beta
>>>>> 3.5.5-beta
>>>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>>>>> maybe use it for production but we still expect things to shake out)
>>>>> 3.5.7
>>>>> ....
>>>>> 3.5.x - ready to replace 3.4 releases for production use, stable, etc...
>>>>>
>>>>> There are 8 blockers currently, are any of these something that should
>>>>> hold up 3.5.0-alpha?
>>>>>
>>>>> I'll hold open the discussion for a couple days. If folks find this a
>>>>> reasonable plan I'll start the ball rolling to cut an RC.
>>>>>
>>>>> Patrick
>>>>
>>>
>>>
>>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Update: we're down to 7 blockers on 5.1.0 (from 8 in the last check).
1810 is waiting on feedback from Michi, and Camille is threatening to
commit 1863. I see some great progress in general on the patch
availables queue, which is great to see.

So here's something else we might consider - should we drop jdk6
support from 3.5. It's long since EOL by Oracle but I suspect some
folks are still using ZK with 6. We gotta move forward though, can't
support it forever. Thoughts? Note that we are currently
building/testing trunk against jdk6, 7 and 8.
https://builds.apache.org/view/S-Z/view/ZooKeeper/

Patrick

On Tue, Jul 1, 2014 at 2:26 AM, Flavio Junqueira
<fp...@yahoo.com.invalid> wrote:
> According to me, ZK-1810 should be in already, but I need a +1 there. I think Michi hasn't checked in because LETest failed in the last QA run there. However, that patch doesn't affect LETest, and in fact it fails in trunk intermittently, so the test failure doesn't seem to be related to the patch.
>
> I haven't checked ZK-1863, so I can't say anything concrete about it.
>
> -Flavio
>
>
>
> On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org> wrote:
>
>
>>
>>
>>Hi Flavio, do you think those jiras can get reviewed/finalized before
>>the end of the week? I'd like to try cutting an RC soonish...
>>
>>Patrick
>>
>>
>>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
>><fp...@yahoo.com.invalid> wrote:
>>> +1 for the plan of releasing alpha versions.
>>>
>>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch available. ZK-1870 is in trunk, but it is still open because we need a 3.4 patch.
>>>
>>> -Flavio
>>>
>>>
>>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>>
>>>> Hey folks, we've been talking about it for a while, a few people have
>>>> mentioned on the list as well as contacted me personally that they
>>>> would like to see some progress on the first 3.5 release. Every
>>>> release is a compromise, if we wait for perfection we'll never get
>>>> anything out the door. 3.5 has tons of great new features, lots of
>>>> hard work, let's get it out in a release so that folks can use it,
>>>> test it, and give feedback.
>>>>
>>>> Jenkins jobs have been pretty stable except for the known flakey test
>>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>>>
>>>> Here's my thinking again on how we should plan our releases:
>>>>
>>>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>>>> think we should do instead is similar to what we did for 3.4. (this is
>>>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>>>> Start with a series of alpha releases, something people can run and
>>>> test with, once we address all the blockers and feel comfortable with
>>>> the apis & remaining jiras we then switch to beta. Once we get some
>>>> good feedback we remove the alpha/beta moniker and look at making it
>>>> "stable'. At some later point it will become the "current/stable"
>>>> release, taking over from 3.4.x.
>>>>
>>>> e.g.
>>>> 3.5.0-alpha (8 blockers)
>>>> 3.5.1-alpha (3 blockers)
>>>> 3.5.2-alpha (0 blockers)
>>>> 3.5.3-beta (apis locked)
>>>> 3.5.4-beta
>>>> 3.5.5-beta
>>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>>>> maybe use it for production but we still expect things to shake out)
>>>> 3.5.7
>>>> ....
>>>> 3.5.x - ready to replace 3.4 releases for production use, stable, etc...
>>>>
>>>> There are 8 blockers currently, are any of these something that should
>>>> hold up 3.5.0-alpha?
>>>>
>>>> I'll hold open the discussion for a couple days. If folks find this a
>>>> reasonable plan I'll start the ball rolling to cut an RC.
>>>>
>>>> Patrick
>>>
>>
>>
>>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Flavio Junqueira <fp...@yahoo.com.INVALID>.
According to me, ZK-1810 should be in already, but I need a +1 there. I think Michi hasn't checked in because LETest failed in the last QA run there. However, that patch doesn't affect LETest, and in fact it fails in trunk intermittently, so the test failure doesn't seem to be related to the patch.

I haven't checked ZK-1863, so I can't say anything concrete about it.

-Flavio



On Tuesday, July 1, 2014 5:53 AM, Patrick Hunt <ph...@apache.org> wrote:
 

>
>
>Hi Flavio, do you think those jiras can get reviewed/finalized before
>the end of the week? I'd like to try cutting an RC soonish...
>
>Patrick
>
>
>On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
><fp...@yahoo.com.invalid> wrote:
>> +1 for the plan of releasing alpha versions.
>>
>> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch available. ZK-1870 is in trunk, but it is still open because we need a 3.4 patch.
>>
>> -Flavio
>>
>>
>> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>>
>>> Hey folks, we've been talking about it for a while, a few people have
>>> mentioned on the list as well as contacted me personally that they
>>> would like to see some progress on the first 3.5 release. Every
>>> release is a compromise, if we wait for perfection we'll never get
>>> anything out the door. 3.5 has tons of great new features, lots of
>>> hard work, let's get it out in a release so that folks can use it,
>>> test it, and give feedback.
>>>
>>> Jenkins jobs have been pretty stable except for the known flakey test
>>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>>> jenkins has also been verifying the code on jdk7 and jdk8.
>>>
>>> Here's my thinking again on how we should plan our releases:
>>>
>>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>>> think we should do instead is similar to what we did for 3.4. (this is
>>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>>> Start with a series of alpha releases, something people can run and
>>> test with, once we address all the blockers and feel comfortable with
>>> the apis & remaining jiras we then switch to beta. Once we get some
>>> good feedback we remove the alpha/beta moniker and look at making it
>>> "stable'. At some later point it will become the "current/stable"
>>> release, taking over from 3.4.x.
>>>
>>> e.g.
>>> 3.5.0-alpha (8 blockers)
>>> 3.5.1-alpha (3 blockers)
>>> 3.5.2-alpha (0 blockers)
>>> 3.5.3-beta (apis locked)
>>> 3.5.4-beta
>>> 3.5.5-beta
>>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>>> maybe use it for production but we still expect things to shake out)
>>> 3.5.7
>>> ....
>>> 3.5.x - ready to replace 3.4 releases for production use, stable, etc...
>>>
>>> There are 8 blockers currently, are any of these something that should
>>> hold up 3.5.0-alpha?
>>>
>>> I'll hold open the discussion for a couple days. If folks find this a
>>> reasonable plan I'll start the ball rolling to cut an RC.
>>>
>>> Patrick
>>
>
>
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Patrick Hunt <ph...@apache.org>.
Hi Flavio, do you think those jiras can get reviewed/finalized before
the end of the week? I'd like to try cutting an RC soonish...

Patrick

On Sun, Jun 29, 2014 at 5:02 AM, Flavio Junqueira
<fp...@yahoo.com.invalid> wrote:
> +1 for the plan of releasing alpha versions.
>
> I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch available. ZK-1870 is in trunk, but it is still open because we need a 3.4 patch.
>
> -Flavio
>
>
> On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:
>
>> Hey folks, we've been talking about it for a while, a few people have
>> mentioned on the list as well as contacted me personally that they
>> would like to see some progress on the first 3.5 release. Every
>> release is a compromise, if we wait for perfection we'll never get
>> anything out the door. 3.5 has tons of great new features, lots of
>> hard work, let's get it out in a release so that folks can use it,
>> test it, and give feedback.
>>
>> Jenkins jobs have been pretty stable except for the known flakey test
>> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
>> jenkins has also been verifying the code on jdk7 and jdk8.
>>
>> Here's my thinking again on how we should plan our releases:
>>
>> I don't think we'll be able to do a 3.5.x-stable for some time. What I
>> think we should do instead is similar to what we did for 3.4. (this is
>> also similar to what Hadoop did during their Hadoop 2 release cycle)
>> Start with a series of alpha releases, something people can run and
>> test with, once we address all the blockers and feel comfortable with
>> the apis & remaining jiras we then switch to beta. Once we get some
>> good feedback we remove the alpha/beta moniker and look at making it
>> "stable'. At some later point it will become the "current/stable"
>> release, taking over from 3.4.x.
>>
>> e.g.
>> 3.5.0-alpha (8 blockers)
>> 3.5.1-alpha (3 blockers)
>> 3.5.2-alpha (0 blockers)
>> 3.5.3-beta (apis locked)
>> 3.5.4-beta
>> 3.5.5-beta
>> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
>> maybe use it for production but we still expect things to shake out)
>> 3.5.7
>> ....
>> 3.5.x - ready to replace 3.4 releases for production use, stable, etc...
>>
>> There are 8 blockers currently, are any of these something that should
>> hold up 3.5.0-alpha?
>>
>> I'll hold open the discussion for a couple days. If folks find this a
>> reasonable plan I'll start the ball rolling to cut an RC.
>>
>> Patrick
>

Re: ZooKeeper 3.5.0-alpha planning

Posted by Flavio Junqueira <fp...@yahoo.com.INVALID>.
+1 for the plan of releasing alpha versions.

I'd like to have ZK-1818 (ZK-1810) and ZK-1863 in. They are both patch available. ZK-1870 is in trunk, but it is still open because we need a 3.4 patch.

-Flavio


On 26 Jun 2014, at 01:07, Patrick Hunt <ph...@apache.org> wrote:

> Hey folks, we've been talking about it for a while, a few people have
> mentioned on the list as well as contacted me personally that they
> would like to see some progress on the first 3.5 release. Every
> release is a compromise, if we wait for perfection we'll never get
> anything out the door. 3.5 has tons of great new features, lots of
> hard work, let's get it out in a release so that folks can use it,
> test it, and give feedback.
> 
> Jenkins jobs have been pretty stable except for the known flakey test
> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
> jenkins has also been verifying the code on jdk7 and jdk8.
> 
> Here's my thinking again on how we should plan our releases:
> 
> I don't think we'll be able to do a 3.5.x-stable for some time. What I
> think we should do instead is similar to what we did for 3.4. (this is
> also similar to what Hadoop did during their Hadoop 2 release cycle)
> Start with a series of alpha releases, something people can run and
> test with, once we address all the blockers and feel comfortable with
> the apis & remaining jiras we then switch to beta. Once we get some
> good feedback we remove the alpha/beta moniker and look at making it
> "stable'. At some later point it will become the "current/stable"
> release, taking over from 3.4.x.
> 
> e.g.
> 3.5.0-alpha (8 blockers)
> 3.5.1-alpha (3 blockers)
> 3.5.2-alpha (0 blockers)
> 3.5.3-beta (apis locked)
> 3.5.4-beta
> 3.5.5-beta
> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
> maybe use it for production but we still expect things to shake out)
> 3.5.7
> ....
> 3.5.x - ready to replace 3.4 releases for production use, stable, etc...
> 
> There are 8 blockers currently, are any of these something that should
> hold up 3.5.0-alpha?
> 
> I'll hold open the discussion for a couple days. If folks find this a
> reasonable plan I'll start the ball rolling to cut an RC.
> 
> Patrick


Re: ZooKeeper 3.5.0-alpha planning

Posted by Raúl Gutiérrez Segalés <rg...@itevenworks.net>.
On 25 June 2014 17:07, Patrick Hunt <ph...@apache.org> wrote:

> Hey folks, we've been talking about it for a while, a few people have
> mentioned on the list as well as contacted me personally that they
> would like to see some progress on the first 3.5 release. Every
> release is a compromise, if we wait for perfection we'll never get
> anything out the door. 3.5 has tons of great new features, lots of
> hard work, let's get it out in a release so that folks can use it,
> test it, and give feedback.
>
> Jenkins jobs have been pretty stable except for the known flakey test
> ZOOKEEPER-1870 which Flavio committed today to trunk. Note that
> jenkins has also been verifying the code on jdk7 and jdk8.
>
> Here's my thinking again on how we should plan our releases:
>
> I don't think we'll be able to do a 3.5.x-stable for some time. What I
> think we should do instead is similar to what we did for 3.4. (this is
> also similar to what Hadoop did during their Hadoop 2 release cycle)
> Start with a series of alpha releases, something people can run and
> test with, once we address all the blockers and feel comfortable with
> the apis & remaining jiras we then switch to beta. Once we get some
> good feedback we remove the alpha/beta moniker and look at making it
> "stable'. At some later point it will become the "current/stable"
> release, taking over from 3.4.x.
>
> e.g.
> 3.5.0-alpha (8 blockers)
> 3.5.1-alpha (3 blockers)
> 3.5.2-alpha (0 blockers)
> 3.5.3-beta (apis locked)
> 3.5.4-beta
> 3.5.5-beta
> 3.5.6 (no longer considered alpha/beta but also not "stable" vs 3.4.x,
> maybe use it for production but we still expect things to shake out)
> 3.5.7
> ....
> 3.5.x - ready to replace 3.4 releases for production use, stable, etc...
>
> There are 8 blockers currently, are any of these something that should
> hold up 3.5.0-alpha?
>
> I'll hold open the discussion for a couple days. If folks find this a
> reasonable plan I'll start the ball rolling to cut an RC.
>
>
sgtm, +1.


-rgs