You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@fluo.apache.org by Arvind Shyamsundar <ar...@microsoft.com.INVALID> on 2020/05/06 08:08:58 UTC

Re: [EXTERNAL] Re: JDK14 and Fluo

Incidentally I did a brief test yesterday with Muchos and jdk 14. I saw similar issues with zk client (within Hadoop zkfc) being unable to connect to zk, complaining about an unresolved address. I did find a vaguely related documented change in jdk14 for ipv6 addresses but gave up after a while. I can send details, on Wednesday.

Sent from Outlook Mobile<https://aka.ms/blhgte>

________________________________
From: Christopher <ct...@apache.org>
Sent: Wednesday, May 6, 2020 12:50:39 AM
To: fluo-dev <de...@fluo.apache.org>
Subject: [EXTERNAL] Re: JDK14 and Fluo

So, a quick follow-up to this:

The CMS flags seem to be ignored, so that's not really a problem:

OpenJDK 64-Bit Server VM warning: Ignoring option UseConcMarkSweepGC;
support was removed in 14.0
OpenJDK 64-Bit Server VM warning: Ignoring option
CMSInitiatingOccupancyFraction; support was removed in 14.0

These are just warnings about the flags being ignored.

I may be having other problems on my machine that prevent the tests
from running. I'm not sure, but it seems that Accumulo's Initialize is
failing to connect to the ZooKeeper server within the 30s timeout
period. There does not appear to be any problems with ZooKeeper
itself, and I can use telnet to send `ruok`. My machine might just be
slow... I'll have to troubleshoot later.

On Wed, May 6, 2020 at 2:52 AM Christopher <ct...@apache.org> wrote:
>
> I have switched my development environment to primarily run with JDK14
> and noticed some incompatibilities with Fluo.
>
> Some of these are now fixed in a PR: https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Ffluo%2Fpull%2F1093&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf1ca22cdd55e4513483c08d7f19235c2%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637243482584185299&amp;sdata=joEvtJp2kDgS%2F8iI9qNh%2F8NL2qVgo7V37DuP49NfWxw%3D&amp;reserved=0
>
> However, this doesn't completely fix JDK14 builds. Specifically, JDK14
> removed the Concurrent Mark Sweep garbage collector. This garbage
> collector is hard-coded in MiniAccumuloCluster 2.0.0 and earlier.
> Since accumulo2-maven-plugin uses MiniAccumuloCluster, the tests
> cannot run in Fluo on JDK14.
>
> This issue has already been addressed in upstream Accumulo
> (https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Faccumulo%2Fpull%2F1427&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf1ca22cdd55e4513483c08d7f19235c2%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637243482584185299&amp;sdata=GzzmfMgNvw135zQhKSVmara4uBu6%2BgVfHE7YtXlhSjs%3D&amp;reserved=0) for Accumulo 2.1.0.
>
> My proposed resolution is that Fluo should use an updated version of
> accumulo2-maven-plugin, after Accumulo 2.1.0 is released with the
> MiniAccumuloCluster improvements. Of course, this requires Accumulo
> 2.1.0 to be released first.

RE: [EXTERNAL] Re: JDK14 and Fluo

Posted by Arvind Shyamsundar <ar...@microsoft.com.INVALID>.
Thank you, Christopher. My tests are with a cluster that uses HDFS in a Highly Available (HA) configuration. The HDFS ZKFC component (which is used in the HA configuration) has a dependency on ZK. I repeated my test with a snapshot build of Hadoop-3.3.1 and ZK 3.5.7 and it worked. I'm not sure what other impact we might expect with Hadoop 3.3 - has anyone been testing with that version?

Warm Regards,

Arvind.

-----Original Message-----
From: Christopher <ct...@apache.org> 
Sent: Thursday, May 14, 2020 10:45 AM
To: fluo-dev <de...@fluo.apache.org>
Subject: Re: [EXTERNAL] Re: JDK14 and Fluo

Hadoop itself doesn't use ZooKeeper. I'm using Hadoop 3.2.1. The open ZK bug appears to be for 3.4, which is EOL. I have been actively contributing to ZK in the past few weeks, where I and others have been using and testing newer JDK versions with 3.5, 3.6, and 3.7, and fixing issues as they are observed. As far as I know, 3.5.8, 3.6.1, and 3.7.0-SNAPSHOT are fine with JDK14.


On Thu, May 14, 2020 at 1:29 PM Arvind Shyamsundar <ar...@microsoft.com.invalid> wrote:
>
> Thank you, Christopher. Can you comment on what version of Hadoop you are using? We've been pointed by some JDK experts at Microsoft to this ZK issue: https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fbrowse%2FZOOKEEPER-3779&amp;data=02%7C01%7Carvindsh%40microsoft.com%7C14e74746812045583ddd08d7f82e807e%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637250750928449394&amp;sdata=l88wA8zmn%2FN37mrxFNLg5vbGhidxnKaQLpoASOqziG0%3D&amp;reserved=0. As far as I can see, Hadoop 3.2.1 (which is what I was using) references ZK 3.4.14, which would perhaps explain the issue. I'm going to repeat my tests with a snapshot build of Hadoop branch-3.3 and report back.
>
> Stay safe everyone!
>
> Warm Regards,
>
> Arvind.
>
> -----Original Message-----
> From: Christopher <ct...@apache.org>
> Sent: Thursday, May 14, 2020 4:42 AM
> To: fluo-dev <de...@fluo.apache.org>
> Subject: Re: [EXTERNAL] Re: JDK14 and Fluo
>
> Just a follow up from this. As far as I can tell, the problem I was having was related to my system's hostname resolver, and that jdk14 works fine.
>
> On Wed, May 6, 2020, 18:10 Arvind Shyamsundar <ar...@microsoft.com.invalid> wrote:
>
> > FWIW, sharing my notes - here's what I see when attempting to setup 
> > an cluster (Accumulo-2.1.0-SNAPSHOT, Hadoop 3.2.1, ZooKeeper 3.5.7) 
> > with OpenJDK 14. The step where the Hadoop ZKFC (hdfs zkfc -formatZK
> > -force) is configured fails with the below exception, which seemed 
> > vaguely similar to Christopher's report of Accumulo being unable to connect to ZK.
> >
> > 2020-05-06 22:00:47,518 INFO zookeeper.ClientCnxn: Opening socket 
> > connection to server leader-3/<unresolved>:2181. Will not attempt to 
> > authenticate using SASL (unknown error)
> > 2020-05-06 22:00:47,518 WARN zookeeper.ClientCnxn: Session 0x0 for 
> > server leader-3/<unresolved>:2181, unexpected error, closing socket 
> > connection and attempting reconnect 
> > java.nio.channels.UnresolvedAddressException
> >         at java.base/sun.nio.ch.Net.checkAddress(Net.java:139)
> >         at java.base/sun.nio.ch
> > .SocketChannelImpl.checkRemote(SocketChannelImpl.java:727)
> >         at java.base/sun.nio.ch
> > .SocketChannelImpl.connect(SocketChannelImpl.java:741)
> >         at
> > org.apache.zookeeper.ClientCnxnSocketNIO.registerAndConnect(ClientCnxnSocketNIO.java:277)
> >         at
> > org.apache.zookeeper.ClientCnxnSocketNIO.connect(ClientCnxnSocketNIO.java:287)
> >         at
> > org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1021)
> >         at
> > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1064)
> >
> > (the above is repeated for the other 2 ZK servers configured in this 
> > cluster as well). The JDK item I found (again, this is all 
> > circumstantial, and I have not spent any time to dig deep) was 
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbu
> > gs
> > .openjdk.java.net%2Fbrowse%2FJDK-8225499&amp;data=02%7C01%7Carvindsh
> > %4
> > 0microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af
> > 91 
> > ab2d7cd011db47%7C1%7C0%7C637250533625323572&amp;sdata=TmSrpgDIlpxltWzH7GGduHqHARmonstwmorQ94yh0o0%3D&amp;reserved=0. The associated note in JDK 14 release notes (https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fjdk.java.net%2F14%2Frelease-notes&amp;data=02%7C01%7Carvindsh%40microsoft.com%7C14e74746812045583ddd08d7f82e807e%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637250750928449394&amp;sdata=DpfMByRjIF63FRbKn0%2F4gyyeiYqvGeMA1xgUZJmZKds%3D&amp;reserved=0) has the below text, which seemed interesting as it kind of matches up with the symptoms above:
> >
> > Additionally, the string format for unresolved addresses has been changed.
> > The method now represents the literal IP address with the token 
> > <unresolved>, for example: foo/<unresolved>:80 instead of foo:80. 
> > This is based on InetAddress::toString, which returns a string of 
> > the form "hostname / literal IP address". To retrieve a string 
> > representation of the hostname, or the string form of the address if 
> > it doesn't have a hostname, use InetSocketAddress::getHostString, 
> > rather than parsing the string representation.
> >
> > - Arvind
> >
> > From: Arvind Shyamsundar <ar...@microsoft.com>
> > Sent: Wednesday, May 6, 2020 1:09 AM
> > To: dev@fluo.apache.org
> > Subject: Re: [EXTERNAL] Re: JDK14 and Fluo
> >
> > Incidentally I did a brief test yesterday with Muchos and jdk 14. I 
> > saw similar issues with zk client (within Hadoop zkfc) being unable 
> > to connect to zk, complaining about an unresolved address. I did 
> > find a vaguely related documented change in jdk14 for ipv6 addresses 
> > but gave up after a while. I can send details, on Wednesday.
> > Sent from Outlook Mobile<https://aka.ms/blhgte>
> >
> > ________________________________
> > From: Christopher <ct...@apache.org>>
> > Sent: Wednesday, May 6, 2020 12:50:39 AM
> > To: fluo-dev <de...@fluo.apache.org>>
> > Subject: [EXTERNAL] Re: JDK14 and Fluo
> >
> > So, a quick follow-up to this:
> >
> > The CMS flags seem to be ignored, so that's not really a problem:
> >
> > OpenJDK 64-Bit Server VM warning: Ignoring option 
> > UseConcMarkSweepGC; support was removed in 14.0 OpenJDK 64-Bit 
> > Server VM warning: Ignoring option CMSInitiatingOccupancyFraction; 
> > support was removed in 14.0
> >
> > These are just warnings about the flags being ignored.
> >
> > I may be having other problems on my machine that prevent the tests 
> > from running. I'm not sure, but it seems that Accumulo's Initialize 
> > is failing to connect to the ZooKeeper server within the 30s timeout 
> > period. There does not appear to be any problems with ZooKeeper 
> > itself, and I can use telnet to send `ruok`. My machine might just 
> > be slow... I'll have to troubleshoot later.
> >
> > On Wed, May 6, 2020 at 2:52 AM Christopher <ctubbsii@apache.org<mailto:
> > ctubbsii@apache.org>> wrote:
> > >
> > > I have switched my development environment to primarily run with
> > > JDK14 and noticed some incompatibilities with Fluo.
> > >
> > > Some of these are now fixed in a PR:
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgi
> > th 
> > ub.com%2Fapache%2Ffluo%2Fpull%2F1093&amp;data=02%7C01%7Carvindsh%40m
> > ic 
> > rosoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91ab
> > 2d
> > 7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=TkVp97HUB3ugh%2FYq
> > d1
> > 8asT8qJIukZxCoT8g4p8AT40M%3D&amp;reserved=0
> > >
> > > However, this doesn't completely fix JDK14 builds. Specifically,
> > > JDK14 removed the Concurrent Mark Sweep garbage collector. This 
> > > garbage collector is hard-coded in MiniAccumuloCluster 2.0.0 and earlier.
> > > Since accumulo2-maven-plugin uses MiniAccumuloCluster, the tests 
> > > cannot run in Fluo on JDK14.
> > >
> > > This issue has already been addressed in upstream Accumulo (
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgi
> > th
> > ub.com%2Fapache%2Faccumulo%2Fpull%2F1427&amp;data=02%7C01%7Carvindsh
> > %4
> > 0microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af
> > 91 
> > ab2d7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=D8n09flrBYZVAw
> > UV
> > fYcOYNIni3Qgk%2BRuJkTK8OnX6xQ%3D&amp;reserved=0)
> > for Accumulo 2.1.0.
> > >
> > > My proposed resolution is that Fluo should use an updated version 
> > > of accumulo2-maven-plugin, after Accumulo 2.1.0 is released with 
> > > the MiniAccumuloCluster improvements. Of course, this requires 
> > > Accumulo
> > > 2.1.0 to be released first.
> >

Re: [EXTERNAL] Re: JDK14 and Fluo

Posted by Christopher <ct...@apache.org>.
Hadoop itself doesn't use ZooKeeper. I'm using Hadoop 3.2.1. The open
ZK bug appears to be for 3.4, which is EOL. I have been actively
contributing to ZK in the past few weeks, where I and others have been
using and testing newer JDK versions with 3.5, 3.6, and 3.7, and
fixing issues as they are observed. As far as I know, 3.5.8, 3.6.1,
and 3.7.0-SNAPSHOT are fine with JDK14.


On Thu, May 14, 2020 at 1:29 PM Arvind Shyamsundar
<ar...@microsoft.com.invalid> wrote:
>
> Thank you, Christopher. Can you comment on what version of Hadoop you are using? We've been pointed by some JDK experts at Microsoft to this ZK issue: https://issues.apache.org/jira/browse/ZOOKEEPER-3779. As far as I can see, Hadoop 3.2.1 (which is what I was using) references ZK 3.4.14, which would perhaps explain the issue. I'm going to repeat my tests with a snapshot build of Hadoop branch-3.3 and report back.
>
> Stay safe everyone!
>
> Warm Regards,
>
> Arvind.
>
> -----Original Message-----
> From: Christopher <ct...@apache.org>
> Sent: Thursday, May 14, 2020 4:42 AM
> To: fluo-dev <de...@fluo.apache.org>
> Subject: Re: [EXTERNAL] Re: JDK14 and Fluo
>
> Just a follow up from this. As far as I can tell, the problem I was having was related to my system's hostname resolver, and that jdk14 works fine.
>
> On Wed, May 6, 2020, 18:10 Arvind Shyamsundar <ar...@microsoft.com.invalid> wrote:
>
> > FWIW, sharing my notes - here's what I see when attempting to setup an
> > cluster (Accumulo-2.1.0-SNAPSHOT, Hadoop 3.2.1, ZooKeeper 3.5.7) with
> > OpenJDK 14. The step where the Hadoop ZKFC (hdfs zkfc -formatZK
> > -force) is configured fails with the below exception, which seemed
> > vaguely similar to Christopher's report of Accumulo being unable to connect to ZK.
> >
> > 2020-05-06 22:00:47,518 INFO zookeeper.ClientCnxn: Opening socket
> > connection to server leader-3/<unresolved>:2181. Will not attempt to
> > authenticate using SASL (unknown error)
> > 2020-05-06 22:00:47,518 WARN zookeeper.ClientCnxn: Session 0x0 for
> > server leader-3/<unresolved>:2181, unexpected error, closing socket
> > connection and attempting reconnect
> > java.nio.channels.UnresolvedAddressException
> >         at java.base/sun.nio.ch.Net.checkAddress(Net.java:139)
> >         at java.base/sun.nio.ch
> > .SocketChannelImpl.checkRemote(SocketChannelImpl.java:727)
> >         at java.base/sun.nio.ch
> > .SocketChannelImpl.connect(SocketChannelImpl.java:741)
> >         at
> > org.apache.zookeeper.ClientCnxnSocketNIO.registerAndConnect(ClientCnxnSocketNIO.java:277)
> >         at
> > org.apache.zookeeper.ClientCnxnSocketNIO.connect(ClientCnxnSocketNIO.java:287)
> >         at
> > org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1021)
> >         at
> > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1064)
> >
> > (the above is repeated for the other 2 ZK servers configured in this
> > cluster as well). The JDK item I found (again, this is all
> > circumstantial, and I have not spent any time to dig deep) was
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugs
> > .openjdk.java.net%2Fbrowse%2FJDK-8225499&amp;data=02%7C01%7Carvindsh%4
> > 0microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91
> > ab2d7cd011db47%7C1%7C0%7C637250533625323572&amp;sdata=TmSrpgDIlpxltWzH7GGduHqHARmonstwmorQ94yh0o0%3D&amp;reserved=0. The associated note in JDK 14 release notes (https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fjdk.java.net%2F14%2Frelease-notes&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=RxZnt3FJ1Jn0fNzTmDM48o5JT%2FAYka8N3Aur6i1frPA%3D&amp;reserved=0) has the below text, which seemed interesting as it kind of matches up with the symptoms above:
> >
> > Additionally, the string format for unresolved addresses has been changed.
> > The method now represents the literal IP address with the token
> > <unresolved>, for example: foo/<unresolved>:80 instead of foo:80. This
> > is based on InetAddress::toString, which returns a string of the form
> > "hostname / literal IP address". To retrieve a string representation
> > of the hostname, or the string form of the address if it doesn't have
> > a hostname, use InetSocketAddress::getHostString, rather than parsing
> > the string representation.
> >
> > - Arvind
> >
> > From: Arvind Shyamsundar <ar...@microsoft.com>
> > Sent: Wednesday, May 6, 2020 1:09 AM
> > To: dev@fluo.apache.org
> > Subject: Re: [EXTERNAL] Re: JDK14 and Fluo
> >
> > Incidentally I did a brief test yesterday with Muchos and jdk 14. I
> > saw similar issues with zk client (within Hadoop zkfc) being unable to
> > connect to zk, complaining about an unresolved address. I did find a
> > vaguely related documented change in jdk14 for ipv6 addresses but gave
> > up after a while. I can send details, on Wednesday.
> > Sent from Outlook Mobile<https://aka.ms/blhgte>
> >
> > ________________________________
> > From: Christopher <ct...@apache.org>>
> > Sent: Wednesday, May 6, 2020 12:50:39 AM
> > To: fluo-dev <de...@fluo.apache.org>>
> > Subject: [EXTERNAL] Re: JDK14 and Fluo
> >
> > So, a quick follow-up to this:
> >
> > The CMS flags seem to be ignored, so that's not really a problem:
> >
> > OpenJDK 64-Bit Server VM warning: Ignoring option UseConcMarkSweepGC;
> > support was removed in 14.0 OpenJDK 64-Bit Server VM warning: Ignoring
> > option CMSInitiatingOccupancyFraction; support was removed in 14.0
> >
> > These are just warnings about the flags being ignored.
> >
> > I may be having other problems on my machine that prevent the tests
> > from running. I'm not sure, but it seems that Accumulo's Initialize is
> > failing to connect to the ZooKeeper server within the 30s timeout
> > period. There does not appear to be any problems with ZooKeeper
> > itself, and I can use telnet to send `ruok`. My machine might just be
> > slow... I'll have to troubleshoot later.
> >
> > On Wed, May 6, 2020 at 2:52 AM Christopher <ctubbsii@apache.org<mailto:
> > ctubbsii@apache.org>> wrote:
> > >
> > > I have switched my development environment to primarily run with
> > > JDK14 and noticed some incompatibilities with Fluo.
> > >
> > > Some of these are now fixed in a PR:
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgith
> > ub.com%2Fapache%2Ffluo%2Fpull%2F1093&amp;data=02%7C01%7Carvindsh%40mic
> > rosoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91ab2d
> > 7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=TkVp97HUB3ugh%2FYqd1
> > 8asT8qJIukZxCoT8g4p8AT40M%3D&amp;reserved=0
> > >
> > > However, this doesn't completely fix JDK14 builds. Specifically,
> > > JDK14 removed the Concurrent Mark Sweep garbage collector. This
> > > garbage collector is hard-coded in MiniAccumuloCluster 2.0.0 and earlier.
> > > Since accumulo2-maven-plugin uses MiniAccumuloCluster, the tests
> > > cannot run in Fluo on JDK14.
> > >
> > > This issue has already been addressed in upstream Accumulo (
> > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgith
> > ub.com%2Fapache%2Faccumulo%2Fpull%2F1427&amp;data=02%7C01%7Carvindsh%4
> > 0microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91
> > ab2d7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=D8n09flrBYZVAwUV
> > fYcOYNIni3Qgk%2BRuJkTK8OnX6xQ%3D&amp;reserved=0)
> > for Accumulo 2.1.0.
> > >
> > > My proposed resolution is that Fluo should use an updated version of
> > > accumulo2-maven-plugin, after Accumulo 2.1.0 is released with the
> > > MiniAccumuloCluster improvements. Of course, this requires Accumulo
> > > 2.1.0 to be released first.
> >

RE: [EXTERNAL] Re: JDK14 and Fluo

Posted by Arvind Shyamsundar <ar...@microsoft.com.INVALID>.
Thank you, Christopher. Can you comment on what version of Hadoop you are using? We've been pointed by some JDK experts at Microsoft to this ZK issue: https://issues.apache.org/jira/browse/ZOOKEEPER-3779. As far as I can see, Hadoop 3.2.1 (which is what I was using) references ZK 3.4.14, which would perhaps explain the issue. I'm going to repeat my tests with a snapshot build of Hadoop branch-3.3 and report back.

Stay safe everyone!

Warm Regards,

Arvind.

-----Original Message-----
From: Christopher <ct...@apache.org> 
Sent: Thursday, May 14, 2020 4:42 AM
To: fluo-dev <de...@fluo.apache.org>
Subject: Re: [EXTERNAL] Re: JDK14 and Fluo

Just a follow up from this. As far as I can tell, the problem I was having was related to my system's hostname resolver, and that jdk14 works fine.

On Wed, May 6, 2020, 18:10 Arvind Shyamsundar <ar...@microsoft.com.invalid> wrote:

> FWIW, sharing my notes - here's what I see when attempting to setup an 
> cluster (Accumulo-2.1.0-SNAPSHOT, Hadoop 3.2.1, ZooKeeper 3.5.7) with 
> OpenJDK 14. The step where the Hadoop ZKFC (hdfs zkfc -formatZK 
> -force) is configured fails with the below exception, which seemed 
> vaguely similar to Christopher's report of Accumulo being unable to connect to ZK.
>
> 2020-05-06 22:00:47,518 INFO zookeeper.ClientCnxn: Opening socket 
> connection to server leader-3/<unresolved>:2181. Will not attempt to 
> authenticate using SASL (unknown error)
> 2020-05-06 22:00:47,518 WARN zookeeper.ClientCnxn: Session 0x0 for 
> server leader-3/<unresolved>:2181, unexpected error, closing socket 
> connection and attempting reconnect 
> java.nio.channels.UnresolvedAddressException
>         at java.base/sun.nio.ch.Net.checkAddress(Net.java:139)
>         at java.base/sun.nio.ch
> .SocketChannelImpl.checkRemote(SocketChannelImpl.java:727)
>         at java.base/sun.nio.ch
> .SocketChannelImpl.connect(SocketChannelImpl.java:741)
>         at
> org.apache.zookeeper.ClientCnxnSocketNIO.registerAndConnect(ClientCnxnSocketNIO.java:277)
>         at
> org.apache.zookeeper.ClientCnxnSocketNIO.connect(ClientCnxnSocketNIO.java:287)
>         at
> org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1021)
>         at
> org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1064)
>
> (the above is repeated for the other 2 ZK servers configured in this 
> cluster as well). The JDK item I found (again, this is all 
> circumstantial, and I have not spent any time to dig deep) was 
> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugs
> .openjdk.java.net%2Fbrowse%2FJDK-8225499&amp;data=02%7C01%7Carvindsh%4
> 0microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91
> ab2d7cd011db47%7C1%7C0%7C637250533625323572&amp;sdata=TmSrpgDIlpxltWzH7GGduHqHARmonstwmorQ94yh0o0%3D&amp;reserved=0. The associated note in JDK 14 release notes (https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fjdk.java.net%2F14%2Frelease-notes&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=RxZnt3FJ1Jn0fNzTmDM48o5JT%2FAYka8N3Aur6i1frPA%3D&amp;reserved=0) has the below text, which seemed interesting as it kind of matches up with the symptoms above:
>
> Additionally, the string format for unresolved addresses has been changed.
> The method now represents the literal IP address with the token 
> <unresolved>, for example: foo/<unresolved>:80 instead of foo:80. This 
> is based on InetAddress::toString, which returns a string of the form 
> "hostname / literal IP address". To retrieve a string representation 
> of the hostname, or the string form of the address if it doesn't have 
> a hostname, use InetSocketAddress::getHostString, rather than parsing 
> the string representation.
>
> - Arvind
>
> From: Arvind Shyamsundar <ar...@microsoft.com>
> Sent: Wednesday, May 6, 2020 1:09 AM
> To: dev@fluo.apache.org
> Subject: Re: [EXTERNAL] Re: JDK14 and Fluo
>
> Incidentally I did a brief test yesterday with Muchos and jdk 14. I 
> saw similar issues with zk client (within Hadoop zkfc) being unable to 
> connect to zk, complaining about an unresolved address. I did find a 
> vaguely related documented change in jdk14 for ipv6 addresses but gave 
> up after a while. I can send details, on Wednesday.
> Sent from Outlook Mobile<https://aka.ms/blhgte>
>
> ________________________________
> From: Christopher <ct...@apache.org>>
> Sent: Wednesday, May 6, 2020 12:50:39 AM
> To: fluo-dev <de...@fluo.apache.org>>
> Subject: [EXTERNAL] Re: JDK14 and Fluo
>
> So, a quick follow-up to this:
>
> The CMS flags seem to be ignored, so that's not really a problem:
>
> OpenJDK 64-Bit Server VM warning: Ignoring option UseConcMarkSweepGC; 
> support was removed in 14.0 OpenJDK 64-Bit Server VM warning: Ignoring 
> option CMSInitiatingOccupancyFraction; support was removed in 14.0
>
> These are just warnings about the flags being ignored.
>
> I may be having other problems on my machine that prevent the tests 
> from running. I'm not sure, but it seems that Accumulo's Initialize is 
> failing to connect to the ZooKeeper server within the 30s timeout 
> period. There does not appear to be any problems with ZooKeeper 
> itself, and I can use telnet to send `ruok`. My machine might just be 
> slow... I'll have to troubleshoot later.
>
> On Wed, May 6, 2020 at 2:52 AM Christopher <ctubbsii@apache.org<mailto:
> ctubbsii@apache.org>> wrote:
> >
> > I have switched my development environment to primarily run with 
> > JDK14 and noticed some incompatibilities with Fluo.
> >
> > Some of these are now fixed in a PR:
> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgith
> ub.com%2Fapache%2Ffluo%2Fpull%2F1093&amp;data=02%7C01%7Carvindsh%40mic
> rosoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91ab2d
> 7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=TkVp97HUB3ugh%2FYqd1
> 8asT8qJIukZxCoT8g4p8AT40M%3D&amp;reserved=0
> >
> > However, this doesn't completely fix JDK14 builds. Specifically, 
> > JDK14 removed the Concurrent Mark Sweep garbage collector. This 
> > garbage collector is hard-coded in MiniAccumuloCluster 2.0.0 and earlier.
> > Since accumulo2-maven-plugin uses MiniAccumuloCluster, the tests 
> > cannot run in Fluo on JDK14.
> >
> > This issue has already been addressed in upstream Accumulo (
> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgith
> ub.com%2Fapache%2Faccumulo%2Fpull%2F1427&amp;data=02%7C01%7Carvindsh%4
> 0microsoft.com%7Cf4c61036a9714c7045bb08d7f7fbe87b%7C72f988bf86f141af91
> ab2d7cd011db47%7C1%7C0%7C637250533625328563&amp;sdata=D8n09flrBYZVAwUV
> fYcOYNIni3Qgk%2BRuJkTK8OnX6xQ%3D&amp;reserved=0)
> for Accumulo 2.1.0.
> >
> > My proposed resolution is that Fluo should use an updated version of 
> > accumulo2-maven-plugin, after Accumulo 2.1.0 is released with the 
> > MiniAccumuloCluster improvements. Of course, this requires Accumulo
> > 2.1.0 to be released first.
>

Re: [EXTERNAL] Re: JDK14 and Fluo

Posted by Christopher <ct...@apache.org>.
Just a follow up from this. As far as I can tell, the problem I was having
was related to my system's hostname resolver, and that jdk14 works fine.

On Wed, May 6, 2020, 18:10 Arvind Shyamsundar
<ar...@microsoft.com.invalid> wrote:

> FWIW, sharing my notes - here's what I see when attempting to setup an
> cluster (Accumulo-2.1.0-SNAPSHOT, Hadoop 3.2.1, ZooKeeper 3.5.7) with
> OpenJDK 14. The step where the Hadoop ZKFC (hdfs zkfc -formatZK -force) is
> configured fails with the below exception, which seemed vaguely similar to
> Christopher's report of Accumulo being unable to connect to ZK.
>
> 2020-05-06 22:00:47,518 INFO zookeeper.ClientCnxn: Opening socket
> connection to server leader-3/<unresolved>:2181. Will not attempt to
> authenticate using SASL (unknown error)
> 2020-05-06 22:00:47,518 WARN zookeeper.ClientCnxn: Session 0x0 for server
> leader-3/<unresolved>:2181, unexpected error, closing socket connection and
> attempting reconnect
> java.nio.channels.UnresolvedAddressException
>         at java.base/sun.nio.ch.Net.checkAddress(Net.java:139)
>         at java.base/sun.nio.ch
> .SocketChannelImpl.checkRemote(SocketChannelImpl.java:727)
>         at java.base/sun.nio.ch
> .SocketChannelImpl.connect(SocketChannelImpl.java:741)
>         at
> org.apache.zookeeper.ClientCnxnSocketNIO.registerAndConnect(ClientCnxnSocketNIO.java:277)
>         at
> org.apache.zookeeper.ClientCnxnSocketNIO.connect(ClientCnxnSocketNIO.java:287)
>         at
> org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1021)
>         at
> org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1064)
>
> (the above is repeated for the other 2 ZK servers configured in this
> cluster as well). The JDK item I found (again, this is all circumstantial,
> and I have not spent any time to dig deep) was
> https://bugs.openjdk.java.net/browse/JDK-8225499. The associated note in
> JDK 14 release notes (https://jdk.java.net/14/release-notes) has the
> below text, which seemed interesting as it kind of matches up with the
> symptoms above:
>
> Additionally, the string format for unresolved addresses has been changed.
> The method now represents the literal IP address with the token
> <unresolved>, for example: foo/<unresolved>:80 instead of foo:80. This is
> based on InetAddress::toString, which returns a string of the form
> "hostname / literal IP address". To retrieve a string representation of the
> hostname, or the string form of the address if it doesn't have a hostname,
> use InetSocketAddress::getHostString, rather than parsing the string
> representation.
>
> - Arvind
>
> From: Arvind Shyamsundar <ar...@microsoft.com>
> Sent: Wednesday, May 6, 2020 1:09 AM
> To: dev@fluo.apache.org
> Subject: Re: [EXTERNAL] Re: JDK14 and Fluo
>
> Incidentally I did a brief test yesterday with Muchos and jdk 14. I saw
> similar issues with zk client (within Hadoop zkfc) being unable to connect
> to zk, complaining about an unresolved address. I did find a vaguely
> related documented change in jdk14 for ipv6 addresses but gave up after a
> while. I can send details, on Wednesday.
> Sent from Outlook Mobile<https://aka.ms/blhgte>
>
> ________________________________
> From: Christopher <ct...@apache.org>>
> Sent: Wednesday, May 6, 2020 12:50:39 AM
> To: fluo-dev <de...@fluo.apache.org>>
> Subject: [EXTERNAL] Re: JDK14 and Fluo
>
> So, a quick follow-up to this:
>
> The CMS flags seem to be ignored, so that's not really a problem:
>
> OpenJDK 64-Bit Server VM warning: Ignoring option UseConcMarkSweepGC;
> support was removed in 14.0
> OpenJDK 64-Bit Server VM warning: Ignoring option
> CMSInitiatingOccupancyFraction; support was removed in 14.0
>
> These are just warnings about the flags being ignored.
>
> I may be having other problems on my machine that prevent the tests
> from running. I'm not sure, but it seems that Accumulo's Initialize is
> failing to connect to the ZooKeeper server within the 30s timeout
> period. There does not appear to be any problems with ZooKeeper
> itself, and I can use telnet to send `ruok`. My machine might just be
> slow... I'll have to troubleshoot later.
>
> On Wed, May 6, 2020 at 2:52 AM Christopher <ctubbsii@apache.org<mailto:
> ctubbsii@apache.org>> wrote:
> >
> > I have switched my development environment to primarily run with JDK14
> > and noticed some incompatibilities with Fluo.
> >
> > Some of these are now fixed in a PR:
> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Ffluo%2Fpull%2F1093&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf1ca22cdd55e4513483c08d7f19235c2%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637243482584185299&amp;sdata=joEvtJp2kDgS%2F8iI9qNh%2F8NL2qVgo7V37DuP49NfWxw%3D&amp;reserved=0
> >
> > However, this doesn't completely fix JDK14 builds. Specifically, JDK14
> > removed the Concurrent Mark Sweep garbage collector. This garbage
> > collector is hard-coded in MiniAccumuloCluster 2.0.0 and earlier.
> > Since accumulo2-maven-plugin uses MiniAccumuloCluster, the tests
> > cannot run in Fluo on JDK14.
> >
> > This issue has already been addressed in upstream Accumulo
> > (
> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Faccumulo%2Fpull%2F1427&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf1ca22cdd55e4513483c08d7f19235c2%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637243482584185299&amp;sdata=GzzmfMgNvw135zQhKSVmara4uBu6%2BgVfHE7YtXlhSjs%3D&amp;reserved=0)
> for Accumulo 2.1.0.
> >
> > My proposed resolution is that Fluo should use an updated version of
> > accumulo2-maven-plugin, after Accumulo 2.1.0 is released with the
> > MiniAccumuloCluster improvements. Of course, this requires Accumulo
> > 2.1.0 to be released first.
>

RE: [EXTERNAL] Re: JDK14 and Fluo

Posted by Arvind Shyamsundar <ar...@microsoft.com.INVALID>.
FWIW, sharing my notes - here's what I see when attempting to setup an cluster (Accumulo-2.1.0-SNAPSHOT, Hadoop 3.2.1, ZooKeeper 3.5.7) with OpenJDK 14. The step where the Hadoop ZKFC (hdfs zkfc -formatZK -force) is configured fails with the below exception, which seemed vaguely similar to Christopher's report of Accumulo being unable to connect to ZK.

2020-05-06 22:00:47,518 INFO zookeeper.ClientCnxn: Opening socket connection to server leader-3/<unresolved>:2181. Will not attempt to authenticate using SASL (unknown error)
2020-05-06 22:00:47,518 WARN zookeeper.ClientCnxn: Session 0x0 for server leader-3/<unresolved>:2181, unexpected error, closing socket connection and attempting reconnect
java.nio.channels.UnresolvedAddressException
        at java.base/sun.nio.ch.Net.checkAddress(Net.java:139)
        at java.base/sun.nio.ch.SocketChannelImpl.checkRemote(SocketChannelImpl.java:727)
        at java.base/sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:741)
        at org.apache.zookeeper.ClientCnxnSocketNIO.registerAndConnect(ClientCnxnSocketNIO.java:277)
        at org.apache.zookeeper.ClientCnxnSocketNIO.connect(ClientCnxnSocketNIO.java:287)
        at org.apache.zookeeper.ClientCnxn$SendThread.startConnect(ClientCnxn.java:1021)
        at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1064)

(the above is repeated for the other 2 ZK servers configured in this cluster as well). The JDK item I found (again, this is all circumstantial, and I have not spent any time to dig deep) was https://bugs.openjdk.java.net/browse/JDK-8225499. The associated note in JDK 14 release notes (https://jdk.java.net/14/release-notes) has the below text, which seemed interesting as it kind of matches up with the symptoms above:

Additionally, the string format for unresolved addresses has been changed. The method now represents the literal IP address with the token <unresolved>, for example: foo/<unresolved>:80 instead of foo:80. This is based on InetAddress::toString, which returns a string of the form "hostname / literal IP address". To retrieve a string representation of the hostname, or the string form of the address if it doesn't have a hostname, use InetSocketAddress::getHostString, rather than parsing the string representation.

- Arvind

From: Arvind Shyamsundar <ar...@microsoft.com>
Sent: Wednesday, May 6, 2020 1:09 AM
To: dev@fluo.apache.org
Subject: Re: [EXTERNAL] Re: JDK14 and Fluo

Incidentally I did a brief test yesterday with Muchos and jdk 14. I saw similar issues with zk client (within Hadoop zkfc) being unable to connect to zk, complaining about an unresolved address. I did find a vaguely related documented change in jdk14 for ipv6 addresses but gave up after a while. I can send details, on Wednesday.
Sent from Outlook Mobile<https://aka.ms/blhgte>

________________________________
From: Christopher <ct...@apache.org>>
Sent: Wednesday, May 6, 2020 12:50:39 AM
To: fluo-dev <de...@fluo.apache.org>>
Subject: [EXTERNAL] Re: JDK14 and Fluo

So, a quick follow-up to this:

The CMS flags seem to be ignored, so that's not really a problem:

OpenJDK 64-Bit Server VM warning: Ignoring option UseConcMarkSweepGC;
support was removed in 14.0
OpenJDK 64-Bit Server VM warning: Ignoring option
CMSInitiatingOccupancyFraction; support was removed in 14.0

These are just warnings about the flags being ignored.

I may be having other problems on my machine that prevent the tests
from running. I'm not sure, but it seems that Accumulo's Initialize is
failing to connect to the ZooKeeper server within the 30s timeout
period. There does not appear to be any problems with ZooKeeper
itself, and I can use telnet to send `ruok`. My machine might just be
slow... I'll have to troubleshoot later.

On Wed, May 6, 2020 at 2:52 AM Christopher <ct...@apache.org>> wrote:
>
> I have switched my development environment to primarily run with JDK14
> and noticed some incompatibilities with Fluo.
>
> Some of these are now fixed in a PR: https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Ffluo%2Fpull%2F1093&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf1ca22cdd55e4513483c08d7f19235c2%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637243482584185299&amp;sdata=joEvtJp2kDgS%2F8iI9qNh%2F8NL2qVgo7V37DuP49NfWxw%3D&amp;reserved=0
>
> However, this doesn't completely fix JDK14 builds. Specifically, JDK14
> removed the Concurrent Mark Sweep garbage collector. This garbage
> collector is hard-coded in MiniAccumuloCluster 2.0.0 and earlier.
> Since accumulo2-maven-plugin uses MiniAccumuloCluster, the tests
> cannot run in Fluo on JDK14.
>
> This issue has already been addressed in upstream Accumulo
> (https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fapache%2Faccumulo%2Fpull%2F1427&amp;data=02%7C01%7Carvindsh%40microsoft.com%7Cf1ca22cdd55e4513483c08d7f19235c2%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637243482584185299&amp;sdata=GzzmfMgNvw135zQhKSVmara4uBu6%2BgVfHE7YtXlhSjs%3D&amp;reserved=0) for Accumulo 2.1.0.
>
> My proposed resolution is that Fluo should use an updated version of
> accumulo2-maven-plugin, after Accumulo 2.1.0 is released with the
> MiniAccumuloCluster improvements. Of course, this requires Accumulo
> 2.1.0 to be released first.