Posted to issues@flink.apache.org by "Dawid Wysakowicz (Jira)" <ji...@apache.org> on 2021/01/26 07:56:00 UTC

[jira] [Updated] (FLINK-21147) Resuming Savepoint (file, async, no parallelism change) fails with UnknownHostException

     [ https://issues.apache.org/jira/browse/FLINK-21147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dawid Wysakowicz updated FLINK-21147:
-------------------------------------
    Labels: test-stability  (was: )

> Resuming Savepoint (file, async, no parallelism change) fails with UnknownHostException
> ---------------------------------------------------------------------------------------
>
>                 Key: FLINK-21147
>                 URL: https://issues.apache.org/jira/browse/FLINK-21147
>             Project: Flink
>          Issue Type: Bug
>          Components: Tests
>    Affects Versions: 1.11.4
>            Reporter: Dawid Wysakowicz
>            Priority: Major
>              Labels: test-stability
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=12484&view=logs&j=c88eea3b-64a0-564d-0031-9fdcd7b8abee&t=2b7514ee-e706-5046-657b-3430666e7bd9
> {code}
> 2021-01-25 21:31:14,388 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: jobmanager.rpc.address, localhost
> 2021-01-25 21:31:14,399 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: jobmanager.rpc.port, 6123
> 2021-01-25 21:31:14,399 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: jobmanager.memory.process.size, 1600m
> 2021-01-25 21:31:14,400 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: taskmanager.memory.process.size, 1728m
> 2021-01-25 21:31:14,400 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: parallelism.default, 1
> 2021-01-25 21:31:14,400 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: jobmanager.execution.failover-strategy, region
> 2021-01-25 21:31:14,400 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: taskmanager.numberOfTaskSlots, 2
> 2021-01-25 21:31:14,400 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: metrics.fetcher.update-interval, 2000
> 2021-01-25 21:31:14,401 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: metrics.reporter.slf4j.factory.class, org.apache.flink.metrics.slf4j.Slf4jReporterFactory
> 2021-01-25 21:31:14,401 INFO  org.apache.flink.configuration.GlobalConfiguration           [] - Loading configuration property: metrics.reporter.slf4j.interval, 1 SECONDS
> 2021-01-25 21:31:14,470 INFO  org.apache.flink.core.fs.FileSystem                          [] - Hadoop is not in the classpath/dependencies. The extended set of supported File Systems via Hadoop is not available.
> 2021-01-25 21:31:14,535 ERROR org.apache.flink.core.fs.local.LocalFileSystem               [] - Could not resolve local host
> java.net.UnknownHostException: fv-az227-139: fv-az227-139: Name or service not known
> 	at java.net.InetAddress.getLocalHost(InetAddress.java:1506) ~[?:1.8.0_275]
> 	at org.apache.flink.core.fs.local.LocalFileSystem.<init>(LocalFileSystem.java:95) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.core.fs.local.LocalFileSystem.<clinit>(LocalFileSystem.java:71) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.core.fs.local.LocalFileSystemFactory.getScheme(LocalFileSystemFactory.java:33) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.core.fs.FileSystem.initialize(FileSystem.java:344) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManagerSecurely(TaskManagerRunner.java:374) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManagerSecurely(TaskManagerRunner.java:360) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.main(TaskManagerRunner.java:336) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> Caused by: java.net.UnknownHostException: fv-az227-139: Name or service not known
> 	at java.net.Inet6AddressImpl.lookupAllHostAddr(Native Method) ~[?:1.8.0_275]
> 	at java.net.InetAddress$2.lookupAllHostAddr(InetAddress.java:929) ~[?:1.8.0_275]
> 	at java.net.InetAddress.getAddressesFromNameService(InetAddress.java:1324) ~[?:1.8.0_275]
> 	at java.net.InetAddress.getLocalHost(InetAddress.java:1501) ~[?:1.8.0_275]
> 	... 7 more
> 2021-01-25 21:31:14,599 INFO  org.apache.flink.runtime.security.modules.HadoopModuleFactory [] - Cannot create Hadoop Security Module because Hadoop cannot be found in the Classpath.
> 2021-01-25 21:31:14,603 INFO  org.apache.flink.runtime.security.modules.JaasModule         [] - Jaas file will be created as /tmp/jaas-4712731418375480039.conf.
> 2021-01-25 21:31:14,630 INFO  org.apache.flink.runtime.security.contexts.HadoopSecurityContextFactory [] - Cannot install HadoopSecurityContext because Hadoop cannot be found in the Classpath.
> 2021-01-25 21:31:14,698 INFO  org.apache.flink.configuration.Configuration                 [] - Config uses fallback configuration key 'jobmanager.rpc.address' instead of key 'rest.address'
> 2021-01-25 21:31:14,711 INFO  org.apache.flink.runtime.util.LeaderRetrievalUtils           [] - Trying to select the network interface and address to use by connecting to the leading JobManager.
> 2021-01-25 21:31:14,711 INFO  org.apache.flink.runtime.util.LeaderRetrievalUtils           [] - TaskManager will try to connect for PT10S before falling back to heuristics
> 2021-01-25 21:31:14,802 WARN  org.apache.flink.runtime.net.ConnectionUtils                 [] - Could not resolve local hostname to an IP address: fv-az227-139: fv-az227-139: Name or service not known
> 2021-01-25 21:31:14,815 ERROR org.apache.flink.runtime.taskexecutor.TaskManagerRunner      [] - TaskManager initialization failed.
> org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not find the connecting address by connecting to the current leader.
> 	at org.apache.flink.runtime.util.LeaderRetrievalUtils.findConnectingAddress(LeaderRetrievalUtils.java:95) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.determineTaskManagerBindAddressByConnectingToResourceManager(TaskManagerRunner.java:547) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.determineTaskManagerBindAddress(TaskManagerRunner.java:535) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.createRpcService(TaskManagerRunner.java:516) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.<init>(TaskManagerRunner.java:146) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManager(TaskManagerRunner.java:347) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.lambda$runTaskManagerSecurely$3(TaskManagerRunner.java:381) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.security.contexts.NoOpSecurityContext.runSecured(NoOpSecurityContext.java:28) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManagerSecurely(TaskManagerRunner.java:379) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManagerSecurely(TaskManagerRunner.java:360) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.taskexecutor.TaskManagerRunner.main(TaskManagerRunner.java:336) [flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> Caused by: org.apache.flink.runtime.leaderretrieval.LeaderRetrievalException: Could not retrieve the connecting address to the current leader with the akka URL akka.tcp://flink@localhost:6123/user/rpc/resourcemanager_*.
> 	at org.apache.flink.runtime.net.ConnectionUtils$LeaderConnectingAddressListener.findConnectingAddress(ConnectionUtils.java:507) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.net.ConnectionUtils$LeaderConnectingAddressListener.findConnectingAddress(ConnectionUtils.java:386) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.util.LeaderRetrievalUtils.findConnectingAddress(LeaderRetrievalUtils.java:93) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	... 10 more
> Caused by: java.net.UnknownHostException: fv-az227-139: fv-az227-139: Name or service not known
> 	at java.net.InetAddress.getLocalHost(InetAddress.java:1506) ~[?:1.8.0_275]
> 	at org.apache.flink.runtime.net.ConnectionUtils.tryLocalHostBeforeReturning(ConnectionUtils.java:191) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.net.ConnectionUtils.findAddressUsingStrategy(ConnectionUtils.java:279) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.net.ConnectionUtils$LeaderConnectingAddressListener.findConnectingAddress(ConnectionUtils.java:441) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.net.ConnectionUtils$LeaderConnectingAddressListener.findConnectingAddress(ConnectionUtils.java:386) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	at org.apache.flink.runtime.util.LeaderRetrievalUtils.findConnectingAddress(LeaderRetrievalUtils.java:93) ~[flink-dist_2.11-1.11-SNAPSHOT.jar:1.11-SNAPSHOT]
> 	... 10 more
> {code}
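
Both stack traces in the log above come from the same JDK call, java.net.InetAddress.getLocalHost(), which fails when the machine's own hostname (here fv-az227-139) cannot be resolved. The following is a minimal sketch for checking whether a build agent is affected. The class name LocalHostCheck and the loopback fallback are assumptions made for illustration; they are not taken from Flink's code.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical diagnostic, not part of Flink: checks whether the local
// hostname resolves, which is the lookup that fails in the log above.
public class LocalHostCheck {

    // Returns the address of the local host, or the loopback address if the
    // hostname cannot be resolved. The fallback is an assumption chosen for
    // this sketch; it is not what Flink does in the traces above.
    static InetAddress resolveLocalHostOrLoopback() {
        try {
            return InetAddress.getLocalHost();
        } catch (UnknownHostException e) {
            System.err.println("Could not resolve local host: " + e.getMessage());
            return InetAddress.getLoopbackAddress();
        }
    }

    public static void main(String[] args) {
        InetAddress address = resolveLocalHostOrLoopback();
        System.out.println("Resolved local address: " + address);
    }
}
```

If this prints the "Could not resolve local host" message on the agent, the hostname is probably missing from /etc/hosts or DNS on that machine, which would be consistent with the failure being environmental rather than specific to the savepoint test.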


