Posted to user@storm.apache.org by Julien Nioche <li...@gmail.com> on 2014/01/24 09:49:22 UTC

Components throwing java.net.UnknownHostException

Hi,

One of my bolts and my spout periodically throw the following exception:

java.lang.RuntimeException: java.net.UnknownHostException: Mint15-Vostro-410: Mint15-Vostro-410
	at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:90)
	at backtype.storm.utils.DisruptorQueue.consumeBatchWhenAvailable(DisruptorQueue.java:61)
	at backtype.storm.disruptor$consume_batch_when_available.invoke(disruptor.clj:62)
	at backtype.storm.daemon.executor$fn__3498$fn__3510$fn__3557.invoke(executor.clj:730)
	at backtype.storm.util$async_loop$fn__444.invoke(util.clj:403)
	at clojure.lang.AFn.run(AFn.java:24)
	at java.lang.Thread.run(Thread.java:662)
Caused by: java.net.UnknownHostException: Mint15-Vostro-410: Mint15-Vostro-410
	at java.net.InetAddress.getLocalHost(InetAddress.java:1402)
	at backtype.storm.daemon.executor$metrics_tick.invoke(executor.clj:264)
	at backtype.storm.daemon.executor$fn__3498$tuple_action_fn__3500.invoke(executor.clj:604)
	at backtype.storm.daemon.executor$mk_task_receiver$fn__3421.invoke(executor.clj:386)
	at backtype.storm.disruptor$clojure_handler$reify__2962.onEvent(disruptor.clj:43)
	at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:87)
	... 6 more

cat /etc/hosts
127.0.0.1 localhost
127.0.1.1 Mint15-Vostro-410
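
The "Caused by" frames above show the exception being raised from java.net.InetAddress.getLocalHost(), called inside metrics_tick. A minimal sketch for checking the same lookup outside of Storm, on the machine running the workers, might look like the following. The class and method names (LocalHostCheck, describeLocalHost) are made up for illustration and are not part of Storm; the sketch only reports whether the JVM can resolve the local host name at the moment it runs, so it may not catch an intermittent failure.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical helper, not part of Storm: repeats the lookup seen in the trace.
class LocalHostCheck {

    // Returns a one-line description of the lookup result instead of throwing.
    static String describeLocalHost() {
        try {
            // Same JDK call that appears in the "Caused by" section of the trace.
            InetAddress addr = InetAddress.getLocalHost();
            return "resolved " + addr.getHostName() + " -> " + addr.getHostAddress();
        } catch (UnknownHostException e) {
            // The message normally contains the host name that could not be resolved.
            return "unresolvable: " + e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(describeLocalHost());
    }
}
```

Running it several times, ideally as the same user and with the same JVM as the Storm workers, would show whether the lookup fails only some of the time.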

storm.yaml
storm.messaging.transport: "backtype.storm.messaging.netty.Context"
storm.messaging.netty.server_worker_threads: 1
storm.messaging.netty.client_worker_threads: 1
storm.messaging.netty.buffer_size: 5242880
storm.messaging.netty.max_retries: 10
storm.messaging.netty.max_wait_ms: 1000
storm.messaging.netty.min_wait_ms: 100
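
To get a rough feel for how long a Netty client with these settings could keep retrying before giving up, here is a small arithmetic sketch. It assumes that each retry waits between min_wait_ms and max_wait_ms; the doubling schedule is only one plausible backoff, chosen for illustration, and may not be what Storm actually does. The class and method names are hypothetical.

```java
// Hypothetical helper, not part of Storm: estimates total wait across retries.
class ReconnectBudget {

    // Upper bound, assuming no single retry ever waits longer than maxWaitMs.
    static long worstCaseWaitMs(int maxRetries, long maxWaitMs) {
        return (long) maxRetries * maxWaitMs;
    }

    // Total wait under an assumed schedule that starts at minWaitMs,
    // doubles after every attempt and is capped at maxWaitMs.
    static long doublingBackoffTotalMs(int maxRetries, long minWaitMs, long maxWaitMs) {
        long total = 0;
        long wait = minWaitMs;
        for (int attempt = 0; attempt < maxRetries; attempt++) {
            total += Math.min(wait, maxWaitMs);
            wait = Math.min(wait * 2, maxWaitMs);
        }
        return total;
    }

    public static void main(String[] args) {
        // Values taken from the storm.yaml settings above.
        System.out.println(worstCaseWaitMs(10, 1000));             // prints 10000
        System.out.println(doublingBackoffTotalMs(10, 100, 1000)); // prints 7500
    }
}
```

Under these assumptions a client would stop retrying after at most about ten seconds.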


The topology works OK, but the workers seem to get restarted during
execution. It is not clear whether this is related to the exception above.

Any idea?

Thanks

Julien

-- 

Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble

Re: Components throwing java.net.UnknownHostException

Posted by Julien Nioche <li...@gmail.com>.
I've looked a bit more into the logs. The workers are restarted after they
fail to send a heartbeat. Their logs contain:

2014-01-24 11:10:15 b.s.m.n.StormClientHandler [INFO] Connection failed:
java.io.IOException: Connection reset by peer
	at sun.nio.ch.FileDispatcher.read0(Native Method) ~[na:1.6.0_45]
	at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:21) ~[na:1.6.0_45]
	at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:198) ~[na:1.6.0_45]
	at sun.nio.ch.IOUtil.read(IOUtil.java:166) ~[na:1.6.0_45]
	at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:245) ~[na:1.6.0_45]
	at org.jboss.netty.channel.socket.nio.NioWorker.read(NioWorker.java:64) ~[netty-3.6.3.Final.jar:na]
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.process(AbstractNioWorker.java:107) ~[netty-3.6.3.Final.jar:na]
	at org.jboss.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:312) ~[netty-3.6.3.Final.jar:na]
	at org.jboss.netty.channel.socket.nio.AbstractNioWorker.run(AbstractNioWorker.java:88) ~[netty-3.6.3.Final.jar:na]
	at org.jboss.netty.channel.socket.nio.NioWorker.run(NioWorker.java:178) ~[netty-3.6.3.Final.jar:na]
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895) [na:1.6.0_45]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918) [na:1.6.0_45]
	at java.lang.Thread.run(Thread.java:662) [na:1.6.0_45]

followed by 10 attempts to reconnect and finally

2014-01-24 11:10:24 b.s.util [ERROR] Async loop died!
java.lang.RuntimeException: java.lang.RuntimeException: Client is being closed, and does not take requests any more
	at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:90) ~[storm-core-0.9.0.1.jar:na]
	at backtype.storm.utils.DisruptorQueue.consumeBatchWhenAvailable(DisruptorQueue.java:61) ~[storm-core-0.9.0.1.jar:na]
	at backtype.storm.disruptor$consume_batch_when_available.invoke(disruptor.clj:62) ~[storm-core-0.9.0.1.jar:na]
	at backtype.storm.disruptor$consume_loop_STAR_$fn__2975.invoke(disruptor.clj:74) ~[storm-core-0.9.0.1.jar:na]
	at backtype.storm.util$async_loop$fn__444.invoke(util.clj:403) ~[storm-core-0.9.0.1.jar:na]
	at clojure.lang.AFn.run(AFn.java:24) [clojure-1.4.0.jar:na]
	at java.lang.Thread.run(Thread.java:662) [na:1.6.0_45]
Caused by: java.lang.RuntimeException: Client is being closed, and does not take requests any more
	at backtype.storm.messaging.netty.Client.send(Client.java:109) ~[storm-netty-0.9.0.1.jar:na]
	at backtype.storm.daemon.worker$mk_transfer_tuples_handler$fn__5867$fn__5868.invoke(worker.clj:304) ~[storm-core-0.9.0.1.jar:na]
	at backtype.storm.daemon.worker$mk_transfer_tuples_handler$fn__5867.invoke(worker.clj:293) ~[storm-core-0.9.0.1.jar:na]
	at backtype.storm.disruptor$clojure_handler$reify__2962.onEvent(disruptor.clj:43) ~[storm-core-0.9.0.1.jar:na]
	at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:87) ~[storm-core-0.9.0.1.jar:na]
	... 6 common frames omitted
2014-01-24 11:10:24 b.s.util [INFO] Halting process: ("Async loop died!")

This does not seem to be related to the UnknownHostException I mentioned earlier.

Thanks


