You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@storm.apache.org by "Jungtaek Lim (JIRA)" <ji...@apache.org> on 2016/08/17 21:58:20 UTC

[jira] [Resolved] (STORM-1915) Supervisor keeps restarting forever

     [ https://issues.apache.org/jira/browse/STORM-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jungtaek Lim resolved STORM-1915.
---------------------------------
       Resolution: Fixed
         Assignee: Jungtaek Lim
    Fix Version/s: 1.1.0
                   1.0.2
                   2.0.0

Resolving this since STORM-1934 was merged.

> Supervisor keeps restarting forever
> -----------------------------------
>
>                 Key: STORM-1915
>                 URL: https://issues.apache.org/jira/browse/STORM-1915
>             Project: Apache Storm
>          Issue Type: Bug
>          Components: storm-core
>    Affects Versions: 1.0.1
>         Environment: Linode 4GB running on KVM - Ubuntu 14.04 LTS
>            Reporter: Gergely Nagy
>            Assignee: Jungtaek Lim
>             Fix For: 2.0.0, 1.0.2, 1.1.0
>
>
> While submitting a topology to a 20 node 40 worker strong cluster, the supervisor keeps throwing errors and keeps restarting the workers it is supervising.
> For this reason the topology never starts, instead it keeps dancing by reassigning the bolts and spouts forever.
> I'd love to attach the logs here but I can't find any upload button in the JIRA form.
> The error basically says:
> {code}
> 2016-06-18 12:04:26.589 o.a.s.config [WARN] Failed to get worker user for . #error {
>  :cause /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users (Is a directory)
>  :via
>  [{:type java.io.FileNotFoundException
>    :message /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users (Is a directory)
>    :at [java.io.FileInputStream open0 FileInputStream.java -2]}]
>  :trace
>  [[java.io.FileInputStream open0 FileInputStream.java -2]
>   [java.io.FileInputStream open FileInputStream.java 195]
>   [java.io.FileInputStream <init> FileInputStream.java 138]
>   [clojure.java.io$fn__9189 invoke io.clj 229]
>   [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69]
>   [clojure.java.io$fn__9201 invoke io.clj 258]
>   [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69]
>   [clojure.java.io$fn__9163 invoke io.clj 165]
>   [clojure.java.io$fn__9115$G__9091__9122 invoke io.clj 69]
>   [clojure.java.io$reader doInvoke io.clj 102]
>   [clojure.lang.RestFn invoke RestFn.java 410]
>   [clojure.lang.AFn applyToHelper AFn.java 154]
>   [clojure.lang.RestFn applyTo RestFn.java 132]
>   [clojure.core$apply invoke core.clj 632]
>   [clojure.core$slurp doInvoke core.clj 6653]
>   [clojure.lang.RestFn invoke RestFn.java 410]
>   [org.apache.storm.config$get_worker_user invoke config.clj 239]
>   [org.apache.storm.daemon.supervisor$shutdown_worker invoke supervisor.clj 281]
>   [org.apache.storm.daemon.supervisor$kill_existing_workers_with_change_in_components invoke supervisor.clj 536]
>   [org.apache.storm.daemon.supervisor$mk_synchronize_supervisor$this__9078 invoke supervisor.clj 595]
>   [org.apache.storm.event$event_manager$fn__8630 invoke event.clj 40]
>   [clojure.lang.AFn run AFn.java 22]
>   [java.lang.Thread run Thread.java 745]]}
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)