You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@storm.apache.org by "Gergely Nagy (JIRA)" <ji...@apache.org> on 2016/06/18 19:38:05 UTC

[jira] [Created] (STORM-1915) Supervisor keeps restarting forever

Gergely Nagy created STORM-1915:
-----------------------------------

             Summary: Supervisor keeps restarting forever
                 Key: STORM-1915
                 URL: https://issues.apache.org/jira/browse/STORM-1915
             Project: Apache Storm
          Issue Type: Bug
          Components: storm-core
    Affects Versions: 1.0.1
         Environment: Linode 4GB running on KVM - Ubuntu 14.04 LTS
            Reporter: Gergely Nagy


While submitting a topology with to a 20 node 40 worker strong cluster, the supervisor keeps throwing errors and keeps restarting the workers it is supervising.

For this reason the topology never starts, instead it keeps dancing by reassigning the bolts and spouts forever.

I'd love to attach the logs here but I can't find any upload button in the JIRA form.

The error basically says:
{code}
2016-06-18 12:04:26.589 o.a.s.config [WARN] Failed to get worker user for . #error {
 :cause /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users (Is a directory)
 :via
 [{:type java.io.FileNotFoundException
   :message /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users (Is a directory)
   :at [java.io.FileInputStream open0 FileInputStream.java -2]}]
 :trace
 [[java.io.FileInputStream open0 FileInputStream.java -2]
  [java.io.FileInputStream open FileInputStream.java 195]
  [java.io.FileInputStream <init> FileInputStream.java 138]
  [clojure.java.io$fn__9189 invoke io.clj 229]
  [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69]
  [clojure.java.io$fn__9201 invoke io.clj 258]
  [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69]
  [clojure.java.io$fn__9163 invoke io.clj 165]
  [clojure.java.io$fn__9115$G__9091__9122 invoke io.clj 69]
  [clojure.java.io$reader doInvoke io.clj 102]
  [clojure.lang.RestFn invoke RestFn.java 410]
  [clojure.lang.AFn applyToHelper AFn.java 154]
  [clojure.lang.RestFn applyTo RestFn.java 132]
  [clojure.core$apply invoke core.clj 632]
  [clojure.core$slurp doInvoke core.clj 6653]
  [clojure.lang.RestFn invoke RestFn.java 410]
  [org.apache.storm.config$get_worker_user invoke config.clj 239]
  [org.apache.storm.daemon.supervisor$shutdown_worker invoke supervisor.clj 281]
  [org.apache.storm.daemon.supervisor$kill_existing_workers_with_change_in_components invoke supervisor.clj 536]
  [org.apache.storm.daemon.supervisor$mk_synchronize_supervisor$this__9078 invoke supervisor.clj 595]
  [org.apache.storm.event$event_manager$fn__8630 invoke event.clj 40]
  [clojure.lang.AFn run AFn.java 22]
  [java.lang.Thread run Thread.java 745]]}
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)