You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@airavata.apache.org by "Eroma (JIRA)" <ji...@apache.org> on 2015/03/26 16:50:53 UTC

[jira] [Created] (AIRAVATA-1651) Zookeeper connection lost error; Experiment failed

Eroma created AIRAVATA-1651:
-------------------------------

             Summary: Zookeeper connection lost error; Experiment failed
                 Key: AIRAVATA-1651
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1651
             Project: Airavata
          Issue Type: Bug
         Environment: http://test-drive.airavata.org/pga/public
            Reporter: Eroma


Two experiment has the same error message in log
One experiment got FAILED at experiment level and no job status recorded.
Other Experiment failed but the job got COMPLETE. Randomely occurs. was unable to recreate

error messages retrived from log;
2015-03-26 09:33:34,693 [main-SendThread(gw127.iu.xsede.org:9181)] INFO  org.apache.zookeeper.ClientCnxn  - Opening socket connection to server gw127.iu.xsede.org/149.165.228.125:9181
...skipping...
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-WRF-Stampede_c0697813-a8f4-4d8a-b0f3-6808f8538b18+IDontNeedaNode_a3b6133f-f8af-435d-9b2a-76838db535f6/org.apache.airavata.gfac.gsissh.handler.GSISSHInputHandler/state
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
       at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1228)
       at org.apache.airavata.gfac.core.utils.GFacUtils.updatePluginState(GFacUtils.java:1013)
       at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeInFlowHandlers(BetterGfacImpl.java:902)
       at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.launch(BetterGfacImpl.java:690)
       at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:481)
       at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:210)
       at org.apache.airavata.gfac.core.utils.InputHandlerWorker.call(InputHandlerWorker.java:49)
       at java.util.concurrent.FutureTask.run(FutureTask.java:262)
       at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
       at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
       at java.lang.Thread.run(Thread.java:745)

and 

aused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-Trinity-Stampede_0bd73a38-6931-498f-af7b-d700dc177c43+IDontNeedaNode_db287294-796d-43c1-896d-e3b412b4c8a7/org.apache.airavata.gfac.ssh.handler.AdvancedSCPOutputHandler
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
       at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
       at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1003)
       at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1031)
       at org.apache.airavata.gfac.core.utils.GFacUtils.createPluginZnode(GFacUtils.java:935)
       at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeOutFlowHandlers(BetterGfacImpl.java:939)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)