You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@airavata.apache.org by "Eroma (JIRA)" <ji...@apache.org> on 2016/01/19 17:35:40 UTC

[jira] [Closed] (AIRAVATA-1651) Zookeeper connection lost error; Experiment failed

     [ https://issues.apache.org/jira/browse/AIRAVATA-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eroma closed AIRAVATA-1651.
---------------------------

This is no longer re-creatable with airaveata 0.16
https://dev.testdrive.airavata.org

> Zookeeper connection lost error; Experiment failed
> --------------------------------------------------
>
>                 Key: AIRAVATA-1651
>                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1651
>             Project: Airavata
>          Issue Type: Bug
>         Environment: http://test-drive.airavata.org/pga/public
>            Reporter: Eroma
>            Assignee: Shameera Rathnayaka
>             Fix For: 0.15 
>
>
> Two experiment has the same error message in log
> One experiment got FAILED at experiment level and no job status recorded.
> Other Experiment failed but the job got COMPLETE. Randomely occurs. was unable to recreate
> error messages retrived from log;
> 2015-03-26 09:33:34,693 [main-SendThread(gw127.iu.xsede.org:9181)] INFO  org.apache.zookeeper.ClientCnxn  - Opening socket connection to server gw127.iu.xsede.org/149.165.228.125:9181
> ...skipping...
> org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-WRF-Stampede_c0697813-a8f4-4d8a-b0f3-6808f8538b18+IDontNeedaNode_a3b6133f-f8af-435d-9b2a-76838db535f6/org.apache.airavata.gfac.gsissh.handler.GSISSHInputHandler/state
>        at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>        at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1228)
>        at org.apache.airavata.gfac.core.utils.GFacUtils.updatePluginState(GFacUtils.java:1013)
>        at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeInFlowHandlers(BetterGfacImpl.java:902)
>        at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.launch(BetterGfacImpl.java:690)
>        at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:481)
>        at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.submitJob(BetterGfacImpl.java:210)
>        at org.apache.airavata.gfac.core.utils.InputHandlerWorker.call(InputHandlerWorker.java:49)
>        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
>        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>        at java.lang.Thread.run(Thread.java:745)
> and 
> aused by: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /gfac-experiments/gfac-node0/SLM-Trinity-Stampede_0bd73a38-6931-498f-af7b-d700dc177c43+IDontNeedaNode_db287294-796d-43c1-896d-e3b412b4c8a7/org.apache.airavata.gfac.ssh.handler.AdvancedSCPOutputHandler
>        at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>        at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1003)
>        at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1031)
>        at org.apache.airavata.gfac.core.utils.GFacUtils.createPluginZnode(GFacUtils.java:935)
>        at org.apache.airavata.gfac.core.cpi.BetterGfacImpl.invokeOutFlowHandlers(BetterGfacImpl.java:939)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)