You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@airavata.apache.org by "Eroma (JIRA)" <ji...@apache.org> on 2015/06/09 22:57:01 UTC

[jira] [Updated] (AIRAVATA-1719) Experiment in EXECUTING state without a job status in PGA

     [ https://issues.apache.org/jira/browse/AIRAVATA-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eroma updated AIRAVATA-1719:
----------------------------
    Description: 
Steps 
1. Saved and launched batch of experiments to Gordon
2. One has EXECUTING experiment state without any job state or any error messages in PGA
3. In airavata log has the error [1]
4. Experiment folder does not exist in the resource worker directory or in the airavata /tmp
exp ID: SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a

[1]
[ERROR] KeeperErrorCode = NodeExists for /gfac-experiments/gfac-node1/SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a
org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for /gfac-experiments/gfac-node1/SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:119)
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
        at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:778)
        at org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:696)
        at org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:679)
        at org.apache.curator.RetryLoop.callWithRetry(RetryLoop.java:107)
        at org.apache.curator.framework.imps.CreateBuilderImpl.pathInForeground(CreateBuilderImpl.java:676)
        at org.apache.curator.framework.imps.CreateBuilderImpl.protectedPathInForeground(CreateBuilderImpl.java:453)
        at org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:443)
        at org.apache.curator.framework.imps.CreateBuilderImpl$3.forPath(CreateBuilderImpl.java:251)
        at org.apache.curator.framework.imps.CreateBuilderImpl$3.forPath(CreateBuilderImpl.java:205)
        at org.apache.airavata.gfac.core.utils.GFacUtils.createExperimentEntryForPassive(GFacUtils.java:410)
        at org.apache.airavata.gfac.server.GfacServerHandler$TaskLaunchMessageHandler.onMessage(GfacServerHandler.java:326)
        at org.apache.airavata.messaging.core.impl.RabbitMQTaskLaunchConsumer$2.handleDelivery(RabbitMQTaskLaunchConsumer.java:195)
        at com.rabbitmq.client.impl.ConsumerDispatcher$5.run(ConsumerDispatcher.java:144)
        at com.rabbitmq.client.impl.ConsumerWorkService$WorkPoolRunnable.run(ConsumerWorkService.java:99)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
        at java.lang.Thread.run(Thread.java:745)
*deliveryTag:168
 Message Received with message id 'LAUNCH.TASK-44e7f2fc-d206-4d9d-85cb-c03e7545a479' and with message type 'LAUNCHTASK


  was:
Steps 
1. Saved and launched batch of experiments to Gordon
2. One has EXECUTING experiment state without any job state or any error messages in PGA
3. In airavata log has the error [1]
4. Experiment folder does not exist in the rescue worker directory or in the airavata /tmp
exp ID: SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a

[1]
[ERROR] KeeperErrorCode = NodeExists for /gfac-experiments/gfac-node1/SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a
org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for /gfac-experiments/gfac-node1/SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:119)
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
        at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:778)
        at org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:696)
        at org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:679)
        at org.apache.curator.RetryLoop.callWithRetry(RetryLoop.java:107)
        at org.apache.curator.framework.imps.CreateBuilderImpl.pathInForeground(CreateBuilderImpl.java:676)
        at org.apache.curator.framework.imps.CreateBuilderImpl.protectedPathInForeground(CreateBuilderImpl.java:453)
        at org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:443)
        at org.apache.curator.framework.imps.CreateBuilderImpl$3.forPath(CreateBuilderImpl.java:251)
        at org.apache.curator.framework.imps.CreateBuilderImpl$3.forPath(CreateBuilderImpl.java:205)
        at org.apache.airavata.gfac.core.utils.GFacUtils.createExperimentEntryForPassive(GFacUtils.java:410)
        at org.apache.airavata.gfac.server.GfacServerHandler$TaskLaunchMessageHandler.onMessage(GfacServerHandler.java:326)
        at org.apache.airavata.messaging.core.impl.RabbitMQTaskLaunchConsumer$2.handleDelivery(RabbitMQTaskLaunchConsumer.java:195)
        at com.rabbitmq.client.impl.ConsumerDispatcher$5.run(ConsumerDispatcher.java:144)
        at com.rabbitmq.client.impl.ConsumerWorkService$WorkPoolRunnable.run(ConsumerWorkService.java:99)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
        at java.lang.Thread.run(Thread.java:745)
*deliveryTag:168
 Message Received with message id 'LAUNCH.TASK-44e7f2fc-d206-4d9d-85cb-c03e7545a479' and with message type 'LAUNCHTASK



> Experiment in EXECUTING state without a job status in PGA
> ---------------------------------------------------------
>
>                 Key: AIRAVATA-1719
>                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1719
>             Project: Airavata
>          Issue Type: Bug
>          Components: Airavata System
>         Environment: http://dev.test-drive.airavata.org/portal/ultrascan-testing/public/
>            Reporter: Eroma
>
> Steps 
> 1. Saved and launched batch of experiments to Gordon
> 2. One has EXECUTING experiment state without any job state or any error messages in PGA
> 3. In airavata log has the error [1]
> 4. Experiment folder does not exist in the resource worker directory or in the airavata /tmp
> exp ID: SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a
> [1]
> [ERROR] KeeperErrorCode = NodeExists for /gfac-experiments/gfac-node1/SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a
> org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for /gfac-experiments/gfac-node1/SLM3-US-Gordon-06-05_16-09-03_c58f5064-e956-41bd-a6ac-bd37b2b0bf5a
>         at org.apache.zookeeper.KeeperException.create(KeeperException.java:119)
>         at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>         at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:778)
>         at org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:696)
>         at org.apache.curator.framework.imps.CreateBuilderImpl$11.call(CreateBuilderImpl.java:679)
>         at org.apache.curator.RetryLoop.callWithRetry(RetryLoop.java:107)
>         at org.apache.curator.framework.imps.CreateBuilderImpl.pathInForeground(CreateBuilderImpl.java:676)
>         at org.apache.curator.framework.imps.CreateBuilderImpl.protectedPathInForeground(CreateBuilderImpl.java:453)
>         at org.apache.curator.framework.imps.CreateBuilderImpl.forPath(CreateBuilderImpl.java:443)
>         at org.apache.curator.framework.imps.CreateBuilderImpl$3.forPath(CreateBuilderImpl.java:251)
>         at org.apache.curator.framework.imps.CreateBuilderImpl$3.forPath(CreateBuilderImpl.java:205)
>         at org.apache.airavata.gfac.core.utils.GFacUtils.createExperimentEntryForPassive(GFacUtils.java:410)
>         at org.apache.airavata.gfac.server.GfacServerHandler$TaskLaunchMessageHandler.onMessage(GfacServerHandler.java:326)
>         at org.apache.airavata.messaging.core.impl.RabbitMQTaskLaunchConsumer$2.handleDelivery(RabbitMQTaskLaunchConsumer.java:195)
>         at com.rabbitmq.client.impl.ConsumerDispatcher$5.run(ConsumerDispatcher.java:144)
>         at com.rabbitmq.client.impl.ConsumerWorkService$WorkPoolRunnable.run(ConsumerWorkService.java:99)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
>         at java.lang.Thread.run(Thread.java:745)
> *deliveryTag:168
>  Message Received with message id 'LAUNCH.TASK-44e7f2fc-d206-4d9d-85cb-c03e7545a479' and with message type 'LAUNCHTASK



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)