You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@ambari.apache.org by "Dmitry Lysnichenko (JIRA)" <ji...@apache.org> on 2014/03/11 21:34:48 UTC

[jira] [Updated] (AMBARI-5036) Secured: Start All Services task got stuck forever

     [ https://issues.apache.org/jira/browse/AMBARI-5036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitry Lysnichenko updated AMBARI-5036:
---------------------------------------

    Description: 
# Deployed 2-node cluster.
# Added 3rd node.
# Enabled security.

After steps above on all 3 hosts tasks jammed and don't want to perform or fail for a very long time.
VMs are alive, ambari-server and all ambari-agents are running.

Finally got a reproduce using 2 commands
{code}
curl 'http://vm-0.vm:8080/api/v1/clusters/cc/services?params/run_smoke_test=false' -X PUT -H 'X-Requested-By: X-Requested-By' -u admin:admin --data '{"RequestInfo": {"context": "Start All Services"}, "Body": {"ServiceInfo": {"state": "STARTED"}}}' ; sleep 3;  curl 'http://vm-0.vm:8080/api/v1/clusters/cc/hosts/vm-0.vm/host_components/APP_TIMELINE_SERVER' -X DELETE -H 'X-Requested-By: X-Requested-By' -u admin:admin
{code}
The way to reproduce is a bit different compared to an original description (I issue a DELETE request in 3 seconds after START_ALL_SERVICES request has been issued), but the symptoms are the same: ServiceComponentHostNotFoundException exception is posted to log and operation is stuck on stage that contains "App Timeline Server Start" command. 


  was:
# Deployed 2-node cluster.
# Added 3rd node.
# Enabled security.

After steps above on all 3 hosts tasks jammed and don't want to perform or fail for a very long time.
VMs are alive, ambari-server and all ambari-agents are running.



> Secured: Start All Services task got stuck forever
> --------------------------------------------------
>
>                 Key: AMBARI-5036
>                 URL: https://issues.apache.org/jira/browse/AMBARI-5036
>             Project: Ambari
>          Issue Type: Bug
>          Components: controller, test
>    Affects Versions: 1.5.0
>            Reporter: Dmitry Lysnichenko
>            Assignee: Dmitry Lysnichenko
>             Fix For: 1.5.1
>
>
> # Deployed 2-node cluster.
> # Added 3rd node.
> # Enabled security.
> After steps above on all 3 hosts tasks jammed and don't want to perform or fail for a very long time.
> VMs are alive, ambari-server and all ambari-agents are running.
> Finally got a reproduce using 2 commands
> {code}
> curl 'http://vm-0.vm:8080/api/v1/clusters/cc/services?params/run_smoke_test=false' -X PUT -H 'X-Requested-By: X-Requested-By' -u admin:admin --data '{"RequestInfo": {"context": "Start All Services"}, "Body": {"ServiceInfo": {"state": "STARTED"}}}' ; sleep 3;  curl 'http://vm-0.vm:8080/api/v1/clusters/cc/hosts/vm-0.vm/host_components/APP_TIMELINE_SERVER' -X DELETE -H 'X-Requested-By: X-Requested-By' -u admin:admin
> {code}
> The way to reproduce is a bit different compared to an original description (I issue a DELETE request in 3 seconds after START_ALL_SERVICES request has been issued), but the symptoms are the same: ServiceComponentHostNotFoundException exception is posted to log and operation is stuck on stage that contains "App Timeline Server Start" command. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)