You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@airavata.apache.org by "Shameera Rathnayaka (JIRA)" <ji...@apache.org> on 2015/03/30 23:36:53 UTC

[jira] [Updated] (AIRAVATA-1354) Job monitor for Stampede unknow status

     [ https://issues.apache.org/jira/browse/AIRAVATA-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shameera Rathnayaka updated AIRAVATA-1354:
------------------------------------------
    Labels: Monitoring  (was: )

> Job monitor for Stampede unknow status
> --------------------------------------
>
>                 Key: AIRAVATA-1354
>                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1354
>             Project: Airavata
>          Issue Type: Improvement
>          Components: GFac
>            Reporter: Raminderjeet Singh
>              Labels: Monitoring
>
> We should using experiment id to name the jobs for unique identifier and then use that job name to identify if the job get to unknown status. If the job still is in unknown state we should check in working directory for stdout/err and make corrective action to correct the UNKNOWN statues. Same logic will be useful for job recovery if Airavata server restart.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)