You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@aurora.apache.org by Tameem Ahmed <ta...@hotmail.com> on 2015/11/22 17:29:44 UTC

Basic Question : Sandbox disk Space Reclaimed

Hi Team,
           I am putting in baby steps to learn aurora, please excuse me if I am asking very silly/basic questions.
           Please do recommend if there is any other DL/Group who can help us learn little more about the system.
           I have a job for which here is the status :

           INFO] Active Tasks (0)           INFO] Inactive Tasks (4)
           INFO] role: root, env: staging, name: app_1328, shard: 1, status: SANDBOX_DELETED on stage2.qa.com           cpus: 0.01, ram: 500 MB, disk: 1024 MB	   failure count: 1 (max 1)                 2015-11-20 12:10:01 PENDING: None		 2015-11-20 12:10:01 ASSIGNED: None		 2015-11-20 12:10:03 STARTING: Initializing sandbox.		 2015-11-20 12:10:04 RUNNING: None		 2015-11-20 12:13:20 FAILED: Task failed.		 2015-11-21 04:07:41 SANDBOX_DELETED: Sandbox disk space reclaimed.
         What can I do the put the job back in to Active, tried all possibilities.   Made sure the slave machine is running both mesos-slave and observer, have enough disk space.  
         Master is running quite a few jobs, no issues. 
         What can do to make it running again. 
Thanks,Tameem

Re: Basic Question : Sandbox disk Space Reclaimed

Posted by Zameer Manji <zm...@apache.org>.

Tameem,

What version of Aurora are you running? As far as I know SANDBOX_DELETED
was removed in Aurora 0.7.0.

On Mon, Nov 23, 2015 at 4:46 PM, Bill Farner <wf...@apache.org> wrote:

> You'll want to drill into why the task failed.  This usually means the
> process launched for you exited non-zero.  Since the sandbox has been
> deleted (as indicated by the final state transition), you won't be able to
> look at stdout/stderr from the process(es), but that's likely what you'll
> be after.  If you launch the task again, does it fail again?
>
> On Sun, Nov 22, 2015 at 8:29 AM, Tameem Ahmed <ta...@hotmail.com>
> wrote:
>
> > Hi Team,
> >            I am putting in baby steps to learn aurora, please excuse me
> if
> > I am asking very silly/basic questions.
> >            Please do recommend if there is any other DL/Group who can
> help
> > us learn little more about the system.
> >            I have a job for which here is the status :
> >
> >            INFO] Active Tasks (0)           INFO] Inactive Tasks (4)
> >            INFO] role: root, env: staging, name: app_1328, shard: 1,
> > status: SANDBOX_DELETED on stage2.qa.com           cpus: 0.01, ram: 500
> > MB, disk: 1024 MB     failure count: 1 (max 1)                 2015-11-20
> > 12:10:01 PENDING: None            2015-11-20 12:10:01 ASSIGNED: None
> >       2015-11-20 12:10:03 STARTING: Initializing sandbox.
> >  2015-11-20 12:10:04 RUNNING: None               2015-11-20 12:13:20
> > FAILED: Task failed.                2015-11-21 04:07:41 SANDBOX_DELETED:
> > Sandbox disk space reclaimed.
> >          What can I do the put the job back in to Active, tried all
> > possibilities.   Made sure the slave machine is running both mesos-slave
> > and observer, have enough disk space.
> >          Master is running quite a few jobs, no issues.
> >          What can do to make it running again.
> > Thanks,Tameem
> >
> >
>
> --
> Zameer Manji
>
>

Re: Basic Question : Sandbox disk Space Reclaimed

Posted by Bill Farner <wf...@apache.org>.

You'll want to drill into why the task failed.  This usually means the
process launched for you exited non-zero.  Since the sandbox has been
deleted (as indicated by the final state transition), you won't be able to
look at stdout/stderr from the process(es), but that's likely what you'll
be after.  If you launch the task again, does it fail again?

On Sun, Nov 22, 2015 at 8:29 AM, Tameem Ahmed <ta...@hotmail.com>
wrote:

> Hi Team,
>            I am putting in baby steps to learn aurora, please excuse me if
> I am asking very silly/basic questions.
>            Please do recommend if there is any other DL/Group who can help
> us learn little more about the system.
>            I have a job for which here is the status :
>
>            INFO] Active Tasks (0)           INFO] Inactive Tasks (4)
>            INFO] role: root, env: staging, name: app_1328, shard: 1,
> status: SANDBOX_DELETED on stage2.qa.com           cpus: 0.01, ram: 500
> MB, disk: 1024 MB     failure count: 1 (max 1)                 2015-11-20
> 12:10:01 PENDING: None            2015-11-20 12:10:01 ASSIGNED: None
>       2015-11-20 12:10:03 STARTING: Initializing sandbox.
>  2015-11-20 12:10:04 RUNNING: None               2015-11-20 12:13:20
> FAILED: Task failed.                2015-11-21 04:07:41 SANDBOX_DELETED:
> Sandbox disk space reclaimed.
>          What can I do the put the job back in to Active, tried all
> possibilities.   Made sure the slave machine is running both mesos-slave
> and observer, have enough disk space.
>          Master is running quite a few jobs, no issues.
>          What can do to make it running again.
> Thanks,Tameem
>
>