Posted to dev@aurora.apache.org by Joe Stein <jo...@stealth.ly> on 2014/08/12 22:00:45 UTC

best practice to use host local drive?

Hi, I'd like recommendations on the best practice for accessing a local
drive (e.g. /mnt/data/stuff) from a process running inside a task
launched by Aurora.

Thanks!

/*******************************************
 Joe Stein
 Founder, Principal Consultant
 Big Data Open Source Security LLC
 http://www.stealth.ly
 Twitter: @allthingshadoop <http://www.twitter.com/allthingshadoop>
********************************************/

Re: best practice to use host local drive?

Posted by Chris Lambert <cl...@twitter.com.INVALID>.
On Tue, Aug 12, 2014 at 1:18 PM, Bill Farner <wf...@apache.org> wrote:

> The answer will depend on the use case.  Here are the three most common:
>
> - read/write scratch space (e.g. /tmp)
>
> Prefer using the sandbox instead (avoid the problem).  Otherwise, create a
> directory containing a UUID and try to clean up after yourself.  This is
> ultimately brittle.
>

Just FYI:  there's a Mesos ticket for (optional) per-container tmp dirs,
including an outstanding review:
https://issues.apache.org/jira/browse/MESOS-1586


> - read-only data available on every host
>
> No big deal (aside from races when the data changes), read away.
>
> - read/write, persistent data
>
> There's not a great story here, at least not in the traditional
> mesos/aurora "run anywhere" model.  The best approach we have so far is to
> mount to a directory owned by the user that will access the data, constrain
> tasks to run on that host (dedicated=dba/mysql), and configure the slave
> with the dedicated attribute having the same value.
>
>
>
>
>
> -=Bill
>
>

Re: best practice to use host local drive?

Posted by Jay Buffington <me...@jaybuff.com>.
On Tue, Aug 12, 2014 at 1:18 PM, Bill Farner <wf...@apache.org> wrote:
> - read/write, persistent data
>
> There's not a great story here, at least not in the traditional
> mesos/aurora "run anywhere" model.  The best approach we have so far is to
> mount to a directory owned by the user that will access the data, constrain
> tasks to run on that host (dedicated=dba/mysql), and configure the slave
> with the dedicated attribute having the same value.

There is a proposal in Mesos for the concept of persistent resources.  See
https://issues.apache.org/jira/browse/MESOS-1554

If this were implemented, it would give Aurora the ability to launch a task
that uses the same storage as a previous task.

Jay

Re: best practice to use host local drive?

Posted by Bill Farner <wf...@apache.org>.
The answer will depend on the use case.  Here are the three most common:

- read/write scratch space (e.g. /tmp)

Prefer using the sandbox instead (avoid the problem).  Otherwise, create a
directory containing a UUID and try to clean up after yourself.  This is
ultimately brittle.
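
For the UUID approach, a minimal Python sketch might look like the
following. The helper name scratch_dir and the "scratch-" prefix are
illustrative only, not part of Aurora or Mesos. Cleanup runs only when the
block exits normally or through an exception; a SIGKILL or a host crash
leaves the directory behind, which is part of why this is brittle.

```python
import os
import shutil
import uuid
from contextlib import contextmanager


@contextmanager
def scratch_dir(base="/tmp"):
    """Yield a uniquely named directory under `base`, removing it on exit.

    `base` defaults to /tmp as in the thread; any writable host path works.
    """
    # A random UUID in the name keeps concurrent tasks on one host apart.
    path = os.path.join(base, "scratch-" + uuid.uuid4().hex)
    os.makedirs(path, mode=0o700)
    try:
        yield path
    finally:
        # Best-effort cleanup; ignore errors so a failed removal does not
        # mask an exception raised by the caller.
        shutil.rmtree(path, ignore_errors=True)
```

A process would wrap its work in "with scratch_dir() as path:" and write
only under path.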

- read-only data available on every host

No big deal (aside from races when the data changes), read away.

- read/write, persistent data

There's not a great story here, at least not in the traditional
mesos/aurora "run anywhere" model.  The best approach we have so far is to
mount the storage at a directory owned by the user that will access the
data, constrain tasks to run on that host with a dedicated constraint
(dedicated=dba/mysql), and configure the slave on that host with a
dedicated attribute of the same value.
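
As a rough illustration of the dedicated setup, a job definition might look
like the fragment below. This is a sketch of an .aurora config file, not
standalone Python: the Aurora client provides Process, Task, Resources,
Service and GB when it loads the file. The cluster name, environment, mount
point, command line and resource sizes are all placeholders; only the
dedicated=dba/mysql value comes from the description above.

```python
# Hypothetical mysql.aurora file. The host's slave is assumed to have been
# started with a matching attribute, for example:
#   mesos-slave --attributes="dedicated:dba/mysql" ...

mysqld = Process(
  name = 'mysqld',
  # /mnt/data/mysql is an assumed mount point owned by the 'dba' user.
  cmdline = 'mysqld --datadir=/mnt/data/mysql')

mysql_task = Task(
  name = 'mysql',
  processes = [mysqld],
  resources = Resources(cpu = 1.0, ram = 1*GB, disk = 1*GB))

jobs = [
  Service(
    cluster = 'example',      # placeholder cluster name
    role = 'dba',             # role part of the dedicated value dba/mysql
    environment = 'prod',     # placeholder environment
    name = 'mysql',
    task = mysql_task,
    # Only schedule onto hosts whose slave advertises dedicated=dba/mysql.
    constraints = {'dedicated': 'dba/mysql'})
]
```

The data still lives on a single host, so losing that host means losing the
data unless it is replicated some other way.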





-=Bill

