Posted to user@mesos.apache.org by Dino Lokmic <di...@ngs.ba> on 2017/11/23 09:00:04 UTC

Persistent volumes

I have a few machines on Linode and I run Mesos there. Can someone explain to
me how to set up volumes correctly?

Right now I run tasks via Marathon like this:

...

"constraints": [
    [
      "hostname",
      "CLUSTER",
      "HOSTNAME"
    ]
  ],
  "container": {
    "type": "DOCKER",
    "volumes": [
      {
        "containerPath": "/opt/storm/storm-local",
        "hostPath": "/opt/docker_data/storm/storm-local",
        "mode": "RW"
      }
    ],
    "docker": {
      "image": "xxxx",
      "network": "HOST",
      "portMappings": [],
      "privileged": false,
      "parameters": [],
      "forcePullImage": true
    }
  },
...

So if the task is restarted I can be sure it has access to the previously
used data. As you can see, I have a scaling problem and my task depends on
this node.

I would like my apps to be node independent and to have redundant data.

What is the best practice for this?

I want to scale the application to 2 instances, I1 and I2:

Instance I1 runs on agent A1 and uses volume V1
Instance I2 runs on agent A2 and uses volume V2

If agent A1 stops, I1 is restarted on A3 and uses V1
If V1 fails, I1 uses a copy of the data from V3...


Can someone point me to an article describing this, or at least give me a few
"keywords"?


Thanks

Re: Persistent volumes

Posted by Dino Lokmic <di...@ngs.ba>.
Thanks for the answers.

Yes, I use Marathon and Mesos.


Re: Persistent volumes

Posted by Benjamin Mahler <bm...@apache.org>.
+jpeach

The polling mechanism is used by the "disk/du" isolator to handle the case
where we don't have filesystem support for enforcing a quota on a
per-directory basis. I believe the "disk/xfs" isolator will stop writes
with EDQUOT without killing the task:

http://mesos.apache.org/documentation/latest/isolators/disk-xfs/
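
Roughly, enabling it on an agent looks like this (a sketch only: the
master address and work_dir are placeholders, the work_dir must live on
an XFS filesystem mounted with project quotas, i.e. the "pquota" mount
option, and the --xfs_project_range flag and its range syntax are taken
from the doc above):

    # sketch: per-container disk quotas enforced via XFS project quotas
    mesos-agent --master=zk://master.example.com:2181/mesos \
                --work_dir=/var/lib/mesos \
                --isolation=disk/xfs \
                --xfs_project_range=[5000-10000]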


Re: Persistent volumes

Posted by Gabriel Hartmann <ga...@mesosphere.io>.
I agree with pretty much everything Hendrik just said, with the exception of
the use of disk quota. The polling mechanism employed for enforcing disk
usage means that any breach of the disk usage limit by a task results in
loss of access to that data forever. This is true for ROOT volumes at least.
MOUNT volumes can be configured to map to "real" devices, which can produce
normal write failures when disk limits are exceeded instead of essentially
revoking all access to the data forever.
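
In case it helps, here is a rough sketch of how an agent can advertise a
MOUNT disk through its --resources flag (the path and size are made up,
and other agent flags are elided; see the Mesos documentation on multiple
disks for the authoritative format):

    mesos-agent ... --resources='[
      {"name": "disk", "type": "SCALAR", "scalar": {"value": 102400},
       "disk": {"source": {"type": "MOUNT",
                           "mount": {"root": "/mnt/data1"}}}}
    ]'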


Re: Persistent volumes

Posted by Hendrik Haddorp <he...@gmx.net>.
As said, I only use persistent volumes with my own scheduler straight
on Mesos, so I do not know exactly how this works in Marathon...

The persistent volume is created on a Mesos agent and basically ends up
being a folder on that host's disk. So yes, you cannot use a volume
created on one agent/slave from a different one. For Marathon you would
need to set a hostname constraint that makes sure the same host is used
when restarting the task. You won't be able to fail over to a different
agent; you can just have Marathon restart your task once it fails. Also,
only one task at a time can have the volume bound.
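
For Marathon specifically, here is a rough, untested sketch of an app
definition with a local persistent volume, going by the persistent-volumes
documentation linked elsewhere in this thread (the id, image, and size are
placeholders; note that the containerPath of a persistent volume is
relative, since Marathon creates it inside the task's sandbox):

    {
      "id": "/storm-local-app",
      "instances": 1,
      "cpus": 1,
      "mem": 1024,
      "residency": { "taskLostBehavior": "WAIT_FOREVER" },
      "container": {
        "type": "DOCKER",
        "docker": { "image": "xxxx", "network": "HOST" },
        "volumes": [
          {
            "containerPath": "storm-local",
            "mode": "RW",
            "persistent": { "size": 512 }
          }
        ]
      },
      "upgradeStrategy": {
        "minimumHealthCapacity": 0.5,
        "maximumOverCapacity": 0
      }
    }

With residency set, Marathon should pin the restarted task to the agent
that holds the volume, which matches the hostname-constraint behavior
described above.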

Yes, you can achieve persistence in pretty much the same way by using a
host path, but then you are relying on implicit knowledge about your
environment, which is not very clean in my opinion, and thus you have a
tighter coupling. The nice thing about persistent volumes is that they
are managed by Mesos. I do not need to tell the Mesos admin that I need
space at some location. I do not need to do anything special if I have
multiple instances running, as they each get their own directory. And I
can programmatically destroy the volume, and then the directory on the
host gets deleted again (at least since Mesos 1.0). So in my opinion the
usage of persistent volumes is much cleaner. But there are certainly use
cases that do not really work with them, like being able to fail over to
a different host. For that you would either need a shared network mount
or storage like HDFS. Btw, the Mesos containerizer should also enforce
disk quotas, so your task would not be able to fill the filesystem.



Re: Persistent volumes

Posted by Dino Lokmic <di...@ngs.ba>.
Yes, I did. So I don't have to prepare it before the task? And I can't use
a volume created on slave A from slave B?

Once a task fails, where will it be restarted? Do I have to specify the
host?

If I do, it means I can achieve "persistence" the same way I deploy now, by
specifying a host path for the volume and a hostname:

....
  "constraints": [
    [
      "hostname",
      "CLUSTER",
      "MYHOSTNAME"
    ]
  ],
  "container": {
    "type": "DOCKER",
    "volumes": [
      {
        "containerPath": "/opt/storm/storm-local",
        "hostPath": "/opt/docker_data/storm/storm-local",
        "mode": "RW"
      },
      {
        "containerPath": "/opt/storm/logs",
        "hostPath": "/opt/docker_logs/storm/logs",
        "mode": "RW"
      },
      {
        "containerPath": "/home/xx/runtime/storm",
        "hostPath": "/home/xx/runtime/storm",
        "mode": "RO"
      }
    ],
    "docker": {
      "image": "xxx/storm-1.1.0",
      "network": "HOST",
      "portMappings": [],
      "privileged": false,
      "parameters": [],
      "forcePullImage": true
    }
  },

....





Re: Persistent volumes

Posted by Hendrik Haddorp <he...@gmx.net>.
I have my own scheduler that performs the create operation. As you are
using Marathon, this call would have to be done by Marathon.
Did you read
https://mesosphere.github.io/marathon/docs/persistent-volumes.html ?



Re: Persistent volumes

Posted by Dino Lokmic <di...@ngs.ba>.
@hendrik

How did you create this "my-volume-227927c2-3266-412b-8572-92c5c93c051a"
volume?


Re: Persistent volumes

Posted by Hendrik Haddorp <he...@gmx.net>.
Hi,

I'm using persistent volumes directly on Mesos, without Marathon. For
that, the scheduler (like Marathon) has to first reserve disk space and
then create a persistent volume with it. The next resource offer message
then contains the volume in the "disk" resource part of the offer. Now
you can start your task. In the request you would need to include the
resources, and for the "container" part of the request you would have:
     volumes {
         container_path: "/mount/point/in/container"
         host_path: "my-volume-227927c2-3266-412b-8572-92c5c93c051a"
         mode: RW
     }

The container path is the mount point in your container and the host 
path is the id of your persistent volume.
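
To show where that id comes from, here is a rough protobuf-text sketch of
the RESERVE and CREATE offer operations my scheduler sends back when
accepting an offer. The field names follow the Mesos Resource and
Offer.Operation messages, but the role, principal, and size here are
made-up placeholders:

     # reserve 512 MB of disk for our role, then turn it into a volume
     operations {
       type: RESERVE
       reserve {
         resources {
           name: "disk"
           type: SCALAR
           scalar { value: 512 }
           role: "my-role"                            # placeholder
           reservation { principal: "my-principal" }  # placeholder
         }
       }
     }
     operations {
       type: CREATE
       create {
         volumes {
           name: "disk"
           type: SCALAR
           scalar { value: 512 }
           role: "my-role"
           reservation { principal: "my-principal" }
           disk {
             # this id is what later shows up as the host_path
             persistence { id: "my-volume-227927c2-3266-412b-8572-92c5c93c051a" }
             volume {
               container_path: "/mount/point/in/container"
               mode: RW
             }
           }
         }
       }
     }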

In case you use Marathon, the documentation should be this:
https://mesosphere.github.io/marathon/docs/persistent-volumes.html

regards,
Hendrik



Re: Persistent volumes

Posted by Judith Malnick <jm...@mesosphere.io>.
Are you using DC/OS or "vanilla" Mesos and Marathon, without DC/OS? If you
are using Mesos, you might get a better answer on the Mesos mailing list
<http://mesos.apache.org/community/#mailing-lists>, and you can also check
out these docs on persistent volumes
<http://mesos.apache.org/documentation/latest/persistent-volume/>.

Hope this helps!
Judith



-- 
Judith Malnick
Community Manager
310-709-1517