Posted to users@cloudstack.apache.org by li jerry <di...@hotmail.com> on 2019/05/28 04:16:25 UTC

RBD primary storage VM encounters Exclusive Lock after triggering HA

Hello guys

We’ve deployed an environment with CloudStack 4.11.2 and KVM (CentOS 7.6), with Ceph 13.2.5 as the primary storage.
We found some issues with the HA solution, and we’d like to ask for your suggestions.

We’ve enabled both the VM HA and Host HA features in CloudStack, and the compute offering is tagged as ha.
When we perform a power-failure test (unplugging 1 node of 4), the running VMs on the failed node are automatically rescheduled to the other live nodes after 5 minutes, but none of them can boot into the OS. We found the boot procedure is blocked by I/O read/write failures.



The following is displayed after the VM starts:

Generating "/run/initramfs/rdsosreport.txt"

Entering emergency mode. Exit the shell to continue.
Type "journalctl" to view system logs.
You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick or /boot
after mounting them and attach it to a bug report

:/#



We found this is caused by an exclusive lock held on the image:
[root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
There is 1 exclusive lock on this image.
Locker         ID                  Address
client.1164351 auto 94464726847232 10.226.16.128:0/3002249644

If we remove the lock from the image and restart the VM in CloudStack, the VM boots successfully.
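
For reference, removing such a lock by hand typically looks something like the following sketch; the lock ID and locker are taken from the rbd lock list output above and will differ per image:

[root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
[root@cn01-nodea ~]# rbd lock remove a93010b0-2be2-49bd-b25e-ec89b3a98b4b "auto 94464726847232" client.1164351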

We know that disabling the Exclusive Lock feature in Ceph (by setting rbd_default_features = 3) would work around this problem, but we don’t think that is the best solution for HA. Could you please give us some ideas about how you handle this and what the best practice for this feature is?
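
For context, rbd_default_features is a bitmask, so 3 keeps only the two lowest feature bits and drops exclusive-lock along with the features that depend on it (a rough sketch of the RBD feature bit values):

    1  layering
    2  striping
    4  exclusive-lock   (dropped when rbd_default_features = 3)
    8  object-map       (requires exclusive-lock)
   16  fast-diff        (requires object-map)
   32  deep-flatten
   64  journaling       (requires exclusive-lock)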

Thanks.


Re: Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by Andrija Panic <an...@gmail.com>.
I never mentioned anything about stability :) and, as usual, if a user wants to
upgrade, that is generally OK, but you risk compatibility issues with external
software (ACS), as Li does.
The LTS comment was based on a couple of slides from the latest Cephalocon; I
didn't actually follow the release notes.

That being said, Nautilus is waiting for somebody to test in production
(but it ain't gonna be me :) )

Cheers
Andrija

On Tue, 28 May 2019 at 11:58, Haijiao <18...@163.com> wrote:

> Hi, Andrija
>
>
> I think Ceph has a versioning convention that x.2.z means stable release
> for production. http://docs.ceph.com/docs/master/releases/schedule/
>
>
> x.0.z - development releases (for early testers and the brave at heart)
> x.1.z - release candidates (for test clusters, brave users)
> x.2.z - stable/bugfix releases (for users)
>
>
> And if we look into the release note of Minic, it states something like
> 'This is the fifth bugfix release of the Mimic v13.2.x long term stable
> release series. We recommend all Mimic users upgrade.'
> http://docs.ceph.com/docs/master/releases/mimic/#v13-2-4-mimic
>
>
> :-)
>
>
> At 2019-05-28 14:40:45, "Andrija Panic" <an...@gmail.com> wrote:
> >Hi Li,
> >
> >You would like to take a look at the next PR from Wido -
> >https://github.com/apache/cloudstack/pull/2985 - this is 4.12 only.
> >
> >In other words, you are using Mimic, non-LTS release of Ceph - and I have
> a
> >hard time believing that anyone is using this in production with
> CloudStack
> >(since it's decently recent Ceph release).
> >
> >Test a ACS 4.12 and see if your problem goes away.
> >
> >@Wido den Hollander <wi...@42on.com> , any thought?
> >
> >Regards,
> >Andrija
> >
> >On Tue, 28 May 2019 at 06:24, li jerry <di...@hotmail.com> wrote:
> >
> >> Hello guys
> >>
> >> we’ve deployed an environment with CloudStack 4.11.2 and KVM(CentOS7.6),
> >> and Ceph 13.2.5 is deployed as the primary storage.
> >> We found some issues with the HA solution, and we are here to ask for
> you
> >> suggestions.
> >>
> >> We’ve both enabled VM HA and Host HA feature in CloudStack, and the
> >> compute offering is tagged as ha.
> >> When we try to perform a power failure test (unplug 1 node of 4), the
> >> running VMs on the removed node is automatically rescheduled to the
> other
> >> living nodes after 5 minutes, but all of them can not boot into the OS.
> We
> >> found the booting procedure is stuck by the IO read/write failure.
> >>
> >>
> >>
> >> The following information is prompted after VM starts:
> >>
> >> Generating "/run/initramfs/rdsosreport.txt"
> >>
> >> Entering emergency mode. Exit the shell to continue.
> >> Type "journalctl" to view system logs.
> >> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick
> or
> >> /boot
> >> after mounting them and attach it to a bug report
> >>
> >> :/#
> >>
> >>
> >>
> >> We found this is caused by the lock on the image:
> >> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
> >> There is 1 exclusive lock on this image.
> >> Locker         ID                  Address
> >> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
> >>
> >> If we remove the lock from the image, and restart the VM under
> CloudStack,
> >> this VM will boot successfully.
> >>
> >> We know that if we disable the Exclusive Lock feature (by setting
> >> rbd_default_features = 3) for Ceph would solve this problem. But we
> don’t
> >> think it’s the best solution for the HA, so could you please give us
> some
> >> ideas about how you are doing and what is the best practice for this
> >> feature?
> >>
> >> Thanks.
> >>
> >>
> >
> >--
> >
> >Andrija Panić
>


-- 

Andrija Panić

Re: Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by Haijiao <18...@163.com>.
Hi, Andrija


I think Ceph has a versioning convention where x.2.z means a stable release for production. http://docs.ceph.com/docs/master/releases/schedule/


x.0.z - development releases (for early testers and the brave at heart)
x.1.z - release candidates (for test clusters, brave users)
x.2.z - stable/bugfix releases (for users)


And if we look into the release notes of Mimic, they state something like 'This is the fifth bugfix release of the Mimic v13.2.x long term stable release series. We recommend all Mimic users upgrade.'
http://docs.ceph.com/docs/master/releases/mimic/#v13-2-4-mimic


:-)


At 2019-05-28 14:40:45, "Andrija Panic" <an...@gmail.com> wrote:
>Hi Li,
>
>You would like to take a look at the next PR from Wido -
>https://github.com/apache/cloudstack/pull/2985 - this is 4.12 only.
>
>In other words, you are using Mimic, non-LTS release of Ceph - and I have a
>hard time believing that anyone is using this in production with CloudStack
>(since it's decently recent Ceph release).
>
>Test a ACS 4.12 and see if your problem goes away.
>
>@Wido den Hollander <wi...@42on.com> , any thought?
>
>Regards,
>Andrija
>
>On Tue, 28 May 2019 at 06:24, li jerry <di...@hotmail.com> wrote:
>
>> Hello guys
>>
>> we’ve deployed an environment with CloudStack 4.11.2 and KVM(CentOS7.6),
>> and Ceph 13.2.5 is deployed as the primary storage.
>> We found some issues with the HA solution, and we are here to ask for you
>> suggestions.
>>
>> We’ve both enabled VM HA and Host HA feature in CloudStack, and the
>> compute offering is tagged as ha.
>> When we try to perform a power failure test (unplug 1 node of 4), the
>> running VMs on the removed node is automatically rescheduled to the other
>> living nodes after 5 minutes, but all of them can not boot into the OS. We
>> found the booting procedure is stuck by the IO read/write failure.
>>
>>
>>
>> The following information is prompted after VM starts:
>>
>> Generating "/run/initramfs/rdsosreport.txt"
>>
>> Entering emergency mode. Exit the shell to continue.
>> Type "journalctl" to view system logs.
>> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick or
>> /boot
>> after mounting them and attach it to a bug report
>>
>> :/#
>>
>>
>>
>> We found this is caused by the lock on the image:
>> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
>> There is 1 exclusive lock on this image.
>> Locker         ID                  Address
>> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
>>
>> If we remove the lock from the image, and restart the VM under CloudStack,
>> this VM will boot successfully.
>>
>> We know that if we disable the Exclusive Lock feature (by setting
>> rbd_default_features = 3) for Ceph would solve this problem. But we don’t
>> think it’s the best solution for the HA, so could you please give us some
>> ideas about how you are doing and what is the best practice for this
>> feature?
>>
>> Thanks.
>>
>>
>
>-- 
>
>Andrija Panić

Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by Andrija Panic <an...@gmail.com>.
Hi Li,

You may want to take a look at the following PR from Wido -
https://github.com/apache/cloudstack/pull/2985 - which is in 4.12 only.

In other words, you are using Mimic, a non-LTS release of Ceph - and I have a
hard time believing that anyone is using it in production with CloudStack
(since it's a fairly recent Ceph release).

Test ACS 4.12 and see if your problem goes away.

@Wido den Hollander <wi...@42on.com>, any thoughts?

Regards,
Andrija

On Tue, 28 May 2019 at 06:24, li jerry <di...@hotmail.com> wrote:

> Hello guys
>
> we’ve deployed an environment with CloudStack 4.11.2 and KVM(CentOS7.6),
> and Ceph 13.2.5 is deployed as the primary storage.
> We found some issues with the HA solution, and we are here to ask for you
> suggestions.
>
> We’ve both enabled VM HA and Host HA feature in CloudStack, and the
> compute offering is tagged as ha.
> When we try to perform a power failure test (unplug 1 node of 4), the
> running VMs on the removed node is automatically rescheduled to the other
> living nodes after 5 minutes, but all of them can not boot into the OS. We
> found the booting procedure is stuck by the IO read/write failure.
>
>
>
> The following information is prompted after VM starts:
>
> Generating "/run/initramfs/rdsosreport.txt"
>
> Entering emergency mode. Exit the shell to continue.
> Type "journalctl" to view system logs.
> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick or
> /boot
> after mounting them and attach it to a bug report
>
> :/#
>
>
>
> We found this is caused by the lock on the image:
> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
> There is 1 exclusive lock on this image.
> Locker         ID                  Address
> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
>
> If we remove the lock from the image, and restart the VM under CloudStack,
> this VM will boot successfully.
>
> We know that if we disable the Exclusive Lock feature (by setting
> rbd_default_features = 3) for Ceph would solve this problem. But we don’t
> think it’s the best solution for the HA, so could you please give us some
> ideas about how you are doing and what is the best practice for this
> feature?
>
> Thanks.
>
>

-- 

Andrija Panić

Re: Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by Andrija Panic <an...@gmail.com>.
Thx Wido!

On Tue, 28 May 2019 at 13:51, Wido den Hollander <wi...@widodh.nl> wrote:

>
>
> On 5/28/19 1:48 PM, li jerry wrote:
> > Hi Wido
> >
> >
> >
> > I filled in the CLOUDSTACK is the following KEY
> >
> >
> >
> > [root@cn01-nodeb ~]# ceph auth get client.cloudstack
> >
> > exported keyring for client.cloudstack
> >
> > [client.cloudstack]
> >
> >       key = AQDTh7pcIJjNIhAAwk8jtxilJWXQR7osJRFMLw==
> >
> >       caps mon = "allow r"
> >
> >       caps osd = "allow rwx pool=rbd"
> >
> >
>
> That's the problem :-) Your user needs to be updated.
>
> The caps should be:
>
> [client.cloudstack]
>      key = AQDTh7pcIJjNIhAAwk8jtxilJWXQR7osJRFMLw==
>      caps mon = "profile rbd"
>      caps osd = "profile rbd pool=rbd"
>
> See: http://docs.ceph.com/docs/master/rbd/rbd-cloudstack/
>
> This will allow the client to blacklist the other and take over the
> exclusive-lock.
>
> Wido
>
> >
> > *From: *Wido den Hollander <ma...@widodh.nl>
> > *Sent: *May 28, 2019 19:42
> > *To: *dev@cloudstack.apache.org <ma...@cloudstack.apache.org>;
> > li jerry <ma...@hotmail.com>; users@cloudstack.apache.org
> > <ma...@cloudstack.apache.org>
> > *Subject: *Re: RBD primary storage VM encounters Exclusive Lock after
> > triggering HA
> >
> >
> >
> >
> >
> > On 5/28/19 6:16 AM, li jerry wrote:
> >> Hello guys
> >>
> >> we’ve deployed an environment with CloudStack 4.11.2 and
> KVM(CentOS7.6), and Ceph 13.2.5 is deployed as the primary storage.
> >> We found some issues with the HA solution, and we are here to ask for
> you suggestions.
> >>
> >> We’ve both enabled VM HA and Host HA feature in CloudStack, and the
> compute offering is tagged as ha.
> >> When we try to perform a power failure test (unplug 1 node of 4), the
> running VMs on the removed node is automatically rescheduled to the other
> living nodes after 5 minutes, but all of them can not boot into the OS. We
> found the booting procedure is stuck by the IO read/write failure.
> >>
> >>
> >>
> >> The following information is prompted after VM starts:
> >>
> >> Generating "/run/initramfs/rdsosreport.txt"
> >>
> >> Entering emergency mode. Exit the shell to continue.
> >> Type "journalctl" to view system logs.
> >> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick
> or /boot
> >> after mounting them and attach it to a bug report
> >>
> >> :/#
> >>
> >>
> >>
> >> We found this is caused by the lock on the image:
> >> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
> >> There is 1 exclusive lock on this image.
> >> Locker         ID                  Address
> >> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
> >>
> >> If we remove the lock from the image, and restart the VM under
> CloudStack, this VM will boot successfully.
> >>
> >> We know that if we disable the Exclusive Lock feature (by setting
> rbd_default_features = 3) for Ceph would solve this problem. But we don’t
> think it’s the best solution for the HA, so could you please give us some
> ideas about how you are doing and what is the best practice for this
> feature?
> >>
> >
> > exclusive-lock is something to prevent a split-brain and having two
> > clients write to it at the same time.
> >
> > The lock should be released to the other client if this is requested,
> > but I have the feeling that you might have a cephx problem there.
> >
> > Can you post the output of:
> >
> > $ ceph auth get client.X
> >
> > Where you replace X by the user you are using for CloudStack? Also
> > remove they 'key', I don't need that.
> >
> > I want to look at the caps of the user.
> >
> > Wido
> >
> >> Thanks.
> >>
> >>
> >
> >
> >
>


-- 

Andrija Panić

Re: Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by li jerry <di...@hotmail.com>.
Thx Wido!



After executing the following command on the Ceph admin node, my problem was solved.



[root@cn01-nodea ~]#  ceph auth caps client.cloudstack mon 'allow profile rbd' osd 'allow profile rbd pool=rbd'
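
To verify the change, ceph auth get for the same client should now show the profile-based caps Wido recommends elsewhere in this thread; a sketch of the expected output (key elided):

[root@cn01-nodea ~]# ceph auth get client.cloudstack
exported keyring for client.cloudstack
[client.cloudstack]
      key = ...
      caps mon = "profile rbd"
      caps osd = "profile rbd pool=rbd"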





________________________________
From: Wido den Hollander <wi...@widodh.nl>
Sent: Tuesday, May 28, 2019 7:50:43 PM
To: li jerry; dev@cloudstack.apache.org; users@cloudstack.apache.org
Subject: Re: Re: RBD primary storage VM encounters Exclusive Lock after triggering HA



On 5/28/19 1:48 PM, li jerry wrote:
> Hi Wido
>
>
>
> I filled in the CLOUDSTACK is the following KEY
>
>
>
> [root@cn01-nodeb ~]# ceph auth get client.cloudstack
>
> exported keyring for client.cloudstack
>
> [client.cloudstack]
>
>       key = AQDTh7pcIJjNIhAAwk8jtxilJWXQR7osJRFMLw==
>
>       caps mon = "allow r"
>
>       caps osd = "allow rwx pool=rbd"
>
>

That's the problem :-) Your user needs to be updated.

The caps should be:

[client.cloudstack]
     key = AQDTh7pcIJjNIhAAwk8jtxilJWXQR7osJRFMLw==
     caps mon = "profile rbd"
     caps osd = "profile rbd pool=rbd"

See: http://docs.ceph.com/docs/master/rbd/rbd-cloudstack/

This will allow the client to blacklist the other and take over the
exclusive-lock.

Wido

>
> *From: *Wido den Hollander <ma...@widodh.nl>
> *Sent: *May 28, 2019 19:42
> *To: *dev@cloudstack.apache.org <ma...@cloudstack.apache.org>;
> li jerry <ma...@hotmail.com>; users@cloudstack.apache.org
> <ma...@cloudstack.apache.org>
> *Subject: *Re: RBD primary storage VM encounters Exclusive Lock after
> triggering HA
>
>
>
>
>
> On 5/28/19 6:16 AM, li jerry wrote:
>> Hello guys
>>
>> we’ve deployed an environment with CloudStack 4.11.2 and KVM(CentOS7.6), and Ceph 13.2.5 is deployed as the primary storage.
>> We found some issues with the HA solution, and we are here to ask for you suggestions.
>>
>> We’ve both enabled VM HA and Host HA feature in CloudStack, and the compute offering is tagged as ha.
>> When we try to perform a power failure test (unplug 1 node of 4), the running VMs on the removed node is automatically rescheduled to the other living nodes after 5 minutes, but all of them can not boot into the OS. We found the booting procedure is stuck by the IO read/write failure.
>>
>>
>>
>> The following information is prompted after VM starts:
>>
>> Generating "/run/initramfs/rdsosreport.txt"
>>
>> Entering emergency mode. Exit the shell to continue.
>> Type "journalctl" to view system logs.
>> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick or /boot
>> after mounting them and attach it to a bug report
>>
>> :/#
>>
>>
>>
>> We found this is caused by the lock on the image:
>> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
>> There is 1 exclusive lock on this image.
>> Locker         ID                  Address
>> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
>>
>> If we remove the lock from the image, and restart the VM under CloudStack, this VM will boot successfully.
>>
>> We know that if we disable the Exclusive Lock feature (by setting rbd_default_features = 3) for Ceph would solve this problem. But we don’t think it’s the best solution for the HA, so could you please give us some ideas about how you are doing and what is the best practice for this feature?
>>
>
> exclusive-lock is something to prevent a split-brain and having two
> clients write to it at the same time.
>
> The lock should be released to the other client if this is requested,
> but I have the feeling that you might have a cephx problem there.
>
> Can you post the output of:
>
> $ ceph auth get client.X
>
> Where you replace X by the user you are using for CloudStack? Also
> remove they 'key', I don't need that.
>
> I want to look at the caps of the user.
>
> Wido
>
>> Thanks.
>>
>>
>
>
>

Re: Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by Wido den Hollander <wi...@widodh.nl>.

On 5/28/19 1:48 PM, li jerry wrote:
> Hi Wido
> 
>  
> 
> I filled in the CLOUDSTACK is the following KEY
> 
>  
> 
> [root@cn01-nodeb ~]# ceph auth get client.cloudstack
> 
> exported keyring for client.cloudstack
> 
> [client.cloudstack]
> 
>       key = AQDTh7pcIJjNIhAAwk8jtxilJWXQR7osJRFMLw==
> 
>       caps mon = "allow r"
> 
>       caps osd = "allow rwx pool=rbd"
> 
>  

That's the problem :-) Your user needs to be updated.

The caps should be:

[client.cloudstack]
     key = AQDTh7pcIJjNIhAAwk8jtxilJWXQR7osJRFMLw==
     caps mon = "profile rbd"
     caps osd = "profile rbd pool=rbd"

See: http://docs.ceph.com/docs/master/rbd/rbd-cloudstack/

This will allow the client to blacklist the other and take over the
exclusive-lock.

Wido
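
As a rough follow-up check after an HA takeover (a sketch; the image is the one from earlier in the thread), the dead client's address should show up in the OSD blacklist and the lock should now be held by the client on the new host:

[root@cn01-nodea ~]# ceph osd blacklist ls
[root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b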

> 
> *From: *Wido den Hollander <ma...@widodh.nl>
> *Sent: *May 28, 2019 19:42
> *To: *dev@cloudstack.apache.org <ma...@cloudstack.apache.org>;
> li jerry <ma...@hotmail.com>; users@cloudstack.apache.org
> <ma...@cloudstack.apache.org>
> *Subject: *Re: RBD primary storage VM encounters Exclusive Lock after
> triggering HA
> 
>  
> 
> 
> 
> On 5/28/19 6:16 AM, li jerry wrote:
>> Hello guys
>> 
>> we’ve deployed an environment with CloudStack 4.11.2 and KVM(CentOS7.6), and Ceph 13.2.5 is deployed as the primary storage.
>> We found some issues with the HA solution, and we are here to ask for you suggestions.
>> 
>> We’ve both enabled VM HA and Host HA feature in CloudStack, and the compute offering is tagged as ha.
>> When we try to perform a power failure test (unplug 1 node of 4), the running VMs on the removed node is automatically rescheduled to the other living nodes after 5 minutes, but all of them can not boot into the OS. We found the booting procedure is stuck by the IO read/write failure.
>> 
>> 
>> 
>> The following information is prompted after VM starts:
>> 
>> Generating "/run/initramfs/rdsosreport.txt"
>> 
>> Entering emergency mode. Exit the shell to continue.
>> Type "journalctl" to view system logs.
>> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick or /boot
>> after mounting them and attach it to a bug report
>> 
>> :/#
>> 
>> 
>> 
>> We found this is caused by the lock on the image:
>> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
>> There is 1 exclusive lock on this image.
>> Locker         ID                  Address
>> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
>> 
>> If we remove the lock from the image, and restart the VM under CloudStack, this VM will boot successfully.
>> 
>> We know that if we disable the Exclusive Lock feature (by setting rbd_default_features = 3) for Ceph would solve this problem. But we don’t think it’s the best solution for the HA, so could you please give us some ideas about how you are doing and what is the best practice for this feature?
>> 
> 
> exclusive-lock is something to prevent a split-brain and having two
> clients write to it at the same time.
> 
> The lock should be released to the other client if this is requested,
> but I have the feeling that you might have a cephx problem there.
> 
> Can you post the output of:
> 
> $ ceph auth get client.X
> 
> Where you replace X by the user you are using for CloudStack? Also
> remove they 'key', I don't need that.
> 
> I want to look at the caps of the user.
> 
> Wido
> 
>> Thanks.
>> 
>> 
> 
>  
> 

Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by li jerry <di...@hotmail.com>.
Hi Wido

The key I filled in for CloudStack is the following:

[root@cn01-nodeb ~]# ceph auth get client.cloudstack
exported keyring for client.cloudstack
[client.cloudstack]
      key = AQDTh7pcIJjNIhAAwk8jtxilJWXQR7osJRFMLw==
      caps mon = "allow r"
      caps osd = "allow rwx pool=rbd"

From: Wido den Hollander<ma...@widodh.nl>
Sent: May 28, 2019 19:42
To: dev@cloudstack.apache.org<ma...@cloudstack.apache.org>; li jerry<ma...@hotmail.com>; users@cloudstack.apache.org<ma...@cloudstack.apache.org>
Subject: Re: RBD primary storage VM encounters Exclusive Lock after triggering HA



On 5/28/19 6:16 AM, li jerry wrote:
> Hello guys
>
> we’ve deployed an environment with CloudStack 4.11.2 and KVM(CentOS7.6), and Ceph 13.2.5 is deployed as the primary storage.
> We found some issues with the HA solution, and we are here to ask for you suggestions.
>
> We’ve both enabled VM HA and Host HA feature in CloudStack, and the compute offering is tagged as ha.
> When we try to perform a power failure test (unplug 1 node of 4), the running VMs on the removed node is automatically rescheduled to the other living nodes after 5 minutes, but all of them can not boot into the OS. We found the booting procedure is stuck by the IO read/write failure.
>
>
>
> The following information is prompted after VM starts:
>
> Generating "/run/initramfs/rdsosreport.txt"
>
> Entering emergency mode. Exit the shell to continue.
> Type "journalctl" to view system logs.
> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick or /boot
> after mounting them and attach it to a bug report
>
> :/#
>
>
>
> We found this is caused by the lock on the image:
> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
> There is 1 exclusive lock on this image.
> Locker         ID                  Address
> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
>
> If we remove the lock from the image, and restart the VM under CloudStack, this VM will boot successfully.
>
> We know that if we disable the Exclusive Lock feature (by setting rbd_default_features = 3) for Ceph would solve this problem. But we don’t think it’s the best solution for the HA, so could you please give us some ideas about how you are doing and what is the best practice for this feature?
>

exclusive-lock is something to prevent a split-brain and having two
clients write to it at the same time.

The lock should be released to the other client if this is requested,
but I have the feeling that you might have a cephx problem there.

Can you post the output of:

$ ceph auth get client.X

Where you replace X by the user you are using for CloudStack? Also
remove they 'key', I don't need that.

I want to look at the caps of the user.

Wido

> Thanks.
>
>


Re: RBD primary storage VM encounters Exclusive Lock after triggering HA

Posted by Wido den Hollander <wi...@widodh.nl>.

On 5/28/19 6:16 AM, li jerry wrote:
> Hello guys
> 
> we’ve deployed an environment with CloudStack 4.11.2 and KVM(CentOS7.6), and Ceph 13.2.5 is deployed as the primary storage.
> We found some issues with the HA solution, and we are here to ask for you suggestions.
> 
> We’ve both enabled VM HA and Host HA feature in CloudStack, and the compute offering is tagged as ha.
> When we try to perform a power failure test (unplug 1 node of 4), the running VMs on the removed node is automatically rescheduled to the other living nodes after 5 minutes, but all of them can not boot into the OS. We found the booting procedure is stuck by the IO read/write failure.
> 
> 
> 
> The following information is prompted after VM starts:
> 
> Generating "/run/initramfs/rdsosreport.txt"
> 
> Entering emergency mode. Exit the shell to continue.
> Type "journalctl" to view system logs.
> You might want to save "/run/initramfs/rdsosreport.txt" to a USB stick or /boot
> after mounting them and attach it to a bug report
> 
> :/#
> 
> 
> 
> We found this is caused by the lock on the image:
> [root@cn01-nodea ~]# rbd lock list a93010b0-2be2-49bd-b25e-ec89b3a98b4b
> There is 1 exclusive lock on this image.
> Locker         ID                  Address
> client.1164351 auto 94464726847232 10.226.16.128:0/3002249644
> 
> If we remove the lock from the image, and restart the VM under CloudStack, this VM will boot successfully.
> 
> We know that if we disable the Exclusive Lock feature (by setting rbd_default_features = 3) for Ceph would solve this problem. But we don’t think it’s the best solution for the HA, so could you please give us some ideas about how you are doing and what is the best practice for this feature?
> 

exclusive-lock is something to prevent a split-brain and having two
clients write to it at the same time.

The lock should be released to the other client if this is requested,
but I have the feeling that you might have a cephx problem there.

Can you post the output of:

$ ceph auth get client.X

Where you replace X by the user you are using for CloudStack? Also
remove the 'key', I don't need that.

I want to look at the caps of the user.

Wido

> Thanks.
> 
> 
