Posted to issues@mesos.apache.org by "Benjamin Bannier (JIRA)" <ji...@apache.org> on 2019/01/07 16:18:00 UTC

[jira] [Assigned] (MESOS-9130) Test `StorageLocalResourceProviderTest.ROOT_ContainerTerminationMetric` is flaky.

     [ https://issues.apache.org/jira/browse/MESOS-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Benjamin Bannier reassigned MESOS-9130:
---------------------------------------

    Assignee: Benjamin Bannier

> Test `StorageLocalResourceProviderTest.ROOT_ContainerTerminationMetric` is flaky.
> ---------------------------------------------------------------------------------
>
>                 Key: MESOS-9130
>                 URL: https://issues.apache.org/jira/browse/MESOS-9130
>             Project: Mesos
>          Issue Type: Bug
>          Components: resource provider, storage
>    Affects Versions: 1.6.0, 1.7.0
>            Reporter: Chun-Hung Hsiao
>            Assignee: Benjamin Bannier
>            Priority: Major
>              Labels: mesosphere, storage
>         Attachments: test.log
>
>
> This test is flaky and can fail with the following error:
> {noformat}
> ../../src/tests/storage_local_resource_provider_tests.cpp:3167
> Failed to wait 15secs for pluginRestarted{noformat}
> The underlying error in the log is the following:
> {noformat}
> E0802 22:13:37.265038  8216 provider.cpp:1496] Failed to reconcile resource provider b9379982-d990-4f63-8a5b-10edd4f5a1bb: Collect failed: OS Error{noformat}
> The root cause is that the SLRP calls {{ListVolumes}} and {{GetCapacity}} when starting up. If the plugin container is killed while these calls are in flight, gRPC returns an {{OS Error}}, which causes the SLRP to fail.
> This flakiness will be fixed once https://issues.apache.org/jira/browse/MESOS-8400 is finished.


