You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Benjamin Bannier (JIRA)" <ji...@apache.org> on 2019/01/17 11:16:00 UTC
[jira] [Commented] (MESOS-9130) Test
`StorageLocalResourceProviderTest.ROOT_ContainerTerminationMetric` is
flaky.
[ https://issues.apache.org/jira/browse/MESOS-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16744904#comment-16744904 ]
Benjamin Bannier commented on MESOS-9130:
-----------------------------------------
Reopening as above fix introduced another flakiness.
Review: https://reviews.apache.org/r/69781/
> Test `StorageLocalResourceProviderTest.ROOT_ContainerTerminationMetric` is flaky.
> ---------------------------------------------------------------------------------
>
> Key: MESOS-9130
> URL: https://issues.apache.org/jira/browse/MESOS-9130
> Project: Mesos
> Issue Type: Bug
> Components: resource provider, storage
> Affects Versions: 1.6.0, 1.7.0
> Reporter: Chun-Hung Hsiao
> Assignee: Benjamin Bannier
> Priority: Major
> Labels: mesosphere, storage
> Fix For: 1.8.0
>
> Attachments: test.log
>
>
> This test is flaky and can fail with the following error:
> {noformat}
> ../../src/tests/storage_local_resource_provider_tests.cpp:3167
> Failed to wait 15secs for pluginRestarted{noformat}
> The actual error is the following:
> {noformat}
> E0802 22:13:37.265038 8216 provider.cpp:1496] Failed to reconcile resource provider b9379982-d990-4f63-8a5b-10edd4f5a1bb: Collect failed: OS Error{noformat}
> The root cause is that the SLRP calls {{ListVolumes}} and {{GetCapacity}} when starting up, and if the plugin container is killed when these calls are ongoing, gRPC will return an {{OS Error}} which will lead the SLRP to fail.
> This flakiness will be fixed once we finish https://issues.apache.org/jira/browse/MESOS-8400.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)