You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Vinod Kone (JIRA)" <ji...@apache.org> on 2018/11/26 20:59:00 UTC

[jira] [Commented] (MESOS-8930) THREADSAFE_SnapshotTimeout is flaky.

    [ https://issues.apache.org/jira/browse/MESOS-8930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16699577#comment-16699577 ] 

Vinod Kone commented on MESOS-8930:
-----------------------------------

Still seeing this in CI.

 

[~bmahler] Do we have any abstractions/techniques in place that allows us to ensure the http request is enqueued in a more robust matter? Sounds like the 10ms is sometimes not enough in ASF CI.

 

Kinda unrelated bug here is that the code does a "response->body" on a (possibly pending) future causing it to hang forever. This will block the whole test suite!

{code}

  AWAIT_EXPECT_RESPONSE_STATUS_EQ(OK().status, response);

 

  // Parse the response.

  Try<JSON::Object> responseJSON = JSON::parse<JSON::Object>(response->body);

  ASSERT_SOME(responseJSON);

{code}

 

I think we should atleast change the `AWAIT_EXPECT_*` above to `AWAIT_ASSERT` so that the rest of the test code is skipped. cc [~greggomann] [~bmahler]

 

> THREADSAFE_SnapshotTimeout is flaky.
> ------------------------------------
>
>                 Key: MESOS-8930
>                 URL: https://issues.apache.org/jira/browse/MESOS-8930
>             Project: Mesos
>          Issue Type: Bug
>          Components: test
>         Environment: Ubuntu 16.04
>            Reporter: Alexander Rukletsov
>            Assignee: Benjamin Mahler
>            Priority: Major
>              Labels: flaky-test, mesosphere
>
> Observed on ASF CI, might be related to a recent test change https://reviews.apache.org/r/66831/
> {noformat}
> 18:23:31 2: [ RUN      ] MetricsTest.THREADSAFE_SnapshotTimeout
> 18:23:31 2: I0516 18:23:31.747611 16246 process.cpp:3583] Handling HTTP event for process 'metrics' with path: '/metrics/snapshot'
> 18:23:31 2: I0516 18:23:31.796871 16251 process.cpp:3583] Handling HTTP event for process 'metrics' with path: '/metrics/snapshot'
> 18:23:46 2: /tmp/SRC/3rdparty/libprocess/src/tests/metrics_tests.cpp:425: Failure
> 18:23:46 2: Failed to wait 15secs for response
> 22:57:13 Build timed out (after 300 minutes). Marking the build as failed.
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)