You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@mesos.apache.org by "Cong Wang (JIRA)" <ji...@apache.org> on 2015/09/14 22:12:45 UTC

[jira] [Commented] (MESOS-3423) Perf event isolator stops performing sampling if a single timeout occurs.

    [ https://issues.apache.org/jira/browse/MESOS-3423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14744166#comment-14744166 ] 

Cong Wang commented on MESOS-3423:
----------------------------------

Despite restarting sampling, it's interesting to know why it timed out?

> Perf event isolator stops performing sampling if a single timeout occurs.
> -------------------------------------------------------------------------
>
>                 Key: MESOS-3423
>                 URL: https://issues.apache.org/jira/browse/MESOS-3423
>             Project: Mesos
>          Issue Type: Bug
>          Components: isolation, slave
>    Affects Versions: 0.24.0
>            Reporter: Vinod Kone
>            Assignee: Benjamin Mahler
>              Labels: twitter
>
> Currently the perf event isolator times out a sample after a fixed extra time of 2 seconds on top of the sample time elapses:
> {code}
>     Duration timeout = flags.perf_duration + Seconds(2);
> {code}
> This should be based on the reap interval maximum.
> Also, the code stops sampling altogether when a single timeout occurs. We've observed time outs during normal operation, so it would be better for the isolator to continue performing perf sampling in the case of timeouts. It may also make sense to continue sampling in the case of errors, since these may be transient.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)