Posted to dev@reef.apache.org by "Shravan Matthur Narayanamurthy (JIRA)" <ji...@apache.org> on 2016/12/01 00:55:58 UTC

[jira] [Commented] (REEF-1674) Random Failures in Broadcast and Reduce Fault Tolerance tests

    [ https://issues.apache.org/jira/browse/REEF-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15710396#comment-15710396 ] 

Shravan Matthur Narayanamurthy commented on REEF-1674:
------------------------------------------------------

Mariia, you were right. I was not sure whether ```PoisonedEventHandler``` causes a failed evaluator. After looking at it, it seems that it does, since it raises an exception on the clock. So I removed the exit handler and used ```PoisonedEventHandler``` instead. Thanks!
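To illustrate the general idea of a handler that fails at a random time rather than at a fixed point, here is a minimal, language-neutral sketch in Python. It is not REEF's implementation: the names `RandomFailureInjector`, `InjectedFailure`, `crash_probability`, `max_delay_seconds`, and the `schedule` callback (standing in for a clock's alarm scheduling) are all hypothetical and chosen only for illustration.

```python
import random


class InjectedFailure(RuntimeError):
    """Raised by the scheduled callback to mimic a crash (hypothetical name)."""


class RandomFailureInjector:
    """Hypothetical helper: on a start event, maybe schedule a crash at a random delay."""

    def __init__(self, crash_probability, max_delay_seconds, seed=None):
        if not 0.0 <= crash_probability <= 1.0:
            raise ValueError("crash_probability must be in [0, 1]")
        self._crash_probability = crash_probability
        self._max_delay_seconds = max_delay_seconds
        # A private generator so a seed makes a test run reproducible.
        self._rng = random.Random(seed)

    def plan(self):
        """Return the delay before the crash, or None if this run should survive."""
        if self._rng.random() < self._crash_probability:
            return self._rng.uniform(0.0, self._max_delay_seconds)
        return None

    def on_start(self, schedule):
        """Call schedule(delay, callback) if a crash is planned; return the delay."""
        delay = self.plan()
        if delay is not None:
            schedule(delay, self._crash)
        return delay

    @staticmethod
    def _crash():
        # The exception is raised from the scheduled callback, not from on_start.
        raise InjectedFailure("simulated failure injected at a random time")
```

In a real test the `schedule` argument would be whatever alarm mechanism the runtime provides. Passing a seed keeps a randomly failing run reproducible when a test needs to be debugged.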

> Random Failures in Broadcast and Reduce Fault Tolerance tests
> -------------------------------------------------------------
>
>                 Key: REEF-1674
>                 URL: https://issues.apache.org/jira/browse/REEF-1674
>             Project: REEF
>          Issue Type: Improvement
>          Components: REEF.NET IO
>    Affects Versions: 0.16
>            Reporter: Shravan Matthur Narayanamurthy
>            Assignee: Shravan Matthur Narayanamurthy
>            Priority: Minor
>             Fix For: 0.16
>
>
> The current fault tolerance tests inject simulated failures in a controlled manner, which is not the right failure model for testing our fault tolerance work. It would be better to inject failures randomly rather than only at specific points, as the current code does.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)