You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@reef.apache.org by "Mariia Mykhailova (JIRA)" <ji...@apache.org> on 2016/11/21 20:27:59 UTC

[jira] [Updated] (REEF-1244) Group Communication does not close down properly at the end if reej job

     [ https://issues.apache.org/jira/browse/REEF-1244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mariia Mykhailova updated REEF-1244:
------------------------------------
    Fix Version/s:     (was: 0.13)

> Group Communication does not close down properly at the end if reej job
> -----------------------------------------------------------------------
>
>                 Key: REEF-1244
>                 URL: https://issues.apache.org/jira/browse/REEF-1244
>             Project: REEF
>          Issue Type: Bug
>          Components: GroupCommunications
>    Affects Versions: 0.13
>         Environment: C#
>            Reporter: Dhruv Mahajan
>            Assignee: Dhruv Mahajan
>
> Currently, when we want to shut down evaluator, the dispose function of group communications will be called. However, there is a race condition that occurs. For example, suppose evaluator e1 calls dispose and closes the stream with evaluator e2. Then if e2 is in ReadAsync() function of the stream, we will get a failure since Dispose() function in e2 is still not called. Moreover, the Dispose() function in e2 will try to close the already closed stream again. 
> Some of these scenarios are handled by catching Exceptions and ignoring them but some are not captured and throw errors which leads to driver and reef job failing.
> The aim of this JIRA is to identify all these closing scenarios and handle them appropriately.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)