Posted to commits@samza.apache.org by "Navina Ramesh (JIRA)" <ji...@apache.org> on 2017/05/05 08:31:04 UTC

[jira] [Commented] (SAMZA-1150) Handling Error propagation between ZkJobCoordinator & DebounceTimer

    [ https://issues.apache.org/jira/browse/SAMZA-1150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15997939#comment-15997939 ] 

Navina Ramesh commented on SAMZA-1150:
--------------------------------------

To keep things simple and robust within the 0.13.0 release timeline, we are going to treat all error conditions in the JobCoordinator as FATAL and shut down the StreamProcessor.
A complete solution addressing the questions listed in this JIRA will follow in a later release.
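
To make the "treat everything as FATAL" approach concrete, here is a minimal sketch of a debounce-style timer that wraps every callback, and on the first failure cancels all pending tasks and notifies a handler. It is an illustration only, not the Samza implementation: the class name FatalOnErrorDebounceTimer, its methods, and the onFatal handler are hypothetical, and cancelling the remaining tasks is just one possible answer to question 2 in the description.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;

// Hypothetical sketch: any exception thrown by a scheduled callback is treated as fatal.
final class FatalOnErrorDebounceTimer {
    private final ScheduledExecutorService scheduler =
        Executors.newSingleThreadScheduledExecutor();
    // At most one pending task per action name (the "debounce" part).
    private final Map<String, ScheduledFuture<?>> pending = new HashMap<>();
    // Assumed hook where the owner would shut down the processor.
    private final Consumer<Throwable> onFatal;
    private boolean failed = false;

    FatalOnErrorDebounceTimer(Consumer<Throwable> onFatal) {
        this.onFatal = onFatal;
    }

    // Schedules a task; a newer task with the same action name replaces the older one.
    synchronized void schedule(String action, long delayMs, Runnable task) {
        if (failed) {
            return; // after a fatal error, new work is ignored
        }
        ScheduledFuture<?> previous = pending.get(action);
        if (previous != null) {
            previous.cancel(false);
        }
        pending.put(action,
            scheduler.schedule(() -> runOrFail(task), delayMs, TimeUnit.MILLISECONDS));
    }

    private void runOrFail(Runnable task) {
        try {
            task.run();
        } catch (Throwable t) {
            fail(t);
        }
    }

    // First failure wins: cancel everything still pending, stop the scheduler, notify.
    private void fail(Throwable t) {
        synchronized (this) {
            if (failed) {
                return;
            }
            failed = true;
            for (ScheduledFuture<?> future : pending.values()) {
                future.cancel(false);
            }
            pending.clear();
            scheduler.shutdown();
        }
        onFatal.accept(t); // called outside the lock
    }

    synchronized boolean hasFailed() {
        return failed;
    }

    // Waits for the scheduler thread to exit; returns false on timeout or interrupt.
    boolean awaitStopped(long timeoutMs) {
        try {
            return scheduler.awaitTermination(timeoutMs, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }
}
```

In this sketch the owner would pass a handler that stops the processor and surfaces the error to the user. Ordering between tasks is not considered here, which is part of what the follow-up work would need to decide.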

> Handling Error propagation between ZkJobCoordinator & DebounceTimer
> -------------------------------------------------------------------
>
>                 Key: SAMZA-1150
>                 URL: https://issues.apache.org/jira/browse/SAMZA-1150
>             Project: Samza
>          Issue Type: Bug
>            Reporter: Navina Ramesh
>            Assignee: Navina Ramesh
>             Fix For: 0.13.0
>
>
> Right now, I don't think we handle errors from any of the callbacks properly. As far as I can tell, most of the errors should cause the processor to stop and propagate the error to the user. But we need to carefully evaluate the various error scenarios.
> Through this JIRA, we need to answer the following:
> 1. How to handle errors/exceptions thrown in the callbacks that are executed with the debounce timer?
> 2. The debounce timer has a category of tasks to run. What happens when one task fails? Do we cancel the other pending tasks, if any? Does the ordering of the tasks matter at all?
> 3. What kind of failures should be considered FATAL? 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)