You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flume.apache.org by "Jonathan Hsieh (JIRA)" <ji...@apache.org> on 2011/08/19 04:52:27 UTC

[jira] [Commented] (FLUME-745) Fix Race condition in NaiveFileWALDeco and retransmit logic

    [ https://issues.apache.org/jira/browse/FLUME-745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13087466#comment-13087466 ] 

Jonathan Hsieh commented on FLUME-745:
--------------------------------------

There was code snippet posted previously but it is incorrect.

The main issue is that sychrnozation in the NaiveFileWALManager isn't handled properly with the normal statemachine and when another thread triggers a retransmit due to a retry timeout.  

> Fix Race condition in NaiveFileWALDeco and retransmit logic
> -----------------------------------------------------------
>
>                 Key: FLUME-745
>                 URL: https://issues.apache.org/jira/browse/FLUME-745
>             Project: Flume
>          Issue Type: Bug
>    Affects Versions: v0.9.5
>            Reporter: Jonathan Hsieh
>
> There is a race condition in state transtiions that happen in the NaiveFileWALDeco and retransmits.  This condition is fairly rare but when it occurs it cause an agent or collector to hang.  

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira