Posted to dev@flume.apache.org by "E. Sammer (JIRA)" <ji...@apache.org> on 2011/08/10 23:27:27 UTC

[jira] [Commented] (FLUME-706) Flume nodes launch duplicate logical nodes

    [ https://issues.apache.org/jira/browse/FLUME-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13082692#comment-13082692 ] 

E. Sammer commented on FLUME-706:
---------------------------------

A rough version of the emergency fix I made for a specific user is now up at https://review.cloudera.org/r/1896/

> Flume nodes launch duplicate logical nodes
> ------------------------------------------
>
>                 Key: FLUME-706
>                 URL: https://issues.apache.org/jira/browse/FLUME-706
>             Project: Flume
>          Issue Type: Bug
>          Components: Master, Node
>    Affects Versions: v0.9.5
>            Reporter: E. Sammer
>            Assignee: E. Sammer
>            Priority: Critical
>             Fix For: v0.9.5
>
>         Attachments: FLUME-706.log
>
>
> When a config command is submitted to the Flume master, the downstream node appears to load the config twice.
> In a test case that starts a single master and a single node, I submitted a "config node rpcSource(12345) console". The node sees the config change on the next heartbeat, updates its config, and starts the Thrift source on port 12345. Immediately after, it logs "Taking another heartbeat" (DEBUG) and attempts to create another logical node with the same config. This leads to Thrift errors in bind() and "Could not create ServerSocket on address ...". Looking at the root cause in a debugger (Thrift swallows the original exception), I can see it's an "Address already in use" IOException.
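
The symptom described above can be reproduced outside Flume. The following is a minimal sketch, not Flume or Thrift code: it uses a plain java.net.ServerSocket in place of the Thrift server transport, and the class and method names (DuplicateBindDemo, secondBindFails) are hypothetical. It binds a listening socket and then tries to bind a second one to the same address and port, which is presumably what the duplicate logical node ends up doing.

```java
import java.io.IOException;
import java.net.BindException;
import java.net.InetAddress;
import java.net.ServerSocket;

// Hypothetical demo class; not part of Flume or Thrift.
public class DuplicateBindDemo {

    /**
     * Binds a server socket, then tries to bind a second one to the same
     * loopback address and port. Returns true if the second bind is
     * rejected with a BindException (a subclass of IOException).
     */
    static boolean secondBindFails() throws IOException {
        InetAddress loopback = InetAddress.getLoopbackAddress();
        // Port 0 asks the OS for any free port, so the demo does not
        // depend on a fixed port such as 12345 being available.
        try (ServerSocket first = new ServerSocket(0, 50, loopback)) {
            int port = first.getLocalPort();
            // Second listener on the same address and port, standing in
            // for the second logical node starting the same source.
            try (ServerSocket second = new ServerSocket(port, 50, loopback)) {
                return false; // unexpected: both binds succeeded
            } catch (BindException e) {
                System.out.println("Second bind failed: " + e.getMessage());
                return true;
            }
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println("second bind rejected: " + secondBindFails());
    }
}
```

Because BindException extends IOException, a wrapper that catches IOException and reports only its own message would hide the underlying cause in the same way the description reports for Thrift.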

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira