You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@falcon.apache.org by "Peeyush Bishnoi (JIRA)" <ji...@apache.org> on 2015/05/06 13:20:00 UTC

[jira] [Commented] (FALCON-1165) Falcon restart failed, if defined service in cluster entity is unreachable

    [ https://issues.apache.org/jira/browse/FALCON-1165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14530358#comment-14530358 ] 

Peeyush Bishnoi commented on FALCON-1165:
-----------------------------------------

Patch is attached that will restart the Falcon server on source cluster, if service like HDFS is down/unreachable on remote cluster. [~venkatnrangan] Please review.

> Falcon restart failed, if defined service in cluster entity is unreachable
> --------------------------------------------------------------------------
>
>                 Key: FALCON-1165
>                 URL: https://issues.apache.org/jira/browse/FALCON-1165
>             Project: Falcon
>          Issue Type: Bug
>            Reporter: Peeyush Bishnoi
>            Assignee: Peeyush Bishnoi
>             Fix For: 0.7
>
>         Attachments: FALCON-1165.patch
>
>
> Falcon fail to restart, if any service in the cluster entity is not reachable or down.
> For example, if there are clusters X, Y, Z. In cluster X, submit cluster entities which points to services of cluster Y & Z. Execute some replication jobs from cluster X to Y and even to cluster Z as well. If after certain duration, cluster Z HDFS service is down due to maintenance activity and at the same time we require to restart Falcon service on cluster X due to some reason, then Falcon will fail to restart on cluster X. 
> This issue has been reported internally at Hortonworks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)