Posted to dev@falcon.apache.org by "Venkatesh Seetharam (JIRA)" <ji...@apache.org> on 2013/11/05 23:58:17 UTC

[jira] [Commented] (FALCON-168) Different validity of clusters in feed not handled

    [ https://issues.apache.org/jira/browse/FALCON-168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13814362#comment-13814362 ] 

Venkatesh Seetharam commented on FALCON-168:
--------------------------------------------

I think I found the offending code, a copy-and-paste error that I introduced in:
org.apache.falcon.converter.OozieFeedMapper.ReplicationOozieWorkflowMapper#createAndGetCoord

{code}
            Date sourceStartDate = getStartDate(feed, srcCluster, replicationDelayInMillis);
            Date sourceEndDate = getEndDate(feed, srcCluster);

            Date targetStartDate = getStartDate(feed, srcCluster, replicationDelayInMillis);
            Date targetEndDate = getEndDate(feed, srcCluster);
{code}
Notice that srcCluster is passed as the argument for both the source and the target dates, so the target cluster's validity is never used. This should be:
{code}
            Date sourceStartDate = getStartDate(feed, srcCluster, replicationDelayInMillis);
            Date sourceEndDate = getEndDate(feed, srcCluster);

            Date targetStartDate = getStartDate(feed, trgCluster, replicationDelayInMillis);
            Date targetEndDate = getEndDate(feed, trgCluster);
{code}
I have added a few unit tests to cover this and will have a patch in a bit.
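
For illustration only, here is a minimal, self-contained sketch of the "overlapping time duration" rule from the issue description, applied to the validity windows quoted in the test case below. The class ValidityOverlap and its methods are hypothetical names made up for this example, not Falcon APIs, and the actual mapper code may derive the coordinator window differently.

```java
import java.text.ParseException;
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.TimeZone;

// Hypothetical helper, not part of Falcon: computes the window in which
// both the source and the target validity hold.
class ValidityOverlap {

    // Timestamp layout used in the issue, e.g. 2012-10-01T12:05Z (UTC).
    private static SimpleDateFormat utcFormat() {
        SimpleDateFormat format = new SimpleDateFormat("yyyy-MM-dd'T'HH:mm'Z'");
        format.setTimeZone(TimeZone.getTimeZone("UTC"));
        return format;
    }

    static Date parse(String text) {
        try {
            return utcFormat().parse(text);
        } catch (ParseException e) {
            throw new IllegalArgumentException("Bad timestamp: " + text, e);
        }
    }

    static String format(Date date) {
        return utcFormat().format(date);
    }

    // Returns {start, end} of the overlap, or null if the windows do not overlap.
    static Date[] overlap(Date sourceStart, Date sourceEnd,
                          Date targetStart, Date targetEnd) {
        Date start = sourceStart.after(targetStart) ? sourceStart : targetStart; // later start
        Date end = sourceEnd.before(targetEnd) ? sourceEnd : targetEnd;          // earlier end
        return start.before(end) ? new Date[] {start, end} : null;
    }

    public static void main(String[] args) {
        Date sourceStart = parse("2012-10-01T12:00Z");
        Date sourceEnd = parse("2099-01-01T00:00Z");

        // Target validities taken from the test case in the issue description.
        Date[] one = overlap(sourceStart, sourceEnd,
                parse("2012-10-01T12:05Z"), parse("2012-10-01T12:11Z"));
        Date[] two = overlap(sourceStart, sourceEnd,
                parse("2012-10-01T12:10Z"), parse("2012-10-01T12:26Z"));

        System.out.println("target one: " + format(one[0]) + " to " + format(one[1]));
        System.out.println("target two: " + format(two[0]) + " to " + format(two[1]));
    }
}
```

With the source validity from the issue, each overlap equals the corresponding target validity, which matches the expected replication windows listed in the description. Passing the source dates for both arguments, as the buggy code effectively did, would return the full source window instead.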

> Different validity of clusters in feed not handled 
> ---------------------------------------------------
>
>                 Key: FALCON-168
>                 URL: https://issues.apache.org/jira/browse/FALCON-168
>             Project: Falcon
>          Issue Type: Bug
>          Components: replication
>         Environment: QA
>            Reporter: Samarth Gupta
>            Assignee: Venkatesh Seetharam
>
> Feed.xml allows a different validity for the same feed on different clusters. When such a feed is scheduled, the validity of both source and target must be considered, and only the overlapping time duration should be replicated.
> The following test case fails with the patch available in FALCON-163:
> 1. source validity : start="2012-10-01T12:00Z" end="2099-01-01T00:00Z"
> 2. target one validity : start="2012-10-01T12:05Z" end="2012-10-01T12:11Z"
> 3. target two validity : start="2012-10-01T12:10Z" end="2012-10-01T12:26Z"
> What should happen:
> Based on the above, when the feed is scheduled, replication should run over:
> 1. target one : start="2012-10-01T12:05Z" and end="2012-10-01T12:11Z"
> 2. target two : start="2012-10-01T12:10Z" and end="2012-10-01T12:26Z"
> What is happening after the FALCON-163 patch:
> Both target one and target two replicate with: start="2012-10-01T12:00Z" and end="2099-01-01T00:00Z"
> Because of this, instances start on the target cluster when the feed is not valid there, and such replication fails with the error "variable [distcpTargetPaths] cannot be resolved".
> Refer to the feed available in FALCON-95 for details.



--
This message was sent by Atlassian JIRA
(v6.1#6144)