You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Siddharth Seth (JIRA)" <ji...@apache.org> on 2012/08/15 20:39:38 UTC

[jira] [Created] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Siddharth Seth created MAPREDUCE-4558:
-----------------------------------------

             Summary: TestJobTrackerSafeMode is failing
                 Key: MAPREDUCE-4558
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
             Project: Hadoop Map/Reduce
          Issue Type: Bug
    Affects Versions: 1.2.0
            Reporter: Siddharth Seth
            Assignee: Siddharth Seth


MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).

The test ends up getting stuck, with the following message repeated multiple times.
{code}
    [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
    [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
    [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
    [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
{code}



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Siddharth Seth (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Siddharth Seth updated MAPREDUCE-4558:
--------------------------------------

    Attachment: MR4558.txt

Trivial patch - which changes the check to 3 TTs instead of 2. (Also disables the test)

The test passes, but ends up generating cyclic attempt dir symlinks, which cause subsequent ant operations to fail.

build/test/logs/userlogs/job_20120815112512392_0001/attempt_20120815112512392_0001_m_000000_0/attempt_20120815112512392_0001_m_000000_0/attempt_20120815112512392_0001_m_000000_0/..

The unit test was enabled as part of MAPREDUCE-3837. I'd like to disable it again for now, with a separate jira to fix the symlinks.
                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Siddharth Seth (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Siddharth Seth resolved MAPREDUCE-4558.
---------------------------------------

       Resolution: Fixed
    Fix Version/s: 1.2.0
     Hadoop Flags: Reviewed

Committed to branch-1. Thanks for the review Arun.

Mayank, I've created MAPREDUCE-4563 for the symlinks..
                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>             Fix For: 1.2.0
>
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Mayank Bansal (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435620#comment-13435620 ] 

Mayank Bansal commented on MAPREDUCE-4558:
------------------------------------------

yeah sure, I can take a look. 

Sid,

Did you create a JIRA or I can do that as well?

Thanks,
Mayank
                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13462877#comment-13462877 ] 

Arun C Murthy commented on MAPREDUCE-4558:
------------------------------------------

Matt - if you don't mind, I'd like to merge this into branch-1.1 since it's been well baked-in. Thoughts?
                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>             Fix For: 1.2.0
>
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435530#comment-13435530 ] 

Arun C Murthy commented on MAPREDUCE-4558:
------------------------------------------

Pinging Mayank _after_ adding him to the watch list... that should help! :)
                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Matt Foley (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13465147#comment-13465147 ] 

Matt Foley commented on MAPREDUCE-4558:
---------------------------------------

Accepted.
                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.1.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>             Fix For: 1.1.0
>
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13435529#comment-13435529 ] 

Arun C Murthy commented on MAPREDUCE-4558:
------------------------------------------

bq. The unit test was enabled as part of MAPREDUCE-3837. I'd like to disable it again for now, with a separate jira to fix the symlinks.

+1, this test seems like it needs more work. Mayank, will you have time to look at this? Thanks.


                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAPREDUCE-4558) TestJobTrackerSafeMode is failing

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy updated MAPREDUCE-4558:
-------------------------------------

    Affects Version/s:     (was: 1.2.0)
                       1.1.0
        Fix Version/s:     (was: 1.2.0)
                       1.1.0

I merged this to branch-1.1 too.
                
> TestJobTrackerSafeMode is failing
> ---------------------------------
>
>                 Key: MAPREDUCE-4558
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4558
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.1.0
>            Reporter: Siddharth Seth
>            Assignee: Siddharth Seth
>             Fix For: 1.1.0
>
>         Attachments: MR4558.txt
>
>
> MAPREDUCE-1906 exposed an issue with this unit test. It has 3 TTs running, but has a check for the TT count to reach exactly 2 (which would be reached with a higher heartbeat interval).
> The test ends up getting stuck, with the following message repeated multiple times.
> {code}
>     [junit] 2012-08-15 11:26:46,299 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:47,301 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:48,302 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
>     [junit] 2012-08-15 11:26:49,303 INFO  mapred.TestJobTrackerSafeMode (TestJobTrackerSafeMode.java:checkTrackers(201)) - Waiting for Initialize all Task Trackers
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira