Posted to mapreduce-issues@hadoop.apache.org by "Jason Lowe (JIRA)" <ji...@apache.org> on 2012/06/26 23:48:44 UTC

[jira] [Created] (MAPREDUCE-4376) TestClusterMRNotification times out

Jason Lowe created MAPREDUCE-4376:
-------------------------------------

             Summary: TestClusterMRNotification times out
                 Key: MAPREDUCE-4376
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: mrv2, test
    Affects Versions: 2.0.1-alpha
            Reporter: Jason Lowe


The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kihwal Lee updated MAPREDUCE-4376:
----------------------------------

    Status: Patch Available  (was: Open)
    
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403848#comment-13403848 ] 

Hudson commented on MAPREDUCE-4376:
-----------------------------------

Integrated in Hadoop-Hdfs-trunk #1091 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1091/])
    MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)

     Result = FAILURE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java

                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Closed] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy closed MAPREDUCE-4376.
------------------------------------

    
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.0-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 0.23.3, 2.0.2-alpha
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403475#comment-13403475 ] 

Hudson commented on MAPREDUCE-4376:
-----------------------------------

Integrated in Hadoop-Mapreduce-trunk-Commit #2423 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2423/])
    MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)

     Result = FAILURE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java

                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kihwal Lee updated MAPREDUCE-4376:
----------------------------------

    Fix Version/s: 3.0.0
                   2.0.1-alpha
    
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kihwal Lee updated MAPREDUCE-4376:
----------------------------------

    Attachment: mapreduce-4376.patch

What this patch does:
- Fixes the NPE bug in {{RMContainerAllocator}}.
- Improves {{UtilsForTests}} by adding a timeout to the kill/fail job runner.
- Improves {{NotificationTestCase}} by having it check for more failure conditions.

{{TestJobHistory}}, {{TestJobInProgressListener}} and {{TestJobKillAndFail}} also call the kill/fail job runner in {{UtilsForTests}}. All three passed with the new timeout.
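
For readers unfamiliar with the pattern, a bounded wait of the kind described for the kill/fail job runner might look roughly like the sketch below. This is not the code from mapreduce-4376.patch: the class name BoundedWait, the method waitFor and all of its parameters are hypothetical, chosen only to show a polling loop that gives up with an IOException once a deadline passes.

```java
import java.io.IOException;
import java.util.function.BooleanSupplier;

// Hypothetical helper, not part of Hadoop or of the attached patch.
class BoundedWait {

    /**
     * Polls the condition until it returns true. If it is still false once
     * timeoutMs has elapsed, throws an IOException instead of looping forever.
     */
    static void waitFor(BooleanSupplier condition, long timeoutMs, long pollMs, String what)
            throws IOException, InterruptedException {
        // nanoTime is monotonic, so wall-clock adjustments do not affect the deadline.
        final long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
        while (!condition.getAsBoolean()) {
            if (System.nanoTime() - deadline >= 0) {
                throw new IOException(what + " didn't start in " + timeoutMs + " ms");
            }
            Thread.sleep(pollMs);
        }
    }

    public static void main(String[] args) throws Exception {
        long start = System.nanoTime();
        // Condition becomes true after roughly 50 ms, well inside the 2 s limit.
        waitFor(() -> System.nanoTime() - start > 50_000_000L, 2_000, 10, "Demo condition");
        System.out.println("condition met");

        try {
            // Condition never becomes true, so the wait is cut off at the deadline.
            waitFor(() -> false, 100, 10, "Job cleanup");
        } catch (IOException e) {
            System.out.println("timed out: " + e.getMessage());
        }
    }
}
```

With a helper like this, a test that would otherwise hang fails quickly with a descriptive exception.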
                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402319#comment-13402319 ] 

Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------

It used to be

job 1, SUCCEEDED, SUCCEEDED
job 2, KILLED, KILLED
job 3, FAILED, FAILED

Now it's getting

job 1, SUCCEEDED, SUCCEEDED
job 2, ERROR, ERROR

The test hangs after job 2. 

                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403409#comment-13403409 ] 

Hudson commented on MAPREDUCE-4376:
-----------------------------------

Integrated in Hadoop-Hdfs-trunk-Commit #2472 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2472/])
    MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)

     Result = SUCCESS
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java

                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403237#comment-13403237 ] 

Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------

- Also verified that the timeout works when the bug fix is missing.

{noformat}
-------------------------------------------------------------------------------
Test set: org.apache.hadoop.mapred.TestClusterMRNotification
-------------------------------------------------------------------------------
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 77.437 sec <<< FAILURE!
testMR(org.apache.hadoop.mapred.TestClusterMRNotification)  Time elapsed: 77.365 sec  <<< ERROR!
java.io.IOException: Job cleanup didn't start in 30 seconds
        at org.apache.hadoop.mapred.UtilsForTests.runJobKill(UtilsForTests.java:676)
        at org.apache.hadoop.mapred.NotificationTestCase.testMR(NotificationTestCase.java:174)
{noformat}
                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13408652#comment-13408652 ] 

Hudson commented on MAPREDUCE-4376:
-----------------------------------

Integrated in Hadoop-Hdfs-0.23-Build #306 (See [https://builds.apache.org/job/Hadoop-Hdfs-0.23-Build/306/])
    svn merge -c 1355124 FIXES: MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1358418)

     Result = UNSTABLE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1358418
Files : 
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java

                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 0.23.3, 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Assigned] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kihwal Lee reassigned MAPREDUCE-4376:
-------------------------------------

    Assignee: Kihwal Lee
    
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402359#comment-13402359 ] 

Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------

Relevant log entries:

{noformat}
2012-06-27 08:48:55,331 INFO [IPC Server handler 0 on 57856] org.apache.hadoop.mapreduce.v2.app.client.MRClientService: Kill Job received from client job_1340812108963_0002
2012-06-27 08:48:55,332 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1340812108963_0002Job Transitioned from RUNNING to KILL_WAIT
2012-06-27 08:48:55,332 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000000 Task Transitioned from SCHEDULED to KILL_WAIT
2012-06-27 08:48:55,332 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000001 Task Transitioned from SCHEDULED to KILL_WAIT
2012-06-27 08:48:55,333 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_r_000000 Task Transitioned from SCHEDULED to KILL_WAIT
2012-06-27 08:48:55,334 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1340812108963_0002_m_000000_0 TaskAttempt Transitioned from UNASSIGNED to KILLED
2012-06-27 08:48:55,334 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1340812108963_0002_m_000001_0 TaskAttempt Transitioned from UNASSIGNED to KILLED
2012-06-27 08:48:55,335 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1340812108963_0002_r_000000_0 TaskAttempt Transitioned from UNASSIGNED to KILLED
2012-06-27 08:48:55,335 INFO [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Processing the event EventType: CONTAINER_DEALLOCATE
2012-06-27 08:48:55,338 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000000 Task Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,338 INFO [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Processing the event EventType: CONTAINER_DEALLOCATE
2012-06-27 08:48:55,338 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000001 Task Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,338 INFO [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Processing the event EventType: CONTAINER_DEALLOCATE
2012-06-27 08:48:55,339 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_r_000000 Task Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,339 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: Num completed Tasks: 1
2012-06-27 08:48:55,339 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: Num completed Tasks: 2
2012-06-27 08:48:55,340 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: Num completed Tasks: 3
2012-06-27 08:48:55,341 ERROR [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Error in handling event type CONTAINER_DEALLOCATE to the ContainreAllocator
java.lang.NullPointerException
        at org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator$AssignedRequests.get(RMContainerAllocator.java:1103)
        at org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator.handleEvent(RMContainerAllocator.java:339)
        at org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator$1.run(RMContainerAllocator.java:191)
2012-06-27 08:48:55,348 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1340812108963_0002Job Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,348 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1340812108963_0002Job Transitioned from KILLED to ERROR
{noformat}

The code assumes that if the attempt ID is not found in scheduledRequests, it will be in assignedRequests. But in this case the attempt was still in the UNASSIGNED state when it was killed.
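
To illustrate the kind of guard that avoids this NPE, here is a minimal sketch. It is not the actual RMContainerAllocator code or the change in the patch: the class DeallocateSketch, its two plain maps, and the string return values are all hypothetical stand-ins for the allocator's scheduledRequests and assignedRequests bookkeeping.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical model of the lookup; real Hadoop types are replaced by strings.
class DeallocateSketch {
    // attempt ID -> pending request (stand-in for scheduledRequests)
    private final Map<String, String> scheduled = new HashMap<>();
    // attempt ID -> container ID (stand-in for assignedRequests)
    private final Map<String, String> assigned = new HashMap<>();

    void schedule(String attemptId) {
        scheduled.put(attemptId, "request-for-" + attemptId);
    }

    void assign(String attemptId, String containerId) {
        scheduled.remove(attemptId);
        assigned.put(attemptId, containerId);
    }

    /** Handles a deallocate event without assuming the attempt is in either map. */
    String handleDeallocate(String attemptId) {
        if (scheduled.remove(attemptId) != null) {
            return "removed-scheduled";
        }
        String containerId = assigned.get(attemptId);
        if (containerId == null) {
            // The attempt is in neither map; skip the release instead of dereferencing null.
            return "not-found";
        }
        assigned.remove(attemptId);
        return "released:" + containerId;
    }

    public static void main(String[] args) {
        DeallocateSketch allocator = new DeallocateSketch();
        allocator.schedule("attempt_m_000000_0");
        allocator.assign("attempt_m_000001_0", "container_01");
        System.out.println(allocator.handleDeallocate("attempt_m_000000_0"));
        System.out.println(allocator.handleDeallocate("attempt_m_000001_0"));
        System.out.println(allocator.handleDeallocate("attempt_r_000000_0"));
    }
}
```

The point of the sketch is only the null check between the lookup and the use of its result; how the real allocator should react to an unknown attempt is a decision for the patch itself.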
                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Robert Joseph Evans (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Joseph Evans updated MAPREDUCE-4376:
-------------------------------------------

    Fix Version/s: 0.23.3
    
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 0.23.3, 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403318#comment-13403318 ] 

Hadoop QA commented on MAPREDUCE-4376:
--------------------------------------

+1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12533846/mapreduce-4376.patch
  against trunk revision .

    +1 @author.  The patch does not contain any @author tags.

    +1 tests included.  The patch appears to include 2 new or modified test files.

    +1 javac.  The applied patch does not increase the total number of javac compiler warnings.

    +1 javadoc.  The javadoc tool did not generate any warning messages.

    +1 eclipse:eclipse.  The patch built with eclipse:eclipse.

    +1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) warnings.

    +1 release audit.  The applied patch does not increase the total number of release audit warnings.

    +1 core tests.  The patch passed unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient.

    +1 contrib tests.  The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2526//testReport/
Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2526//console

This message is automatically generated.
                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403413#comment-13403413 ] 

Hudson commented on MAPREDUCE-4376:
-----------------------------------

Integrated in Hadoop-Common-trunk-Commit #2404 (See [https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2404/])
    MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)

     Result = SUCCESS
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java

                
> TestClusterMRNotification times out
> -----------------------------------
>
>                 Key: MAPREDUCE-4376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2, test
>    Affects Versions: 2.0.1-alpha
>            Reporter: Jason Lowe
>            Assignee: Kihwal Lee
>             Fix For: 2.0.1-alpha, 3.0.0
>
>         Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out.  git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.


        

[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Robert Joseph Evans (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Joseph Evans updated MAPREDUCE-4376:
-------------------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

Thanks, Kihwal. I put this into trunk and branch-2.
                


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402499#comment-13402499 ] 

Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------

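There is a null check to handle transitions from the UNASSIGNED state, but it no longer works: assignedRequest.get() itself now throws an NPE after the following change from MAPREDUCE-3921. When no container is assigned to the attempt, the map lookup returns null and getId() is called on that null before the caller's check can run.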

{noformat}
     ContainerId get(TaskAttemptId tId) {
       if (tId.getTaskId().getTaskType().equals(TaskType.MAP)) {
-        return maps.get(tId);
+        return maps.get(tId).getId();
       } else {
-        return reduces.get(tId);
+        return reduces.get(tId).getId();
       }
     }
{noformat}
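
To make the failure mode concrete, here is a minimal, self-contained sketch. It is not the attached patch: the class name, the String attempt IDs, and the Container stand-in are hypothetical simplifications of the real Hadoop types. It only illustrates why dereferencing the lookup result defeats the caller's null check, and one way the lookup could be made null-safe again.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified model of the lookup shown in the diff above.
public class AssignedRequestsSketch {

    // Stand-in for a container object that carries an ID (assumption).
    static final class Container {
        private final String id;
        Container(String id) { this.id = id; }
        String getId() { return id; }
    }

    // Attempt ID -> assigned container; unassigned attempts have no entry.
    private final Map<String, Container> maps = new HashMap<>();

    void add(String attemptId, Container container) {
        maps.put(attemptId, container);
    }

    // Same shape as the changed code: throws NullPointerException
    // when the attempt has no assigned container.
    String getUnsafe(String attemptId) {
        return maps.get(attemptId).getId();
    }

    // Null-safe variant: returns null for unassigned attempts, so a
    // caller's existing null check keeps working.
    String getSafe(String attemptId) {
        Container container = maps.get(attemptId);
        return container == null ? null : container.getId();
    }

    public static void main(String[] args) {
        AssignedRequestsSketch requests = new AssignedRequestsSketch();
        requests.add("attempt_1", new Container("container_1"));

        System.out.println(requests.getSafe("attempt_1"));
        System.out.println(requests.getSafe("attempt_unassigned"));

        try {
            requests.getUnsafe("attempt_unassigned");
        } catch (NullPointerException e) {
            System.out.println("NPE for unassigned attempt");
        }
    }
}
```
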

Jason has also suggested we put a time limit in these jobs so that they don't hang even if something goes wrong.
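
As an illustration of that suggestion, a bounded wait could look like the sketch below. This is an assumption about the general technique only, not the code in the patch: the class name, the waitFor helper, and its parameters are hypothetical, and a real test would poll the job's completion status instead of the lambda used here.

```java
import java.util.function.BooleanSupplier;

// Hypothetical helper: poll a condition, but give up after a deadline
// instead of waiting forever.
public class BoundedWaitSketch {

    // Returns true if the condition became true before the timeout,
    // false if the deadline passed first.
    static boolean waitFor(BooleanSupplier done, long timeoutMs, long pollMs)
            throws InterruptedException {
        long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
        while (!done.getAsBoolean()) {
            if (System.nanoTime() - deadline >= 0) {
                return false; // timed out: the caller can fail the test
            }
            Thread.sleep(pollMs);
        }
        return true;
    }

    public static void main(String[] args) throws InterruptedException {
        long start = System.currentTimeMillis();
        // Simulated job that finishes after roughly 50 ms.
        boolean finished = waitFor(
                () -> System.currentTimeMillis() - start >= 50, 2000, 10);
        System.out.println("finished=" + finished);

        // Simulated hung job: the wait returns false instead of hanging.
        boolean hung = waitFor(() -> false, 100, 10);
        System.out.println("hung job finished=" + hung);
    }
}
```

With a helper of this kind, a hung job turns into an ordinary test failure rather than a harness-level timeout.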
                


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Robert Joseph Evans (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403399#comment-13403399 ] 

Robert Joseph Evans commented on MAPREDUCE-4376:
------------------------------------------------

The changes look good to me.  All of the changes are to test code, and Jenkins gave it a +1, so I give it a +1 too.  Thanks for the fixes, Kihwal. I'll check them in.
                


        

[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403921#comment-13403921 ] 

Hudson commented on MAPREDUCE-4376:
-----------------------------------

Integrated in Hadoop-Mapreduce-trunk #1124 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1124/])
    MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)

     Result = FAILURE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files : 
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java

                
