Posted to mapreduce-issues@hadoop.apache.org by "Jason Lowe (JIRA)" <ji...@apache.org> on 2012/06/26 23:48:44 UTC
[jira] [Created] (MAPREDUCE-4376) TestClusterMRNotification times out
Jason Lowe created MAPREDUCE-4376:
-------------------------------------
Summary: TestClusterMRNotification times out
Key: MAPREDUCE-4376
URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: mrv2, test
Affects Versions: 2.0.1-alpha
Reporter: Jason Lowe
The TestClusterMRNotification test is often timing out. git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kihwal Lee updated MAPREDUCE-4376:
----------------------------------
Status: Patch Available (was: Open)
> TestClusterMRNotification times out
> -----------------------------------
>
> Key: MAPREDUCE-4376
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mrv2, test
> Affects Versions: 2.0.1-alpha
> Reporter: Jason Lowe
> Assignee: Kihwal Lee
> Fix For: 2.0.1-alpha, 3.0.0
>
> Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out. git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403848#comment-13403848 ]
Hudson commented on MAPREDUCE-4376:
-----------------------------------
Integrated in Hadoop-Hdfs-trunk #1091 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1091/])
MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)
Result = FAILURE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java
[jira] [Closed] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Arun C Murthy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy closed MAPREDUCE-4376.
------------------------------------
> TestClusterMRNotification times out
> -----------------------------------
>
> Key: MAPREDUCE-4376
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4376
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mrv2, test
> Affects Versions: 2.0.0-alpha
> Reporter: Jason Lowe
> Assignee: Kihwal Lee
> Fix For: 0.23.3, 2.0.2-alpha
>
> Attachments: mapreduce-4376.patch
>
>
> The TestClusterMRNotification test is often timing out. git bisect tests narrowed it down to MAPREDUCE-3921, as the test consistently passes before that change and times out most of the time after picking up that change.
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403475#comment-13403475 ]
Hudson commented on MAPREDUCE-4376:
-----------------------------------
Integrated in Hadoop-Mapreduce-trunk-Commit #2423 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2423/])
MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)
Result = FAILURE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java
[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kihwal Lee updated MAPREDUCE-4376:
----------------------------------
Fix Version/s: 3.0.0, 2.0.1-alpha
[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kihwal Lee updated MAPREDUCE-4376:
----------------------------------
Attachment: mapreduce-4376.patch
What this patch does:
- Fixes the NPE bug in {{RMContainerAllocator}}.
- Improves {{UtilsForTests}} by making the kill/fail job runner time out instead of waiting indefinitely.
- Improves {{NotificationTestCase}} by having it check for more failure conditions.
{{TestJobHistory}}, {{TestJobInProgressListener}} and {{TestJobKillAndFail}} also call the kill/fail job runner in {{UtilsForTests}}. All of them passed with the new timeout.
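The timeout idea in the patch description can be illustrated with a generic polling helper. This is only a minimal sketch under stated assumptions, not the code from mapreduce-4376.patch: the class {{WaitWithTimeoutSketch}}, the method {{waitFor}}, and all of its parameters are hypothetical names, and the real {{UtilsForTests}} may be structured quite differently.

```java
import java.io.IOException;
import java.util.function.BooleanSupplier;

/** Hypothetical helper, not part of Hadoop: wait for a condition, but give up after a limit. */
public class WaitWithTimeoutSketch {

    /**
     * Polls the condition every pollMs milliseconds. Returns once it holds,
     * or throws IOException after timeoutMs milliseconds have elapsed.
     */
    static void waitFor(BooleanSupplier condition, long timeoutMs, long pollMs, String what)
            throws IOException {
        long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
        while (!condition.getAsBoolean()) {
            // Overflow-safe comparison against the monotonic clock.
            if (System.nanoTime() - deadline >= 0) {
                throw new IOException(what + " didn't happen in " + timeoutMs + " ms");
            }
            try {
                Thread.sleep(pollMs);
            } catch (InterruptedException e) {
                // Preserve the interrupt flag and stop waiting.
                Thread.currentThread().interrupt();
                throw new IOException("Interrupted while waiting for " + what, e);
            }
        }
    }

    public static void main(String[] args) throws IOException {
        long start = System.currentTimeMillis();
        // A condition that becomes true after roughly 50 ms.
        waitFor(() -> System.currentTimeMillis() - start >= 50, 1000, 10, "Job cleanup start");
        System.out.println("condition met");

        try {
            // A condition that never holds, so the helper has to give up.
            waitFor(() -> false, 100, 10, "Job cleanup start");
        } catch (IOException e) {
            System.out.println("timed out: " + e.getMessage());
        }
    }
}
```

The point of bounding the wait is that a test failure then surfaces as an exception with a message, rather than as a hang that only the build's global timeout can end.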
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402319#comment-13402319 ]
Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------
It used to be
job 1, SUCCEEDED, SUCCEEDED
job 2, KILLED, KILLED
job 3, FAILED, FAILED
Now it's getting
job 1, SUCCEEDED, SUCCEEDED
job 2, ERROR, ERROR
The test hangs after job 2.
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403409#comment-13403409 ]
Hudson commented on MAPREDUCE-4376:
-----------------------------------
Integrated in Hadoop-Hdfs-trunk-Commit #2472 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk-Commit/2472/])
MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)
Result = SUCCESS
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403237#comment-13403237 ]
Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------
- Also verified that the timeout works when the bug fix is missing.
{noformat}
-------------------------------------------------------------------------------
Test set: org.apache.hadoop.mapred.TestClusterMRNotification
-------------------------------------------------------------------------------
Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time elapsed: 77.437 sec <<< FAILURE!
testMR(org.apache.hadoop.mapred.TestClusterMRNotification) Time elapsed: 77.365 sec <<< ERROR!
java.io.IOException: Job cleanup didn't start in 30 seconds
at org.apache.hadoop.mapred.UtilsForTests.runJobKill(UtilsForTests.java:676)
at org.apache.hadoop.mapred.NotificationTestCase.testMR(NotificationTestCase.java:174)
{noformat}
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13408652#comment-13408652 ]
Hudson commented on MAPREDUCE-4376:
-----------------------------------
Integrated in Hadoop-Hdfs-0.23-Build #306 (See [https://builds.apache.org/job/Hadoop-Hdfs-0.23-Build/306/])
svn merge -c 1355124 FIXES: MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1358418)
Result = UNSTABLE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1358418
Files :
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/branches/branch-0.23/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java
[jira] [Assigned] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kihwal Lee reassigned MAPREDUCE-4376:
-------------------------------------
Assignee: Kihwal Lee
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402359#comment-13402359 ]
Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------
Relevant log entries:
{noformat}
2012-06-27 08:48:55,331 INFO [IPC Server handler 0 on 57856] org.apache.hadoop.mapreduce.v2.app.client.MRClientService: Kill Job received from client job_1340812108963_0002
2012-06-27 08:48:55,332 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1340812108963_0002Job Transitioned from RUNNING to KILL_WAIT
2012-06-27 08:48:55,332 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000000 Task Transitioned from SCHEDULED to KILL_WAIT
2012-06-27 08:48:55,332 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000001 Task Transitioned from SCHEDULED to KILL_WAIT
2012-06-27 08:48:55,333 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_r_000000 Task Transitioned from SCHEDULED to KILL_WAIT
2012-06-27 08:48:55,334 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1340812108963_0002_m_000000_0 TaskAttempt Transitioned from UNASSIGNED to KILLED
2012-06-27 08:48:55,334 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1340812108963_0002_m_000001_0 TaskAttempt Transitioned from UNASSIGNED to KILLED
2012-06-27 08:48:55,335 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskAttemptImpl: attempt_1340812108963_0002_r_000000_0 TaskAttempt Transitioned from UNASSIGNED to KILLED
2012-06-27 08:48:55,335 INFO [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Processing the event EventType: CONTAINER_DEALLOCATE
2012-06-27 08:48:55,338 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000000 Task Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,338 INFO [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Processing the event EventType: CONTAINER_DEALLOCATE
2012-06-27 08:48:55,338 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_m_000001 Task Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,338 INFO [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Processing the event EventType: CONTAINER_DEALLOCATE
2012-06-27 08:48:55,339 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.TaskImpl: task_1340812108963_0002_r_000000 Task Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,339 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: Num completed Tasks: 1
2012-06-27 08:48:55,339 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: Num completed Tasks: 2
2012-06-27 08:48:55,340 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: Num completed Tasks: 3
2012-06-27 08:48:55,341 ERROR [Thread-45] org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator: Error in handling event type CONTAINER_DEALLOCATE to the ContainreAllocator
java.lang.NullPointerException
at org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator$AssignedRequests.get(RMContainerAllocator.java:1103)
at org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator.handleEvent(RMContainerAllocator.java:339)
at org.apache.hadoop.mapreduce.v2.app.rm.RMContainerAllocator$1.run(RMContainerAllocator.java:191)
2012-06-27 08:48:55,348 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1340812108963_0002Job Transitioned from KILL_WAIT to KILLED
2012-06-27 08:48:55,348 INFO [AsyncDispatcher event handler] org.apache.hadoop.mapreduce.v2.app.job.impl.JobImpl: job_1340812108963_0002Job Transitioned from KILLED to ERROR
{noformat}
The code assumes that if the attempt ID is not found in scheduledRequests, it will be in assignedRequests. But in this case it was in neither, because the attempt was still in the UNASSIGNED state.
[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Robert Joseph Evans (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Joseph Evans updated MAPREDUCE-4376:
-------------------------------------------
Fix Version/s: 0.23.3
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Hadoop QA (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403318#comment-13403318 ]
Hadoop QA commented on MAPREDUCE-4376:
--------------------------------------
+1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12533846/mapreduce-4376.patch
against trunk revision .
+1 @author. The patch does not contain any @author tags.
+1 tests included. The patch appears to include 2 new or modified test files.
+1 javac. The applied patch does not increase the total number of javac compiler warnings.
+1 javadoc. The javadoc tool did not generate any warning messages.
+1 eclipse:eclipse. The patch built with eclipse:eclipse.
+1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
+1 release audit. The applied patch does not increase the total number of release audit warnings.
+1 core tests. The patch passed unit tests in hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient.
+1 contrib tests. The patch passed contrib unit tests.
Test results: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2526//testReport/
Console output: https://builds.apache.org/job/PreCommit-MAPREDUCE-Build/2526//console
This message is automatically generated.
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403413#comment-13403413 ]
Hudson commented on MAPREDUCE-4376:
-----------------------------------
Integrated in Hadoop-Common-trunk-Commit #2404 (See [https://builds.apache.org/job/Hadoop-Common-trunk-Commit/2404/])
MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)
Result = SUCCESS
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java
[jira] [Updated] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Robert Joseph Evans (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Joseph Evans updated MAPREDUCE-4376:
-------------------------------------------
Resolution: Fixed
Status: Resolved (was: Patch Available)
Thanks Kihwal, I put this into trunk and branch-2
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Kihwal Lee (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13402499#comment-13402499 ]
Kihwal Lee commented on MAPREDUCE-4376:
---------------------------------------
There is a null check to handle transitions from the UNASSIGNED state, but it no longer works because assignedRequests.get() itself throws an NPE after the following change from MAPREDUCE-3921.
{noformat}
 ContainerId get(TaskAttemptId tId) {
   if (tId.getTaskId().getTaskType().equals(TaskType.MAP)) {
-    return maps.get(tId);
+    return maps.get(tId).getId();
   } else {
-    return reduces.get(tId);
+    return reduces.get(tId).getId();
   }
 }
{noformat}
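To illustrate the failure mode in the diff above, here is a minimal, self-contained sketch of a lookup that returns null for an unassigned attempt instead of throwing. It is not the committed patch. The class AssignedRequestsSketch, its Container type, and the String/boolean keys are hypothetical stand-ins for the real Hadoop types; only the maps/reduces split and the getId() call mirror the diff.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the structure shown in the diff; not Hadoop code.
final class AssignedRequestsSketch {

    /** Hypothetical container holder; getId() mirrors the call added in the diff. */
    static final class Container {
        private final String id;

        Container(String id) {
            this.id = id;
        }

        String getId() {
            return id;
        }
    }

    private final Map<String, Container> maps = new HashMap<>();
    private final Map<String, Container> reduces = new HashMap<>();

    /** Records that a container was assigned to the given attempt. */
    void add(String attemptId, boolean isMap, Container container) {
        (isMap ? maps : reduces).put(attemptId, container);
    }

    /**
     * Returns the container id for the attempt, or null if the attempt was
     * never assigned. The map lookup is checked before getId() is called,
     * so callers can keep using their own null check.
     */
    String get(String attemptId, boolean isMap) {
        Container container = (isMap ? maps : reduces).get(attemptId);
        return container == null ? null : container.getId();
    }

    public static void main(String[] args) {
        AssignedRequestsSketch assigned = new AssignedRequestsSketch();
        assigned.add("attempt_m_0", true, new Container("container_1"));

        // Assigned attempt: the container id is returned.
        System.out.println(assigned.get("attempt_m_0", true));
        // Unassigned attempt: null is returned rather than an NPE being thrown.
        System.out.println(assigned.get("attempt_m_1", true));
    }
}
```

With a lookup shaped like this, a caller's existing null check for the UNASSIGNED case would be reached again; whether the actual fix guards inside get() or at the call site is not stated in this thread.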
Jason has also suggested we put a time limit in these jobs so that they don't hang even if something goes wrong.
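One way to picture the suggested time limit is a bounded wait: poll a completion condition and give up once a deadline passes, so the test fails quickly instead of hanging. The sketch below is an assumption about the general approach, not the change made in the attached patch; BoundedWaitSketch, waitFor, and its parameters are hypothetical names.

```java
import java.util.function.BooleanSupplier;

// Hypothetical helper; illustrates a deadline-bounded wait, not the actual patch.
final class BoundedWaitSketch {

    /**
     * Polls the condition until it is true or the timeout elapses.
     * Returns true if the condition became true in time, false on timeout
     * or if the waiting thread was interrupted.
     */
    static boolean waitFor(BooleanSupplier done, long timeoutMillis, long pollMillis) {
        long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
        while (!done.getAsBoolean()) {
            if (System.nanoTime() - deadline >= 0) {
                return false; // deadline passed: report failure instead of hanging
            }
            try {
                Thread.sleep(pollMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt status
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        long start = System.currentTimeMillis();
        // Condition that becomes true after roughly 100 ms.
        boolean finished = waitFor(() -> System.currentTimeMillis() - start >= 100, 2000, 10);
        // Condition that never becomes true; the wait ends at the 200 ms limit.
        boolean hung = waitFor(() -> false, 200, 10);
        System.out.println("finished=" + finished + " hung=" + hung);
    }
}
```

In a test, a false return would typically be turned into an assertion failure so the run reports the stuck job rather than timing out at the build level.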
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Robert Joseph Evans (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403399#comment-13403399 ]
Robert Joseph Evans commented on MAPREDUCE-4376:
------------------------------------------------
The changes look good to me. All of the changes are to test code, and Jenkins gave it a +1, so I give it a +1 too. Thanks for the fixes, Kihwal. I'll check them in.
[jira] [Commented] (MAPREDUCE-4376) TestClusterMRNotification times out
Posted by "Hudson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/MAPREDUCE-4376?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13403921#comment-13403921 ]
Hudson commented on MAPREDUCE-4376:
-----------------------------------
Integrated in Hadoop-Mapreduce-trunk #1124 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1124/])
MAPREDUCE-4376. TestClusterMRNotification times out (Kihwal Lee via bobby) (Revision 1355124)
Result = FAILURE
bobby : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1355124
Files :
* /hadoop/common/trunk/hadoop-mapreduce-project/CHANGES.txt
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/RMContainerAllocator.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/NotificationTestCase.java
* /hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient/src/test/java/org/apache/hadoop/mapred/UtilsForTests.java