Posted to common-dev@hadoop.apache.org by "Dave Latham (JIRA)" <ji...@apache.org> on 2009/05/05 00:48:30 UTC

[jira] Created: (HADOOP-5770) Deadlock in JobTracker initJobs

Deadlock in JobTracker initJobs
-------------------------------

                 Key: HADOOP-5770
                 URL: https://issues.apache.org/jira/browse/HADOOP-5770
             Project: Hadoop Core
          Issue Type: Bug
          Components: mapred
    Affects Versions: 0.19.1
         Environment: Hadoop 0.19.1 in a test environment with 2 Task Trackers
            Reporter: Dave Latham
            Priority: Critical


Found one Java-level deadlock:
=============================
"SocketListener0-26":
  waiting to lock monitor 0x08ed5ce4 (object 0x567924c0, a org.apache.hadoop.mapred.JobTracker),
  which is held by "IPC Server handler 1 on 9001"
"IPC Server handler 1 on 9001":
  waiting to lock monitor 0x08f7da88 (object 0x5744f5b8, a org.apache.hadoop.mapred.JobInProgress),
  which is held by "initJobs"
"initJobs":
  waiting to lock monitor 0x08ed5ce4 (object 0x567924c0, a org.apache.hadoop.mapred.JobTracker),
  which is held by "IPC Server handler 1 on 9001"

Java stack information for the threads listed above:
===================================================
"SocketListener0-26":
        at org.apache.hadoop.mapred.JobTracker.getClusterStatus(JobTracker.java:2313)
        - waiting to lock <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at org.apache.hadoop.mapred.jobtracker_jsp._jspService(jobtracker_jsp.java:104)
        at org.apache.jasper.runtime.HttpJspBase.service(HttpJspBase.java:94)
        at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
        at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
        at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
        at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
        at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
        at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
        at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
        at org.mortbay.http.HttpServer.service(HttpServer.java:954)
        at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
        at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
        at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
        at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
        at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
        at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
"IPC Server handler 1 on 9001":
        at org.apache.hadoop.mapred.JobInProgress.obtainTaskCleanupTask(JobInProgress.java:935)
        - waiting to lock <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.JobTracker.getSetupAndCleanupTasks(JobTracker.java:2167)
        - locked <0x56795708> (a java.util.TreeMap)
        - locked <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at org.apache.hadoop.mapred.JobTracker.heartbeat(JobTracker.java:1902)
        - locked <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at sun.reflect.GeneratedMethodAccessor3278.invoke(Unknown Source)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:481)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:894)
"initJobs":
        at org.apache.hadoop.mapred.JobTracker.finalizeJob(JobTracker.java:1539)
        - waiting to lock <0x567924c0> (a org.apache.hadoop.mapred.JobTracker)
        at org.apache.hadoop.mapred.JobInProgress.garbageCollect(JobInProgress.java:2320)
        - locked <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.JobInProgress.terminateJob(JobInProgress.java:2004)
        - locked <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.JobInProgress.initTasks(JobInProgress.java:472)
        - locked <0x575b7ec8> (a org.apache.hadoop.mapred.JobInProgress$JobInitKillStatus)
        - locked <0x5744f5b8> (a org.apache.hadoop.mapred.JobInProgress)
        at org.apache.hadoop.mapred.EagerTaskInitializationListener$JobInitThread.run(EagerTaskInitializationListener.java:55)

Found 1 deadlock.
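
Read as a cycle, the trace shows two paths taking the same pair of monitors in opposite order: heartbeat holds the JobTracker monitor and waits for the JobInProgress monitor, while initJobs holds the JobInProgress monitor and waits for the JobTracker monitor. A minimal sketch of one common remedy, a single global lock order, follows. It is not the Hadoop code and does not claim to be how any actual fix works; LockOrderSketch, trackerLock, jobLock, heartbeatPath and initPath are hypothetical names used only for illustration.

```java
import java.util.concurrent.atomic.AtomicInteger;

public class LockOrderSketch {
    // Hypothetical stand-ins for the JobTracker and JobInProgress monitors.
    static final Object trackerLock = new Object();
    static final Object jobLock = new Object();
    static final AtomicInteger completed = new AtomicInteger();

    // Heartbeat-like path: tracker first, then job (same order as in the trace).
    static void heartbeatPath() {
        synchronized (trackerLock) {
            synchronized (jobLock) {
                completed.incrementAndGet();
            }
        }
    }

    // Init-like path. In the trace this path held the job monitor first and
    // then asked for the tracker monitor. Here it is assumed the work can be
    // reordered so the tracker monitor is taken first.
    static void initPath() {
        synchronized (trackerLock) {
            synchronized (jobLock) {
                completed.incrementAndGet();
            }
        }
    }

    // Runs both paths concurrently and returns how many finished.
    static int runBoth() {
        completed.set(0);
        Thread a = new Thread(LockOrderSketch::heartbeatPath, "heartbeat");
        Thread b = new Thread(LockOrderSketch::initPath, "initJobs");
        a.start();
        b.start();
        try {
            a.join();
            b.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return completed.get();
    }

    public static void main(String[] args) {
        System.out.println("completed paths: " + runBoth());
    }
}
```

Because both paths take trackerLock before jobLock, neither can hold one monitor while waiting for the other in the opposite order, so the wait cycle shown in the dump cannot form between them.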

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HADOOP-5770) Deadlock in JobTracker initJobs

Posted by "Vinod K V (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HADOOP-5770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12709780#action_12709780 ] 

Vinod K V commented on HADOOP-5770:
-----------------------------------

I think this is fixed as part of HADOOP-5285 in 0.19.2 and above. Can you please confirm?

