Posted to common-dev@hadoop.apache.org by "Xing Shi (JIRA)" <ji...@apache.org> on 2009/05/04 08:16:30 UTC

[jira] Commented: (HADOOP-5760) Task process hanging on an RPC call

    [ https://issues.apache.org/jira/browse/HADOOP-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12705496#action_12705496 ] 

Xing Shi commented on HADOOP-5760:
----------------------------------

I also found another hang, in a child Java process that has been stuck for several days:

"main" prio=10 tid=0x0000000040114000 nid=0x6900 in Object.wait() [0x000000004022a000..0x000000004022af70]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x0000002a9eb13658> (a java.util.LinkedList)
        at java.lang.Object.wait(Object.java:485)
        at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.flushInternal(DFSClient.java:3041)
        - locked <0x0000002a9eb13658> (a java.util.LinkedList)
        - locked <0x0000002a9eb12458> (a org.apache.hadoop.hdfs.DFSClient$DFSOutputStream)
        at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.closeInternal(DFSClient.java:3130)
        - locked <0x0000002a9eb12458> (a org.apache.hadoop.hdfs.DFSClient$DFSOutputStream)
        at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.close(DFSClient.java:3079)
        at org.apache.hadoop.fs.FSDataOutputStream$PositionCache.close(FSDataOutputStream.java:61)
        at org.apache.hadoop.fs.FSDataOutputStream.close(FSDataOutputStream.java:86)
        at org.apache.hadoop.mapred.TextOutputFormat$LineRecordWriter.close(TextOutputFormat.java:102)
        - locked <0x0000002a9eb26630> (a org.apache.hadoop.mapred.TextOutputFormat$LineRecordWriter)
        at org.apache.hadoop.mapred.MapTask$DirectMapOutputCollector.close(MapTask.java:375)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:337)
        at org.apache.hadoop.mapred.Child.main(Child.java:174)
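
For illustration only: the trace above shows the main thread parked in an untimed Object.wait() on a LinkedList monitor inside flushInternal. A wait like that never returns if the expected notify() does not arrive. The sketch below is not the DFSClient code; the class BoundedWaitSketch, the method awaitDrained, and the assumption that the thread is waiting for a queue to become empty are all hypothetical. It only shows how a bounded wait lets the caller detect the stall instead of hanging for days.

```java
import java.util.LinkedList;

public class BoundedWaitSketch {

    /**
     * Hypothetical helper: waits until the queue is empty or the timeout
     * elapses. Returns true if the queue drained, false on timeout.
     */
    public static boolean awaitDrained(LinkedList<?> queue, long timeoutMillis)
            throws InterruptedException {
        long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
        synchronized (queue) {
            // Loop guards against spurious wakeups and early notifies.
            while (!queue.isEmpty()) {
                long remainingNanos = deadline - System.nanoTime();
                if (remainingNanos <= 0) {
                    return false; // gave up: nobody drained the queue in time
                }
                // wait(0) means "wait forever", so never pass 0 here.
                long remainingMillis = Math.max(1L, remainingNanos / 1_000_000L);
                queue.wait(remainingMillis);
            }
            return true;
        }
    }

    public static void main(String[] args) throws InterruptedException {
        LinkedList<String> pending = new LinkedList<>();
        pending.add("packet-1");
        // No other thread ever removes the entry or calls notify(),
        // which simulates an acknowledgement that never arrives.
        boolean drained = awaitDrained(pending, 200);
        System.out.println("drained=" + drained); // prints "drained=false"
    }
}
```

With an untimed queue.wait() in place of the timed call, main() would block indefinitely, which matches the WAITING (on object monitor) state in the dump. Whether a timeout is the right fix for this issue is a separate question; the sketch only demonstrates the mechanism.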


> Task process hanging on an RPC call
> -----------------------------------
>
>                 Key: HADOOP-5760
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5760
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: ipc
>            Reporter: Devaraj Das
>
> On a random node on a cluster, I found one task process waiting on an RPC call. The process has been in that state for a few days at least.
> "main" prio=10 tid=0x08069400 nid=0x6f52 in Object.wait() [0xf7e6c000..0xf7e6d1f8]
>    java.lang.Thread.State: WAITING (on object monitor)
>         at java.lang.Object.wait(Native Method)
>         - waiting on <0xf1215700> (a org.apache.hadoop.ipc.Client$Call)
>         at java.lang.Object.wait(Object.java:485)
>         at org.apache.hadoop.ipc.Client.call(Client.java:725)
>         - locked <0xf1215700> (a org.apache.hadoop.ipc.Client$Call)
>         at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
>         at org.apache.hadoop.mapred.$Proxy0.statusUpdate(Unknown Source)
>         at org.apache.hadoop.mapred.Task.statusUpdate(Task.java:691)
>         at org.apache.hadoop.mapred.Task.taskCleanup(Task.java:795)
>         at org.apache.hadoop.mapred.Child.main(Child.java:176)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.