You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Andrew Purtell (JIRA)" <ji...@apache.org> on 2009/02/11 19:46:59 UTC

[jira] Created: (HBASE-1196) OOME in HRS IPC server causes infinite client stall

OOME in HRS IPC server causes infinite client stall
---------------------------------------------------

                 Key: HBASE-1196
                 URL: https://issues.apache.org/jira/browse/HBASE-1196
             Project: Hadoop HBase
          Issue Type: Bug
            Reporter: Andrew Purtell
            Priority: Critical


OOME in IPC server handler causes the IPC handler to abort, but the client never learns about this, so it waits and waits and waits... I have seen Heritrix writer threads that have been waiting for 7+ hours. And, the OOME does not take down the HRS, so it stays up in some degraded state.  E.g.:

java.lang.OutOfMemoryError: Java heap space
Dumping heap to java_pid13008.hprof
Exception in thread "IPC Server handler 5 on 60020" java.lang.OutOfMemoryError: Java heap space
at java.util.Arrays.copyOf(Arrays.java:2786)
at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
at java.io.DataOutputStream.write(DataOutputStream.java:90)
at org.apache.hadoop.hbase.util.Bytes.writeByteArray(Bytes.java:82)
at org.apache.hadoop.hbase.io.Cell.write(Cell.java:162)
at org.apache.hadoop.hbase.io.HbaseMapWritable.write(HbaseMapWritable.java:200)
at org.apache.hadoop.hbase.io.RowResult.write(RowResult.java:249)
at org.apache.hadoop.hbase.io.HbaseObjectWritable.writeObject(HbaseObjectWritable.java:300)
at org.apache.hadoop.hbase.io.HbaseObjectWritable.write(HbaseObjectWritable.java:262)
at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:917)
Exception in thread "IPC Server handler 7 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 4 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 2 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 3 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 0 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 6 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 9 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 1 on 60020" java.lang.OutOfMemoryError: Java heap space
Exception in thread "IPC Server handler 8 on 60020" java.lang.OutOfMemoryError: Java heap space


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HBASE-1196) OOME in HRS IPC server causes infinite client stall

Posted by "Andrew Purtell (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-1196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12672720#action_12672720 ] 

Andrew Purtell commented on HBASE-1196:
---------------------------------------

Specifically in the case of my usage pattern, an OOME cascade like the above will damage IPC during a scan, and subsequent writes from the client are what stall forever. 

> OOME in HRS IPC server causes infinite client stall
> ---------------------------------------------------
>
>                 Key: HBASE-1196
>                 URL: https://issues.apache.org/jira/browse/HBASE-1196
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: Andrew Purtell
>            Assignee: Andrew Purtell
>            Priority: Critical
>
> OOME in IPC server handler causes the IPC handler to abort, but the client never learns about this, so it waits and waits and waits... I have seen Heritrix writer threads that have been waiting for 7+ hours. And, the OOME does not take down the HRS, so it stays up in some degraded state.  E.g.:
> java.lang.OutOfMemoryError: Java heap space
> Dumping heap to java_pid13008.hprof
> Exception in thread "IPC Server handler 5 on 60020" java.lang.OutOfMemoryError: Java heap space
> at java.util.Arrays.copyOf(Arrays.java:2786)
> at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
> at java.io.DataOutputStream.write(DataOutputStream.java:90)
> at org.apache.hadoop.hbase.util.Bytes.writeByteArray(Bytes.java:82)
> at org.apache.hadoop.hbase.io.Cell.write(Cell.java:162)
> at org.apache.hadoop.hbase.io.HbaseMapWritable.write(HbaseMapWritable.java:200)
> at org.apache.hadoop.hbase.io.RowResult.write(RowResult.java:249)
> at org.apache.hadoop.hbase.io.HbaseObjectWritable.writeObject(HbaseObjectWritable.java:300)
> at org.apache.hadoop.hbase.io.HbaseObjectWritable.write(HbaseObjectWritable.java:262)
> at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:917)
> Exception in thread "IPC Server handler 7 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 4 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 2 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 3 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 0 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 6 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 9 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 1 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 8 on 60020" java.lang.OutOfMemoryError: Java heap space

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Assigned: (HBASE-1196) OOME in HRS IPC server causes infinite client stall

Posted by "Andrew Purtell (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Purtell reassigned HBASE-1196:
-------------------------------------

    Assignee: Andrew Purtell

> OOME in HRS IPC server causes infinite client stall
> ---------------------------------------------------
>
>                 Key: HBASE-1196
>                 URL: https://issues.apache.org/jira/browse/HBASE-1196
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: Andrew Purtell
>            Assignee: Andrew Purtell
>            Priority: Critical
>
> OOME in IPC server handler causes the IPC handler to abort, but the client never learns about this, so it waits and waits and waits... I have seen Heritrix writer threads that have been waiting for 7+ hours. And, the OOME does not take down the HRS, so it stays up in some degraded state.  E.g.:
> java.lang.OutOfMemoryError: Java heap space
> Dumping heap to java_pid13008.hprof
> Exception in thread "IPC Server handler 5 on 60020" java.lang.OutOfMemoryError: Java heap space
> at java.util.Arrays.copyOf(Arrays.java:2786)
> at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
> at java.io.DataOutputStream.write(DataOutputStream.java:90)
> at org.apache.hadoop.hbase.util.Bytes.writeByteArray(Bytes.java:82)
> at org.apache.hadoop.hbase.io.Cell.write(Cell.java:162)
> at org.apache.hadoop.hbase.io.HbaseMapWritable.write(HbaseMapWritable.java:200)
> at org.apache.hadoop.hbase.io.RowResult.write(RowResult.java:249)
> at org.apache.hadoop.hbase.io.HbaseObjectWritable.writeObject(HbaseObjectWritable.java:300)
> at org.apache.hadoop.hbase.io.HbaseObjectWritable.write(HbaseObjectWritable.java:262)
> at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:917)
> Exception in thread "IPC Server handler 7 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 4 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 2 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 3 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 0 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 6 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 9 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 1 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 8 on 60020" java.lang.OutOfMemoryError: Java heap space

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (HBASE-1196) OOME in HRS IPC server causes infinite client stall

Posted by "Andrew Purtell (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Purtell resolved HBASE-1196.
-----------------------------------

       Resolution: Fixed
    Fix Version/s: 0.20.0
                   0.19.1

Resolved by subtask.

> OOME in HRS IPC server causes infinite client stall
> ---------------------------------------------------
>
>                 Key: HBASE-1196
>                 URL: https://issues.apache.org/jira/browse/HBASE-1196
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: Andrew Purtell
>            Assignee: Andrew Purtell
>            Priority: Critical
>             Fix For: 0.19.1, 0.20.0
>
>
> OOME in IPC server handler causes the IPC handler to abort, but the client never learns about this, so it waits and waits and waits... I have seen Heritrix writer threads that have been waiting for 7+ hours. And, the OOME does not take down the HRS, so it stays up in some degraded state.  E.g.:
> java.lang.OutOfMemoryError: Java heap space
> Dumping heap to java_pid13008.hprof
> Exception in thread "IPC Server handler 5 on 60020" java.lang.OutOfMemoryError: Java heap space
> at java.util.Arrays.copyOf(Arrays.java:2786)
> at java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:94)
> at java.io.DataOutputStream.write(DataOutputStream.java:90)
> at org.apache.hadoop.hbase.util.Bytes.writeByteArray(Bytes.java:82)
> at org.apache.hadoop.hbase.io.Cell.write(Cell.java:162)
> at org.apache.hadoop.hbase.io.HbaseMapWritable.write(HbaseMapWritable.java:200)
> at org.apache.hadoop.hbase.io.RowResult.write(RowResult.java:249)
> at org.apache.hadoop.hbase.io.HbaseObjectWritable.writeObject(HbaseObjectWritable.java:300)
> at org.apache.hadoop.hbase.io.HbaseObjectWritable.write(HbaseObjectWritable.java:262)
> at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:917)
> Exception in thread "IPC Server handler 7 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 4 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 2 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 3 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 0 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 6 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 9 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 1 on 60020" java.lang.OutOfMemoryError: Java heap space
> Exception in thread "IPC Server handler 8 on 60020" java.lang.OutOfMemoryError: Java heap space

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.