You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@hbase.apache.org by Pere Kyle <pe...@whisper.sh> on 2014/11/14 02:54:31 UTC

Avoid GC Pauses on Scan MapReduces

Hi there,

Recently I have been experiencing instability when scanning our HBASE cluster. The table we are trying to scan is 1.5B records 1TB, we have 12GB heap and 17 servers. Our GC options are as so:
-XX:OnOutOfMemoryError=kill -9 %p -Xmx12000m -XX:+UseConcMarkSweepGC -Xmx12g -Xmx12g

The errors I am seeing in the logs:
2014-11-14 00:00:32,656 WARN org.apache.hadoop.hbase.util.Sleeper (regionserver60020): We slept 17355ms instead of 3000ms, this is likely due to a long garbage collecting pause and it's usually bad, see http://hbase.apache.org/book.html#trouble.rs.runtime.zkexpired
2014-11-14 00:00:32,663 WARN org.apache.hadoop.hbase.util.Sleeper (regionserver60020.periodicFlusher): We slept 24734ms instead of 10000ms, this is likely due to a long garbage collecting pause and it's usually bad, see http://hbase.apache.org/book.html#trouble.rs.runtime.zkexpired

We are using hbase.client.scanner.caching=1000. I suspect this may be a block cache issue. My question is if/how to disable the block cache for the scan queries? This is taking out writes and causing instability on the cluster.

Thanks,
Pere