You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "ChiaPing Tsai (JIRA)" <ji...@apache.org> on 2017/01/23 09:24:26 UTC
[jira] [Updated] (HBASE-17510) DefaultMemStore gets the wrong heap
size after rollback
[ https://issues.apache.org/jira/browse/HBASE-17510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ChiaPing Tsai updated HBASE-17510:
----------------------------------
Assignee: ChiaPing Tsai
Status: Patch Available (was: Open)
> DefaultMemStore gets the wrong heap size after rollback
> -------------------------------------------------------
>
> Key: HBASE-17510
> URL: https://issues.apache.org/jira/browse/HBASE-17510
> Project: HBase
> Issue Type: Bug
> Reporter: ChiaPing Tsai
> Assignee: ChiaPing Tsai
> Fix For: 1.4.0
>
> Attachments: HBASE-17510.branch-1.v0.patch
>
>
> We should calculate the size of “found” rather than “cell” because the offset value may cause the difference heap size between “cell” and “found”.
> {code:title=DefaultMemStore.java|borderStyle=solid}
> @Override
> public void rollback(Cell cell) {
> // If the key is in the memstore, delete it. Update this.size.
> found = this.cellSet.get(cell);
> if (found != null && found.getSequenceId() == cell.getSequenceId()) {
> removeFromCellSet(cell);
> long s = heapSizeChange(cell, true);
> this.size.addAndGet(-s);
> }
> }
> {code}
> {code:title=KeyValue.java|borderStyle=solid}
> @Override
> public long heapSize() {
> return ClassSize.align(sum) +
> (offset == 0
> ? ClassSize.sizeOf(bytes, length) // count both length and object overhead
> : length); // only count the number of bytes
> }
> {code}
> The wrong heap size of store will block the HRegion#doClose because the HRegion#memstoreSize will always be bigger than zero even if we flush the store.
> {code:title=HRegion.java|borderStyle=solid}
> while (this.memstoreSize.get() > 0) {
> try {
> if (flushCount++ > 0) {
> int actualFlushes = flushCount - 1;
> if (actualFlushes > 5) {
> // If we tried 5 times and are unable to clear memory, abort
> // so we do not lose data
> throw new DroppedSnapshotException("Failed clearing memory after " +
> actualFlushes + " attempts on region: " +
> Bytes.toStringBinary(getRegionInfo().getRegionName()));
> }
> LOG.info("Running extra flush, " + actualFlushes +
> " (carrying snapshot?) " + this);
> }
> internalFlushcache(status);
> } catch (IOException ioe) {
> status.setStatus("Failed flush " + this + ", putting online again");
> synchronized (writestate) {
> writestate.writesEnabled = true;
> }
> // Have to throw to upper layers. I can't abort server from here.
> throw ioe;
> }
> }
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)