Posted to issues@hbase.apache.org by "Hadoop QA (JIRA)" <ji...@apache.org> on 2016/11/16 15:09:58 UTC
[jira] [Commented] (HBASE-17113) finding middle key in HFileV2 is always wrong and can cause IndexOutOfBoundsException

    [ https://issues.apache.org/jira/browse/HBASE-17113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15670651#comment-15670651 ] 

Hadoop QA commented on HBASE-17113:
-----------------------------------

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 0s {color} | {color:blue} Docker mode activated. {color} |
| {color:red}-1{color} | {color:red} patch {color} | {color:red} 0m 7s {color} | {color:red} HBASE-17113 does not apply to branch-1. Rebase required? Wrong Branch? See https://yetus.apache.org/documentation/0.3.0/precommit-patchnames for help. {color} |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Patch URL | https://issues.apache.org/jira/secure/attachment/12839183/HBASE-17113-branch-1.patch |
| JIRA Issue | HBASE-17113 |
| Console output | https://builds.apache.org/job/PreCommit-HBASE-Build/4496/console |
| Powered by | Apache Yetus 0.3.0   http://yetus.apache.org |


This message was automatically generated.



> finding middle key in HFileV2 is always wrong and can cause IndexOutOfBoundsException 
> --------------------------------------------------------------------------------------
>
>                 Key: HBASE-17113
>                 URL: https://issues.apache.org/jira/browse/HBASE-17113
>             Project: HBase
>          Issue Type: Bug
>          Components: HFile
>    Affects Versions: 0.94.17, 2.0.0, 1.1.7, 0.98.23, 1.2.4
>            Reporter: Allan Yang
>            Assignee: Allan Yang
>         Attachments: HBASE-17113-branch-1.patch
>
>
> When we want to split a region, we need to get the middle rowkey from the biggest store file.
> Here is the code from HFileBlockIndex.midkey() which helps us find an approximate middle key.
> {code}
> // Caching, using pread, assuming this is not a compaction.
>         HFileBlock midLeafBlock = cachingBlockReader.readBlock(
>             midLeafBlockOffset, midLeafBlockOnDiskSize, true, true, false, true,
>             BlockType.LEAF_INDEX, null);
>         ByteBuffer b = midLeafBlock.getBufferWithoutHeader();
>         int numDataBlocks = b.getInt();
>         int keyRelOffset = b.getInt(Bytes.SIZEOF_INT * (midKeyEntry + 1));
>         int keyLen = b.getInt(Bytes.SIZEOF_INT * (midKeyEntry + 2)) -
>             keyRelOffset - SECONDARY_INDEX_ENTRY_OVERHEAD;
>         int keyOffset = Bytes.SIZEOF_INT * (numDataBlocks + 2) + keyRelOffset
>             + SECONDARY_INDEX_ENTRY_OVERHEAD;
>         targetMidKey = ByteBufferUtils.toBytes(b, keyOffset, keyLen);
> {code}
> Each entry of a non-root block index contains three objects:
> 1. Offset of the block referenced by this entry in the file (long)
> 2. On-disk size of the referenced block (int)
> 3. RowKey.
> But when we calculate the keyLen from the entry, we forget to subtract the 12-byte overhead (items 1 and 2 above, SECONDARY_INDEX_ENTRY_OVERHEAD in the code). So the keyLen is always 12 bytes bigger than the real rowkey length.
> Every time we read the rowkey from the entry, we also read 12 bytes from the next entry.
> No exception is thrown unless the middle key is in the last entry of the non-root block index, in which case the read runs past the end of the block and causes an IndexOutOfBoundsException. That is exactly what HBASE-16097 is suffering from.
> {code}
> 2016-11-16 05:27:31,991 ERROR [MemStoreFlusher.1] regionserver.MemStoreFlusher: Cache flusher failed for entry [flush region hitsdb,\x14\x03\x83\x1AX\x1A\x9A \x00\x00\x07\x00\x00\x07\x00\x00\x09\x00\x00\x09\x00\x01\x9F\x00F\xE3\x00\x00\x0A\x00\x01~\x00\x00\x08\x00\x5C\x09\x00\x03\x11\x00\xEF\x99,1478311873096.79d3f7f285396b6896f3229e2bcac7af.]
> java.lang.IndexOutOfBoundsException
>         at java.nio.Buffer.checkIndex(Buffer.java:532)
>         at java.nio.HeapByteBuffer.get(HeapByteBuffer.java:139)
>         at org.apache.hadoop.hbase.util.ByteBufferUtils.toBytes(ByteBufferUtils.java:490)
>         at org.apache.hadoop.hbase.io.hfile.HFileBlockIndex$BlockIndexReader.midkey(HFileBlockIndex.java:349)
>         at org.apache.hadoop.hbase.io.hfile.HFileReaderV2.midkey(HFileReaderV2.java:529)
>         at org.apache.hadoop.hbase.regionserver.StoreFile$Reader.midkey(StoreFile.java:1527)
>         at org.apache.hadoop.hbase.regionserver.StoreFile.getFileSplitPoint(StoreFile.java:684)
>         at org.apache.hadoop.hbase.regionserver.DefaultStoreFileManager.getSplitPoint(DefaultStoreFileManager.java:126)
>         at org.apache.hadoop.hbase.regionserver.HStore.getSplitPoint(HStore.java:1976)
>         at org.apache.hadoop.hbase.regionserver.RegionSplitPolicy.getSplitPoint(RegionSplitPolicy.java:82)
>         at org.apache.hadoop.hbase.regionserver.HRegion.checkSplit(HRegion.java:7614)
>         at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:521)
>         at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:471)
>         at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.access$800(MemStoreFlusher.java:75)
>         at org.apache.hadoop.hbase.regionserver.MemStoreFlusher$FlushHandler.run(MemStoreFlusher.java:259)
>         at java.lang.Thread.run(Thread.java:756)
> {code}
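The key-length arithmetic described above can be reproduced outside HBase. The following is only a sketch under stated assumptions: the class `MidKeyLengthSketch`, its `buildBlock`, `keyLen` and `readKey` helpers, and the sample keys are hypothetical, and the block layout (entry count, then one relative offset per entry plus an end offset, then the entries) is a simplification inferred from the index arithmetic in the quoted snippet, not the real HFile reader.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

class MidKeyLengthSketch {
    static final int SIZEOF_INT = 4;
    // 8-byte block offset (long) + 4-byte on-disk size (int) precede each key.
    static final int SECONDARY_INDEX_ENTRY_OVERHEAD = 8 + 4;

    // Builds a simplified block: entry count, (n + 1) relative offsets, then the entries.
    // ASCII keys only, so String.length() equals the key's byte length.
    static ByteBuffer buildBlock(String... keys) {
        int n = keys.length;
        int entriesSize = 0;
        for (String k : keys) {
            entriesSize += SECONDARY_INDEX_ENTRY_OVERHEAD + k.length();
        }
        ByteBuffer b = ByteBuffer.allocate(SIZEOF_INT * (n + 2) + entriesSize);
        b.putInt(n);
        int rel = 0;
        for (String k : keys) {
            b.putInt(rel);
            rel += SECONDARY_INDEX_ENTRY_OVERHEAD + k.length();
        }
        b.putInt(rel); // end offset of the last entry
        for (String k : keys) {
            b.putLong(0L); // placeholder block offset
            b.putInt(0);   // placeholder on-disk size
            b.put(k.getBytes(StandardCharsets.US_ASCII));
        }
        return b;
    }

    // Distance between two consecutive entry offsets, optionally minus the overhead.
    static int keyLen(ByteBuffer b, int entry, boolean subtractOverhead) {
        int keyRelOffset = b.getInt(SIZEOF_INT * (entry + 1));
        int nextRelOffset = b.getInt(SIZEOF_INT * (entry + 2));
        int len = nextRelOffset - keyRelOffset;
        return subtractOverhead ? len - SECONDARY_INDEX_ENTRY_OVERHEAD : len;
    }

    // Copies the key with absolute gets, which throw once the index reaches the limit.
    static byte[] readKey(ByteBuffer b, int entry, boolean subtractOverhead) {
        int numEntries = b.getInt(0);
        int keyRelOffset = b.getInt(SIZEOF_INT * (entry + 1));
        int keyOffset = SIZEOF_INT * (numEntries + 2) + keyRelOffset
            + SECONDARY_INDEX_ENTRY_OVERHEAD;
        byte[] out = new byte[keyLen(b, entry, subtractOverhead)];
        for (int i = 0; i < out.length; i++) {
            out[i] = b.get(keyOffset + i);
        }
        return out;
    }

    public static void main(String[] args) {
        ByteBuffer b = buildBlock("row-a", "row-bb", "row-ccc");
        System.out.println("with subtraction:    " + keyLen(b, 1, true));  // 6
        System.out.println("without subtraction: " + keyLen(b, 1, false)); // 18
        try {
            readKey(b, 2, false); // last entry: the extra 12 bytes do not exist
        } catch (IndexOutOfBoundsException e) {
            System.out.println("last entry overruns the block");
        }
    }
}
```

For a middle entry the oversized length silently picks up the next entry's offset and size fields; only the last entry has nothing after it to read, which matches the failure mode in the stack trace.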
> It is quite a serious bug. It may have existed ever since HFileV2 was invented, but no one has found it. This bug ONLY happens when finding a middle key, and since rowkeys are compared from the left side, having 12 extra bytes on the right side goes unnoticed.
> It would not even throw an IndexOutOfBoundsException before HBASE-12297, because {{Arrays.copyOfRange}} was used, which checks the limit to ensure the copy does not run past the end of the array.
> But now {{ByteBufferUtils.toBytes}} is used, and an IndexOutOfBoundsException is thrown.
> It happened in our production environment. Because of this bug, the region can't be split and grows bigger and bigger.
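The difference between the two copy paths mentioned above can be seen with plain JDK calls. This is a sketch, not HBase code: `OverrunBehaviorSketch`, `copyWithArrays` and `copyWithBuffer` are hypothetical names, and the buffer variant only mimics what the stack trace suggests `ByteBufferUtils.toBytes` does (absolute `get(index)` calls). `Arrays.copyOfRange` is documented to zero-pad when the end index exceeds the array length, while an absolute `ByteBuffer.get` throws once the index reaches the limit.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

class OverrunBehaviorSketch {
    // Array path: an end index past the array length is padded with zeros.
    static byte[] copyWithArrays(byte[] src, int offset, int len) {
        return Arrays.copyOfRange(src, offset, offset + len);
    }

    // Buffer path: each absolute get is bounds-checked against the limit.
    static byte[] copyWithBuffer(ByteBuffer buf, int offset, int len) {
        byte[] out = new byte[len];
        for (int i = 0; i < len; i++) {
            out[i] = buf.get(offset + i);
        }
        return out;
    }

    public static void main(String[] args) {
        byte[] src = {1, 2, 3, 4, 5, 6, 7, 8};
        // Ask for 8 bytes starting at offset 4, although only 4 remain.
        System.out.println(Arrays.toString(copyWithArrays(src, 4, 8)));
        try {
            copyWithBuffer(ByteBuffer.wrap(src), 4, 8);
        } catch (IndexOutOfBoundsException e) {
            System.out.println("ByteBuffer read past the limit: " + e);
        }
    }
}
```

The array path hides an oversized length behind trailing zero bytes, which is consistent with the bug staying unnoticed until the reader switched to ByteBuffer access.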



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)