Posted to issues@hbase.apache.org by "jiraposter@reviews.apache.org (JIRA)" <ji...@apache.org> on 2011/07/30 20:42:11 UTC

[jira] [Commented] (HBASE-4148) HFileOutputFormat doesn't fill in TIMERANGE_KEY metadata

    [ https://issues.apache.org/jira/browse/HBASE-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13073230#comment-13073230 ] 

jiraposter@reviews.apache.org commented on HBASE-4148:
------------------------------------------------------


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1229/
-----------------------------------------------------------

Review request for hbase and Todd Lipcon.


Summary
-------

When HFiles are flushed through the normal path, they include an attribute TIMERANGE_KEY which can be used to cull HFiles when performing a time-restricted scan. Files produced by HFileOutputFormat are currently missing this metadata.
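
As an illustration of how such metadata can be used, here is a minimal sketch of time-range culling. It is not the HBase implementation: the class TimeRangeSketch and its methods include and mayContain are hypothetical names, and the scan range is assumed to be half-open, [scanMin, scanMax).

```java
// Hypothetical sketch of time-range tracking and culling; not HBase code.
public class TimeRangeSketch {
    // Smallest and largest timestamps seen while writing a file.
    private long min = Long.MAX_VALUE;
    private long max = Long.MIN_VALUE;

    // Called once per cell as it is written to the file.
    public void include(long timestamp) {
        min = Math.min(min, timestamp);
        max = Math.max(max, timestamp);
    }

    // Returns true if the file may hold cells inside [scanMin, scanMax).
    // Assumption: an empty tracker means nothing was written, so nothing can match.
    public boolean mayContain(long scanMin, long scanMax) {
        if (min > max) {
            return false; // no timestamps were recorded
        }
        return min < scanMax && max >= scanMin;
    }

    public static void main(String[] args) {
        TimeRangeSketch range = new TimeRangeSketch();
        range.include(1000L);
        range.include(2000L);
        // A scan entirely before the file's range can skip the file.
        System.out.println(range.mayContain(0L, 500L));
        // A scan overlapping the file's range must read it.
        System.out.println(range.mayContain(1500L, 3000L));
    }
}
```

A file that carries no recorded range cannot be ruled out this way and has to be read for every time-restricted scan, which is the cost of the missing metadata described above.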


This addresses bug HBASE-4148.
    https://issues.apache.org/jira/browse/HBASE-4148


Diffs
-----

  src/main/java/org/apache/hadoop/hbase/mapreduce/HFileOutputFormat.java 8ccdf4d 
  src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java 40efdda 
  src/test/java/org/apache/hadoop/hbase/mapreduce/TestHFileOutputFormat.java 89241eb 

Diff: https://reviews.apache.org/r/1229/diff


Testing
-------

Added unit test.  

I don't quite understand why the KeyValue with the larger timestamp (2000) must be written before the one with the smaller timestamp (1000). I can see the code that enforces this (HFile.checkKey), but not why keys are ordered from larger to smaller timestamp. Is this a precondition on HFile data?
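
A minimal sketch of the ordering observed here, assuming keys compare by row ascending and then by timestamp descending, so that the newest version of a cell sorts first. The class KeyOrderSketch, its simplified Key type (row and timestamp only), and isValidAppendOrder are hypothetical stand-ins for illustration; they are not the KeyValue comparator or HFile.checkKey themselves.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch of the key ordering; not HBase code.
public class KeyOrderSketch {
    // Simplified key: real keys also carry family, qualifier and type.
    public static final class Key {
        final String row;
        final long timestamp;

        public Key(String row, long timestamp) {
            this.row = row;
            this.timestamp = timestamp;
        }
    }

    // Rows ascending; within a row, timestamps descending (newest first).
    public static int compare(Key a, Key b) {
        int byRow = a.row.compareTo(b.row);
        if (byRow != 0) {
            return byRow;
        }
        return Long.compare(b.timestamp, a.timestamp);
    }

    // Mimics an append-time check: each key must not sort before its predecessor.
    public static boolean isValidAppendOrder(List<Key> keys) {
        for (int i = 1; i < keys.size(); i++) {
            if (compare(keys.get(i - 1), keys.get(i)) > 0) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Key newer = new Key("row", 2000L);
        Key older = new Key("row", 1000L);
        // Timestamp 2000 sorts before 1000, so it has to be appended first.
        System.out.println(isValidAppendOrder(Arrays.asList(newer, older)));
        System.out.println(isValidAppendOrder(Arrays.asList(older, newer)));
    }
}
```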

I cannot get the full test suite to pass, with or without this patch.  The suite seems to time out on tests unrelated to this change.  I would appreciate hints or pointers on which tests are flaky or take a long time to run.


Thanks,

jmhsieh



> HFileOutputFormat doesn't fill in TIMERANGE_KEY metadata
> --------------------------------------------------------
>
>                 Key: HBASE-4148
>                 URL: https://issues.apache.org/jira/browse/HBASE-4148
>             Project: HBase
>          Issue Type: Bug
>          Components: mapreduce
>    Affects Versions: 0.90.3
>            Reporter: Todd Lipcon
>            Assignee: Jonathan Hsieh
>             Fix For: 0.90.5
>
>         Attachments: 0001-HBASE-4148.-HFileOutputFormat-doesn-t-fill-in-TIMERA.patch
>
>
> When HFiles are flushed through the normal path, they include an attribute TIMERANGE_KEY which can be used to cull HFiles when performing a time-restricted scan. Files produced by HFileOutputFormat are currently missing this metadata.
