Posted to issues@hbase.apache.org by "Jonathan Natkins (JIRA)" <ji...@apache.org> on 2013/05/10 06:15:15 UTC

[jira] [Updated] (HBASE-8521) Cells cannot be overwritten with bulk loaded HFiles

     [ https://issues.apache.org/jira/browse/HBASE-8521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jonathan Natkins updated HBASE-8521:
------------------------------------

    Attachment: hfileDirs.tar.gz

There are two directories in this tarball: familyDir1 and familyDir2. Each contains a single HFile, and each HFile holds one cell of data.

The table was created as:
create 'test', {NAME => 'myfam', VERSIONS => 100000, TTL => 1000000000}

In familyDir1, the HFile's cell contains the value "oldVal" for myfam:myqual.

In familyDir2, the HFile's cell contains the value "newVal" for myfam:myqual.
                
> Cells cannot be overwritten with bulk loaded HFiles
> ---------------------------------------------------
>
>                 Key: HBASE-8521
>                 URL: https://issues.apache.org/jira/browse/HBASE-8521
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.92.1
>            Reporter: Jonathan Natkins
>         Attachments: hfileDirs.tar.gz
>
>
> Let's say you have a pre-built HFile that contains a cell:
> ('rowkey1', 'family1', 'qual1', 1234L, 'value1')
> We bulk load this first HFile. Now, let's create a second HFile that contains a cell that overwrites the first:
> ('rowkey1', 'family1', 'qual1', 1234L, 'value2')
> That gets bulk loaded into the table, but the value that HBase bubbles up is still 'value1'.
> It seems that there's no way to overwrite a cell for a particular timestamp without an explicit put operation. This seems to be the case even after minor and major compactions happen.
> My guess is that this is pretty closely related to the sequence number work being done on the compaction algorithm via HBASE-7842, but I'm not sure whether fixing one would fix the other.
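
The behavior described above can be illustrated with a toy model. This is only a sketch, not HBase code: the StoreFile class, the seq_id field, and read_cell are hypothetical names, and the tie-breaking rule (highest sequence number wins, otherwise the earliest-loaded file is kept) is an assumption chosen to mirror the reported symptom. It shows why two cells with identical row, family, qualifier, and timestamp need some extra ordering information before the newer one can win.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# (row, family, qualifier, timestamp): the full coordinates of a cell
CellKey = Tuple[str, str, str, int]


@dataclass
class StoreFile:
    """Hypothetical stand-in for an HFile that has been loaded into a store."""
    name: str
    seq_id: int                  # assumed ordering number; higher means newer
    cells: Dict[CellKey, str]    # cell coordinates -> value


def read_cell(files: List[StoreFile], key: CellKey) -> Optional[str]:
    """Return the value from the file with the highest seq_id that holds key.

    When seq_ids are equal, the file seen first is kept (an assumption of
    this sketch, not a statement about HBase internals).
    """
    best: Optional[StoreFile] = None
    for f in files:
        if key in f.cells and (best is None or f.seq_id > best.seq_id):
            best = f
    return None if best is None else best.cells[key]


key: CellKey = ("rowkey1", "family1", "qual1", 1234)

# Case 1: both bulk-loaded files carry the same placeholder sequence number,
# so nothing distinguishes the second cell from the first.
untagged = [
    StoreFile("hfile1", -1, {key: "value1"}),
    StoreFile("hfile2", -1, {key: "value2"}),
]

# Case 2: each file gets an increasing sequence number at load time,
# so the later file can shadow the earlier one.
tagged = [
    StoreFile("hfile1", 1, {key: "value1"}),
    StoreFile("hfile2", 2, {key: "value2"}),
]

result_untagged = read_cell(untagged, key)
result_tagged = read_cell(tagged, key)
```

In this model the untagged case keeps returning the first file's value, while the tagged case returns the second file's value. Whether assigning sequence numbers at bulk load time is the right fix for HBase itself is a separate question for the issue.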
