Posted to issues@hbase.apache.org by "haosdent (JIRA)" <ji...@apache.org> on 2014/02/19 12:31:21 UTC

[jira] [Updated] (HBASE-9537) completebulkload does 'copy' StoreFiles instead of 'cut'

     [ https://issues.apache.org/jira/browse/HBASE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

haosdent updated HBASE-9537:
----------------------------

    Attachment: HBASE-9537.patch

> completebulkload does 'copy' StoreFiles instead of 'cut'
> --------------------------------------------------------
>
>                 Key: HBASE-9537
>                 URL: https://issues.apache.org/jira/browse/HBASE-9537
>             Project: HBase
>          Issue Type: Bug
>          Components: HFile, mapreduce, regionserver
>    Affects Versions: 0.94.11
>            Reporter: M. BagherEsmaeily
>         Attachments: HBASE-9537.patch, LoadIncrementalHFiles.log, region.log
>
>
> I was using the HBase completebulkload tool to transfer the output of ImportTsv to a table in HBase, and I noticed that it copies the output files instead of moving them. This takes a long time for my gigabytes of data.
> The HBase documentation (http://hbase.apache.org/book/ops_mgt.html#completebulkload) says that the files are moved, not copied. Can anyone help me with this?
> I use HBase 0.94.11 and Hadoop 1.2.1. The bulk load output directory and the HBase cluster are on the same file system.
> I have also written a MapReduce job that uses HFileOutputFormat. When I use LoadIncrementalHFiles to move the output of that job into an HBase table, it still copies the files instead of moving them.
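
For readers unfamiliar with why the distinction matters, here is a minimal sketch of move versus copy on a single file system. It is a local-disk analogy written with the Python standard library, not HBase or HDFS code: the function name `move_vs_copy`, the directory names, and the payload are all hypothetical. It assumes only that a rename within one file system relinks the existing file, while a copy rewrites every byte into a new file.

```python
import os
import shutil
import tempfile


def move_vs_copy(payload: bytes = b"hfile-bytes") -> dict:
    """Compare a copy and a rename inside one temporary directory.

    Local-file-system analogy only; directory and file names are hypothetical.
    """
    with tempfile.TemporaryDirectory() as root:
        staging = os.path.join(root, "staging")  # stands in for the bulk load output dir
        table = os.path.join(root, "table")      # stands in for the table's store dir
        os.makedirs(staging)
        os.makedirs(table)

        src = os.path.join(staging, "storefile")
        with open(src, "wb") as f:
            f.write(payload)
        src_inode = os.stat(src).st_ino

        # Copy: the bytes are written again into a new file; the source remains.
        copied = os.path.join(table, "copied")
        shutil.copyfile(src, copied)
        copy_kept_source = os.path.exists(src)
        copy_same_inode = os.stat(copied).st_ino == src_inode

        # Move on the same file system: a rename, so no data is rewritten.
        moved = os.path.join(table, "moved")
        os.rename(src, moved)
        move_kept_source = os.path.exists(src)
        move_same_inode = os.stat(moved).st_ino == src_inode

        return {
            "copy_kept_source": copy_kept_source,
            "copy_same_inode": copy_same_inode,
            "move_kept_source": move_kept_source,
            "move_same_inode": move_same_inode,
        }


if __name__ == "__main__":
    for key, value in move_vs_copy().items():
        print(key, value)
```

The cost of the copy grows with the file size, whereas the rename does not touch the data, which is why a copy is noticeable for gigabytes of StoreFiles.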



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)