You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "haosdent (JIRA)" <ji...@apache.org> on 2014/02/19 12:31:21 UTC
[jira] [Updated] (HBASE-9537) completebulkload does 'copy'
StoreFiles instead of 'cut'
[ https://issues.apache.org/jira/browse/HBASE-9537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
haosdent updated HBASE-9537:
----------------------------
Attachment: HBASE-9537.patch
> completebulkload does 'copy' StoreFiles instead of 'cut'
> --------------------------------------------------------
>
> Key: HBASE-9537
> URL: https://issues.apache.org/jira/browse/HBASE-9537
> Project: HBase
> Issue Type: Bug
> Components: HFile, mapreduce, regionserver
> Affects Versions: 0.94.11
> Reporter: M. BagherEsmaeily
> Attachments: HBASE-9537.patch, LoadIncrementalHFiles.log, region.log
>
>
> I was using HBase complete bulk load to transfer the output of ImportTsv to a table in HBase, and I noticed that it copies the output instead of cutting. This takes long time for my gigabytes of data.
> In HBase documentation (http://hbase.apache.org/book/ops_mgt.html#completebulkload) I read that the files would be moved not copied. Can anyone help me with this?
> I use Hbase 0.94.11 and Hadoop 1.2.1. The file system of bulkload output directory and hbase cluster are the same, too.
> I've also coded a MapReduce job using HFileOutputFormat. When I use LoadIncrementalHFiles to move the output of my job to HBase table, it still copies instead of cut.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)