You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Andrew Purtell (JIRA)" <ji...@apache.org> on 2015/08/13 04:04:46 UTC
[jira] [Resolved] (HBASE-13985) Add configuration to skip
validating HFile format when bulk loading
[ https://issues.apache.org/jira/browse/HBASE-13985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrew Purtell resolved HBASE-13985.
------------------------------------
Resolution: Fixed
Hadoop Flags: Reviewed
Fix Version/s: 1.3.0
1.2.0
2.0.0
Pushed to 0.98, branch-1.2, branch-1, and master. Bulk load units pass on all branches.
> Add configuration to skip validating HFile format when bulk loading
> -------------------------------------------------------------------
>
> Key: HBASE-13985
> URL: https://issues.apache.org/jira/browse/HBASE-13985
> Project: HBase
> Issue Type: Improvement
> Affects Versions: 0.98.13
> Reporter: Victor Xu
> Assignee: Victor Xu
> Priority: Minor
> Labels: regionserver
> Fix For: 2.0.0, 0.98.14, 1.2.0, 1.3.0
>
> Attachments: HBASE-13985-v2.patch, HBASE-13985-v3.patch, HBASE-13985.patch
>
>
> When bulk loading millions of HFile into one HTable, checking HFile format is the most time-consuming phase. Maybe we could use a parallel mechanism to increase the speed, but when it comes to millions of HFiles, it may still cost dozens of minutes. So I think it's necessary to add an option for advanced user to bulkload without checking HFile format at all.
> Of course, the default value of this option should be true.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)