Posted to dev@hbase.apache.org by "Sean Busbey (JIRA)" <ji...@apache.org> on 2015/09/06 01:04:46 UTC
[jira] [Resolved] (HBASE-14339) HBase Bulk Load and super wide rows
[ https://issues.apache.org/jira/browse/HBASE-14339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Busbey resolved HBASE-14339.
---------------------------------
Resolution: Duplicate
> HBase Bulk Load and super wide rows
> -----------------------------------
>
> Key: HBASE-14339
> URL: https://issues.apache.org/jira/browse/HBASE-14339
> Project: HBase
> Issue Type: Bug
> Reporter: Ted Malaska
> Priority: Minor
>
> This may not be a huge issue, but it does come up. If a row has too many columns, KeyValueSortReducer will blow up with an out-of-memory exception, because it uses a TreeMap to sort the columns within the memory of the reducer.
> A solution would be to add the column family and qualifier to the key so the shuffle would handle the sort.
> The partitioner would partition only on the row key, but ordering would apply to the row key, column family, and column qualifier.
> See the Spark bulk load (HBASE-14150) for an example.
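
The approach described above (composite key, partition on the row key only, sort on the full key) can be sketched in plain Java with no Hadoop or HBase dependency. This is only an illustration under stated assumptions: the names CompositeKeySketch, CellKey, and partition are hypothetical, the hash partitioning is a simplification (a real bulk load job would partition by region boundaries), and a full cell ordering would also consider timestamp and type, which are omitted here.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class CompositeKeySketch {

    /** Hypothetical composite shuffle key: row key, column family, column qualifier. */
    static final class CellKey implements Comparable<CellKey> {
        final byte[] row;
        final byte[] family;
        final byte[] qualifier;

        CellKey(String row, String family, String qualifier) {
            this.row = row.getBytes(StandardCharsets.UTF_8);
            this.family = family.getBytes(StandardCharsets.UTF_8);
            this.qualifier = qualifier.getBytes(StandardCharsets.UTF_8);
        }

        /** Orders by row, then family, then qualifier, using unsigned byte order (Java 9+). */
        @Override
        public int compareTo(CellKey other) {
            int c = Arrays.compareUnsigned(row, other.row);
            if (c != 0) {
                return c;
            }
            c = Arrays.compareUnsigned(family, other.family);
            if (c != 0) {
                return c;
            }
            return Arrays.compareUnsigned(qualifier, other.qualifier);
        }

        @Override
        public String toString() {
            return new String(row, StandardCharsets.UTF_8) + "/"
                + new String(family, StandardCharsets.UTF_8) + ":"
                + new String(qualifier, StandardCharsets.UTF_8);
        }
    }

    /** Partitions on the row key only, so every cell of a row lands in the same partition. */
    static int partition(CellKey key, int numPartitions) {
        return (Arrays.hashCode(key.row) & Integer.MAX_VALUE) % numPartitions;
    }

    /** Stands in for the shuffle sort: returns the keys in full composite-key order. */
    static List<String> sortedKeys(List<CellKey> keys) {
        List<CellKey> copy = new ArrayList<>(keys);
        Collections.sort(copy);
        List<String> out = new ArrayList<>();
        for (CellKey k : copy) {
            out.add(k.toString());
        }
        return out;
    }

    public static void main(String[] args) {
        List<CellKey> keys = Arrays.asList(
            new CellKey("row2", "cf", "a"),
            new CellKey("row1", "cf", "b"),
            new CellKey("row1", "cf", "a"));
        for (String k : sortedKeys(keys)) {
            System.out.println(k);
        }
    }
}
```

Because the keys would arrive at the reducer already ordered by row, family, and qualifier, the reducer could in principle write each cell as it arrives instead of buffering an entire row in memory.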
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)