You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@trafodion.apache.org by "Suresh Subbiah (JIRA)" <ji...@apache.org> on 2015/10/22 00:48:27 UTC

[jira] [Assigned] (TRAFODION-1550) Bulk load performance can be improved by increasing HBASE_ROWSET_VSBB_SIZE

     [ https://issues.apache.org/jira/browse/TRAFODION-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Suresh Subbiah reassigned TRAFODION-1550:
-----------------------------------------

    Assignee: Suresh Subbiah

> Bulk load performance can be improved by increasing HBASE_ROWSET_VSBB_SIZE
> --------------------------------------------------------------------------
>
>                 Key: TRAFODION-1550
>                 URL: https://issues.apache.org/jira/browse/TRAFODION-1550
>             Project: Apache Trafodion
>          Issue Type: Improvement
>          Components: sql-general
>            Reporter: Suresh Subbiah
>            Assignee: Suresh Subbiah
>             Fix For: 1.2-incubating
>
>
> Bulk load flushes rows to HFile in batches of size HBASE_ROWSET_VSBB_SIZE. The default value for this cqd is 1024. Aflush size of 1024 rows is small, particularly for narrow tables like TPC-H lineitem (~150 bytes per row).
> Increasing the flush size to 10,000 or 20,000 rows caused performance to improve by more than 100%. Please configure code to determine a more ideal flush size for a given table.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)