You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@hbase.apache.org by "Yi Liang (JIRA)" <ji...@apache.org> on 2017/04/11 22:39:42 UTC

[jira] [Updated] (HBASE-17905) [hbase-spark] bulkload does not work when table not exist

     [ https://issues.apache.org/jira/browse/HBASE-17905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yi Liang updated HBASE-17905:
-----------------------------
    Attachment: HBASE-17905-V1.patch

The reason why bulkload fail when table not exist is the {code}BulkLoadPatitioner#numPartitions {code} will return 0 if table not exist, if partition number equals 0, it means that spark repartitionAndSortWithinPartitions(Partitioner) will return nothing, and the following transformation will not be executed

This patch fix the errors for bulkload fail when table not exist, and also add some log information, I wonder if we can also add a BulkLoad API that do not have tablename as parameter 

> [hbase-spark]  bulkload does not work when table not exist
> ----------------------------------------------------------
>
>                 Key: HBASE-17905
>                 URL: https://issues.apache.org/jira/browse/HBASE-17905
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Yi Liang
>            Assignee: Yi Liang
>         Attachments: HBASE-17905-V1.patch
>
>
> when using HBase-Spark bulkload api, an argument of tablename is needed, the bulkload can run successfully only if  table exist in HBase.  If table not exist, the bulkload can not run successfully and it even do not report any errors or throw exception. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)