You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Yubao Liu (Jira)" <ji...@apache.org> on 2020/12/04 04:14:00 UTC
[jira] [Created] (HBASE-25357) allow specifying binary row key
range to pre-split regions
Yubao Liu created HBASE-25357:
---------------------------------
Summary: allow specifying binary row key range to pre-split regions
Key: HBASE-25357
URL: https://issues.apache.org/jira/browse/HBASE-25357
Project: HBase
Issue Type: Improvement
Components: spark
Reporter: Yubao Liu
Currently, spark hbase connector use `String` to specify regionStart and regionEnd, but we often have serialized binary row key, I made a little patch at [https://github.com/apache/hbase-connectors/pull/72/files] to always treat the `String` in ISO_8859_1, so we can put raw bytes into the String object and get it unchanged.
This has a drawback, if your row key is really UTF-8 strings, you should convert it to UTF-8 encoded bytes and then encapsulate it in ISO_8859_1 string. This is a limitation of Spark option interface which allows only string to string map.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)