You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Paul Rogers (JIRA)" <ji...@apache.org> on 2017/09/28 23:34:00 UTC
[jira] [Created] (DRILL-5825) HBase storage plugin assumes local
system encoding for column names
Paul Rogers created DRILL-5825:
----------------------------------
Summary: HBase storage plugin assumes local system encoding for column names
Key: DRILL-5825
URL: https://issues.apache.org/jira/browse/DRILL-5825
Project: Apache Drill
Issue Type: Bug
Affects Versions: 1.11.0
Reporter: Paul Rogers
The HBase storage plugin has the ability to expand wildcard columns at plan time as part of projection push-down. HBase column names are encoded in binary. Drill decodes the names using the default character set of the machine running Drill. While this works fine for ASCII names, there is no reason to believe it will work for non-ASCII names if the machine running Drill is different than the machine on which the column name was defined.
Offending code:
{code}
public class DrillHBaseTable extends DrillTable implements DrillHBaseConstants {
...
fieldNameList.add(Bytes.toString(family));
{code}
From {{String}} Javadoc:
bq. String(byte[] bytes): Constructs a new String by decoding the specified array of bytes using the *platform's* default charset.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)