Posted to dev@hbase.apache.org by "Josh Elser (JIRA)" <ji...@apache.org> on 2017/05/17 21:43:04 UTC
[jira] [Created] (HBASE-18067) Support a default converter for data read shell commands
Josh Elser created HBASE-18067:
----------------------------------
Summary: Support a default converter for data read shell commands
Key: HBASE-18067
URL: https://issues.apache.org/jira/browse/HBASE-18067
Project: HBase
Issue Type: Improvement
Components: shell
Reporter: Josh Elser
Assignee: Josh Elser
Priority: Minor
Fix For: 2.0.0
The {{get}} and {{scan}} shell commands accept a fairly complicated syntax for specifying, on a per-column basis, how the bytes read from HBase are converted for display. By default, bytes falling outside of a limited range of ASCII are printed as hex.
It seems like the intent of these converters was to support rendering certain numeric columns as a readable string (e.g. 1234).
However, if non-ASCII bytes are stored in the table (e.g. UTF-8 encoded text), we may want to treat all data we read as UTF-8 instead (e.g. if row+column+value are in Chinese). It would be onerous to require users to enumerate every column they're reading just to have it parsed as UTF-8 instead of the limited ASCII range. We can provide an option that applies one converter to all values retrieved by the command.
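
To make the idea concrete, here is a minimal Python sketch of the behavior being proposed. It is not HBase code: {{to_string_binary}}, {{to_utf8}}, {{to_int}} and {{format_cell}} are hypothetical names, and the "printable ASCII" test only approximates the shell's actual default rendering.

```python
# Illustrative sketch only; none of these helpers are HBase APIs.

def to_string_binary(data: bytes) -> str:
    """Approximate the default rendering: printable ASCII is kept,
    every other byte (and the backslash itself) becomes \\xNN."""
    out = []
    for b in data:
        if 0x20 <= b <= 0x7E and b != 0x5C:
            out.append(chr(b))
        else:
            out.append("\\x%02X" % b)
    return "".join(out)


def to_utf8(data: bytes) -> str:
    """Decode the bytes as UTF-8, replacing malformed sequences."""
    return data.decode("utf-8", errors="replace")


def to_int(data: bytes) -> str:
    """Render a big-endian signed integer as a decimal string."""
    return str(int.from_bytes(data, "big", signed=True))


def format_cell(column, value, converters=None, default_converter=to_string_binary):
    """Use the per-column converter if one was given; otherwise fall
    back to the command-wide default converter."""
    converters = converters or {}
    convert = converters.get(column, default_converter)
    return convert(value)


if __name__ == "__main__":
    name = "中文".encode("utf-8")
    count = (1234).to_bytes(4, "big")

    # Current behavior: UTF-8 bytes are shown as hex escapes.
    print(format_cell("cf:name", name))

    # Proposed behavior: one default converter for the whole command,
    # while a per-column converter still takes precedence.
    per_column = {"cf:count": to_int}
    print(format_cell("cf:name", name, per_column, default_converter=to_utf8))
    print(format_cell("cf:count", count, per_column, default_converter=to_utf8))
```

In this sketch the per-column converter wins over the default, so existing per-column usage would keep working unchanged; whether the real option should behave that way is a design choice for the patch.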
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)