You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2021/01/28 01:21:00 UTC
[jira] [Resolved] (IMPALA-3777) SqlParser parsed error for unicode
[ https://issues.apache.org/jira/browse/IMPALA-3777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Quanlong Huang resolved IMPALA-3777.
------------------------------------
Resolution: Cannot Reproduce
> SqlParser parsed error for unicode
> ----------------------------------
>
> Key: IMPALA-3777
> URL: https://issues.apache.org/jira/browse/IMPALA-3777
> Project: IMPALA
> Issue Type: Bug
> Components: Frontend
> Affects Versions: Impala 2.2.4
> Environment: CentOS 6.7 64 bit. impalad version 2.7.0-cdh5-INTERNAL DEBUG
> Reporter: Yuanhao Luo
> Priority: Minor
> Labels: correctness, downgraded
> Attachments: After calling SqlParser.parse.JPG, Before calling SqlParser.parse.JPG
>
>
> When I run query:create table unicode_parse_error(id int) row format delimited fields terminated by '\u0023##'; the field delimiter becomes to '\u0017##'.
> Logs:
> {noformat}
> [nobida147:21000] > create table unicode_parse_error(id int) row format delimited fields terminated by '\u0023##';
> Query: create table unicode_parse_error(id int) row format delimited fields terminated by '\u0023##'
> Fetched 0 row(s) in 242.44s
> [nobida147:21000] > describe extended unicode_parse_error;
> Query: describe extended unicode_parse_error
> +------------------------------+------------------------------------------------------------------+----------------------+
> | name | type | comment |
> +------------------------------+------------------------------------------------------------------+----------------------+
> | # col_name | data_type | comment |
> | | NULL | NULL |
> | id | int | NULL |
> | | NULL | NULL |
> | # Detailed Table Information | NULL | NULL |
> | Database: | db1 | NULL |
> | Owner: | root | NULL |
> | CreateTime: | Thu Jun 23 15:54:20 CST 2016 | NULL |
> | LastAccessTime: | UNKNOWN | NULL |
> | Protect Mode: | None | NULL |
> | Retention: | 0 | NULL |
> | Location: | hdfs://localhost:20500/test-warehouse/db1.db/unicode_parse_error | NULL |
> | Table Type: | MANAGED_TABLE | NULL |
> | Table Parameters: | NULL | NULL |
> | | transient_lastDdlTime | 1466668460 |
> | | NULL | NULL |
> | # Storage Information | NULL | NULL |
> | SerDe Library: | org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe | NULL |
> | InputFormat: | org.apache.hadoop.mapred.TextInputFormat | NULL |
> | OutputFormat: | org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat | NULL |
> | Compressed: | No | NULL |
> | Num Buckets: | 0 | NULL |
> | Bucket Columns: | [] | NULL |
> | Sort Columns: | [] | NULL |
> | Storage Desc Params: | NULL | NULL |
> | | field.delim | \u0017## |
> | | serialization.format | \u0017## |
> +------------------------------+------------------------------------------------------------------+----------------------+
> Fetched 27 row(s) in 4.77s
> {noformat}
> After debugging, it seems that SqlParser.parse() goes wrong. As attachment shows, before calling SqlParse.parse() the statement is: fields terminated by '\u0023##' , but after parsing, it becomes '\u0017##'
--
This message was sent by Atlassian Jira
(v8.3.4#803005)