You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2021/01/28 01:21:00 UTC

[jira] [Resolved] (IMPALA-3777) SqlParser parsed error for unicode

     [ https://issues.apache.org/jira/browse/IMPALA-3777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Quanlong Huang resolved IMPALA-3777.
------------------------------------
    Resolution: Cannot Reproduce

> SqlParser parsed error for unicode
> ----------------------------------
>
>                 Key: IMPALA-3777
>                 URL: https://issues.apache.org/jira/browse/IMPALA-3777
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Frontend
>    Affects Versions: Impala 2.2.4
>         Environment: CentOS 6.7 64 bit. impalad version 2.7.0-cdh5-INTERNAL DEBUG
>            Reporter: Yuanhao Luo
>            Priority: Minor
>              Labels: correctness, downgraded
>         Attachments: After calling SqlParser.parse.JPG, Before calling SqlParser.parse.JPG
>
>
> When I run query:create table unicode_parse_error(id int) row format delimited fields terminated by '\u0023##'; the field delimiter becomes to '\u0017##'.
> Logs:
> {noformat}
> [nobida147:21000] > create table unicode_parse_error(id int) row format delimited fields terminated by '\u0023##';
> Query: create table unicode_parse_error(id int) row format delimited fields terminated by '\u0023##'
> Fetched 0 row(s) in 242.44s
> [nobida147:21000] > describe extended unicode_parse_error;
> Query: describe extended unicode_parse_error
> +------------------------------+------------------------------------------------------------------+----------------------+
> | name                         | type                                                             | comment              |
> +------------------------------+------------------------------------------------------------------+----------------------+
> | # col_name                   | data_type                                                        | comment              |
> |                              | NULL                                                             | NULL                 |
> | id                           | int                                                              | NULL                 |
> |                              | NULL                                                             | NULL                 |
> | # Detailed Table Information | NULL                                                             | NULL                 |
> | Database:                    | db1                                                              | NULL                 |
> | Owner:                       | root                                                             | NULL                 |
> | CreateTime:                  | Thu Jun 23 15:54:20 CST 2016                                     | NULL                 |
> | LastAccessTime:              | UNKNOWN                                                          | NULL                 |
> | Protect Mode:                | None                                                             | NULL                 |
> | Retention:                   | 0                                                                | NULL                 |
> | Location:                    | hdfs://localhost:20500/test-warehouse/db1.db/unicode_parse_error | NULL                 |
> | Table Type:                  | MANAGED_TABLE                                                    | NULL                 |
> | Table Parameters:            | NULL                                                             | NULL                 |
> |                              | transient_lastDdlTime                                            | 1466668460           |
> |                              | NULL                                                             | NULL                 |
> | # Storage Information        | NULL                                                             | NULL                 |
> | SerDe Library:               | org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe               | NULL                 |
> | InputFormat:                 | org.apache.hadoop.mapred.TextInputFormat                         | NULL                 |
> | OutputFormat:                | org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat       | NULL                 |
> | Compressed:                  | No                                                               | NULL                 |
> | Num Buckets:                 | 0                                                                | NULL                 |
> | Bucket Columns:              | []                                                               | NULL                 |
> | Sort Columns:                | []                                                               | NULL                 |
> | Storage Desc Params:         | NULL                                                             | NULL                 |
> |                              | field.delim                                                      | \u0017##             |
> |                              | serialization.format                                             | \u0017##             |
> +------------------------------+------------------------------------------------------------------+----------------------+
> Fetched 27 row(s) in 4.77s
> {noformat}
> After debugging, it seems that SqlParser.parse() goes wrong. As attachment shows, before calling SqlParse.parse() the statement is: fields terminated by '\u0023##' , but after parsing, it becomes '\u0017##'



--
This message was sent by Atlassian Jira
(v8.3.4#803005)